Distributed Multi-modal Similarity Retrieval
David Novak Seminar of DISA Lab, October 14, 2014
David Novak Multi-modal Similarity Retrieval DISA Seminar 1 / 17
Distributed Multi-modal Similarity Retrieval David Novak Seminar of - - PowerPoint PPT Presentation
Distributed Multi-modal Similarity Retrieval David Novak Seminar of DISA Lab, October 14, 2014 David Novak Multi-modal Similarity Retrieval DISA Seminar 1 / 17 Outline of the Talk Motivation 1 Similarity Search E ff ectiveness and E ffi
David Novak Multi-modal Similarity Retrieval DISA Seminar 1 / 17
David Novak Multi-modal Similarity Retrieval DISA Seminar 2 / 17
Motivation Similarity Search
David Novak Multi-modal Similarity Retrieval DISA Seminar 3 / 17
Motivation Similarity Search
David Novak Multi-modal Similarity Retrieval DISA Seminar 3 / 17
Motivation Similarity Search
David Novak Multi-modal Similarity Retrieval DISA Seminar 3 / 17
Motivation Similarity Search
David Novak Multi-modal Similarity Retrieval DISA Seminar 3 / 17
Motivation Similarity Search
David Novak Multi-modal Similarity Retrieval DISA Seminar 3 / 17
Motivation Effectiveness and Efficiency
David Novak Multi-modal Similarity Retrieval DISA Seminar 4 / 17
Motivation Effectiveness and Efficiency
David Novak Multi-modal Similarity Retrieval DISA Seminar 4 / 17
Motivation Effectiveness and Efficiency
David Novak Multi-modal Similarity Retrieval DISA Seminar 4 / 17
Motivation Effectiveness and Efficiency
David Novak Multi-modal Similarity Retrieval DISA Seminar 4 / 17
Motivation Effectiveness and Efficiency
David Novak Multi-modal Similarity Retrieval DISA Seminar 4 / 17
Motivation Effectiveness and Efficiency
David Novak Multi-modal Similarity Retrieval DISA Seminar 4 / 17
Motivation Effectiveness and Efficiency
David Novak Multi-modal Similarity Retrieval DISA Seminar 4 / 17
Motivation Multi-modal Search
David Novak Multi-modal Similarity Retrieval DISA Seminar 5 / 17
Motivation Multi-modal Search
David Novak Multi-modal Similarity Retrieval DISA Seminar 5 / 17
Motivation Multi-modal Search
David Novak Multi-modal Similarity Retrieval DISA Seminar 6 / 17
Motivation Multi-modal Search
David Novak Multi-modal Similarity Retrieval DISA Seminar 6 / 17
Motivation Multi-modal Search
David Novak Multi-modal Similarity Retrieval DISA Seminar 6 / 17
Existing Solutions Similarity Indexing
David Novak Multi-modal Similarity Retrieval DISA Seminar 7 / 17
Existing Solutions Similarity Indexing
David Novak Multi-modal Similarity Retrieval DISA Seminar 7 / 17
Existing Solutions Similarity Indexing
David Novak Multi-modal Similarity Retrieval DISA Seminar 8 / 17
Existing Solutions Similarity Indexing
David Novak Multi-modal Similarity Retrieval DISA Seminar 8 / 17
Existing Solutions Similarity Indexing
David Novak Multi-modal Similarity Retrieval DISA Seminar 8 / 17
Existing Solutions Similarity Indexing
David Novak Multi-modal Similarity Retrieval DISA Seminar 8 / 17
Existing Solutions Similarity Indexing
David Novak Multi-modal Similarity Retrieval DISA Seminar 8 / 17
Existing Solutions Similarity Indexing
David Novak Multi-modal Similarity Retrieval DISA Seminar 9 / 17
Existing Solutions Similarity Indexing
David Novak Multi-modal Similarity Retrieval DISA Seminar 9 / 17
Existing Solutions Similarity Indexing
candidate set CX⊆ X
refined answer
data storage 1
calculate δ(q,c), c ∊ CX
2 3 David Novak Multi-modal Similarity Retrieval DISA Seminar 9 / 17
Existing Solutions Similarity Indexing
David Novak Multi-modal Similarity Retrieval DISA Seminar 10 / 17
Existing Solutions Similarity Indexing
David Novak Multi-modal Similarity Retrieval DISA Seminar 10 / 17
Existing Solutions Similarity Indexing
2
1
2
1
K2 N
2
K
1
N
1
K3 N3 N
4
K N K4
+2 +16 +8 +4
+1
0 = 32 0 = 32 David Novak Multi-modal Similarity Retrieval DISA Seminar 10 / 17
Existing Solutions Distributed Key-value Stores
David Novak Multi-modal Similarity Retrieval DISA Seminar 11 / 17
Existing Solutions Distributed Key-value Stores
David Novak Multi-modal Similarity Retrieval DISA Seminar 11 / 17
Big Data Similarity Retrieval Generic Architecture
key-value store (ID-object) on the whole dataset X
worker worker worker worker worker
David Novak Multi-modal Similarity Retrieval DISA Seminar 12 / 17
Big Data Similarity Retrieval Generic Architecture
key-value store (ID-object) on the whole dataset X
worker worker
similarity index IXi
field
worker worker worker
David Novak Multi-modal Similarity Retrieval DISA Seminar 12 / 17
Big Data Similarity Retrieval Generic Architecture
key-value store (ID-object) on the whole dataset X
worker worker
similarity index IXi
field
worker worker
k-NN(q.field)
candidate set CXi⊆ Xi 1 worker refinement on part of CXi 2 merge partial answers 3 refinement on part of CXi 2
David Novak Multi-modal Similarity Retrieval DISA Seminar 12 / 17
Big Data Similarity Retrieval Generic Architecture
key-value store (ID-object) on the whole dataset X
worker worker
similarity index IXi
field
inverted file index IXi
field2
attribute index IXi
field3
similarity index IXj
field
worker worker
k-NN(q.field)
candidate set CXi⊆ Xi 1 worker refinement on part of CXi 2 merge partial answers 3 refinement on part of CXi 2
David Novak Multi-modal Similarity Retrieval DISA Seminar 12 / 17
Big Data Similarity Retrieval Generic Architecture
David Novak Multi-modal Similarity Retrieval DISA Seminar 13 / 17
Big Data Similarity Retrieval Generic Architecture
David Novak Multi-modal Similarity Retrieval DISA Seminar 13 / 17
Big Data Similarity Retrieval Generic Architecture
David Novak Multi-modal Similarity Retrieval DISA Seminar 13 / 17
Big Data Similarity Retrieval Specific System
David Novak Multi-modal Similarity Retrieval DISA Seminar 14 / 17
Big Data Similarity Retrieval Specific System
index
Ispn node
Ispn node
index
Ispn node
Ispn node
index
Ispn node
index
David Novak Multi-modal Similarity Retrieval DISA Seminar 15 / 17
Big Data Similarity Retrieval Specific System
demo David Novak Multi-modal Similarity Retrieval DISA Seminar 16 / 17
Conclusions
David Novak Multi-modal Similarity Retrieval DISA Seminar 17 / 17
Conclusions
David Novak Multi-modal Similarity Retrieval DISA Seminar 17 / 17
Conclusions
David Novak Multi-modal Similarity Retrieval DISA Seminar 17 / 17