Linear-Complexity Data-Parallel Earth Movers Distance Approximations - - PowerPoint PPT Presentation

linear complexity data parallel earth mover s distance
SMART_READER_LITE
LIVE PREVIEW

Linear-Complexity Data-Parallel Earth Movers Distance Approximations - - PowerPoint PPT Presentation

Linear-Complexity Data-Parallel Earth Movers Distance Approximations Kubilay Atasu, Thomas Mittelholzer Earth/Word Movers Distance: Discrete Wasserstein Distance The Queen to tour Canada Royal visit to Halifax Canada Halifax


slide-1
SLIDE 1

Linear-Complexity Data-Parallel Earth Mover’s Distance Approximations

Kubilay Atasu, Thomas Mittelholzer

slide-2
SLIDE 2

Earth/Word Mover’s Distance: Discrete Wasserstein Distance

2

Canada Queen tour Royal visit Halifax embedding space

  • ut-flow constraints

in-flow constraints

Canada Queen Halifax Royal

Search Accuracy Complexity GPU friendly Optimality EMD/WMD Very high ℎ" log ℎ No Yes Sinkhorn Very high (ℎ' log ℎ) /𝜗' Yes Within 𝜗 RWMD High ℎ Yes No Our Work Very high ℎ𝑙 Yes No

The Queen to tour Canada Royal visit to Halifax

slide-3
SLIDE 3

Our Solution: Iterative Constrained Transfers (ICT) Algorithm

3

  • Sort the edges in the increasing order of costs
  • Iterative mass transfers under capacity constraints
  • Relaxed in-flow constraints
  • Edge capacity constraints
  • Approximate ICT (ACT) algorithm: only k iterations
  • ICT & ACT are tighter lower bounds than RWMD: RWMD ≤ ACT ≤ ICT ≤ EMD

in-flow constraints

slide-4
SLIDE 4

Experiments: Runtime vs Nearest-Neighbors-Search Accuracy

4

Ø ACT effective on sparse as well as dense, low- as well as high-dimensional datasets Ø 20’000 faster than WMD and matches its search accuracy on 20 Newsgroups Ø 10’000 faster and offers a slightly higher search accuracy than Sinkhorn on MNIST

WCD: Word centroid distance (Euclidean) BoW: Bag-of-Words (Cosine similarity) WMD: Word Mover’s Distance (Kusner et al.) RWMD: Relaxed Word Mover’s Distance OMR and ACT-k: the new algorithms 20News: high-dimensional, sparse histograms MNIST: two-dimensional, dense histograms

slide-5
SLIDE 5

Linear-Complexity Data-Parallel Earth Mover’s Distance Approximations

Thank You!

Check-out our poster #218 in Pacific Ballroom!