linear complexity data parallel earth mover s distance
play

Linear-Complexity Data-Parallel Earth Movers Distance Approximations - PowerPoint PPT Presentation

Linear-Complexity Data-Parallel Earth Movers Distance Approximations Kubilay Atasu, Thomas Mittelholzer Earth/Word Movers Distance: Discrete Wasserstein Distance The Queen to tour Canada Royal visit to Halifax Canada Halifax


  1. Linear-Complexity Data-Parallel Earth Mover’s Distance Approximations Kubilay Atasu, Thomas Mittelholzer

  2. Earth/Word Mover’s Distance: Discrete Wasserstein Distance The Queen to tour Canada Royal visit to Halifax Canada Halifax Canada Queen Royal Halifax tour Queen visit Royal out-flow constraints in-flow constraints embedding space Search Accuracy Complexity GPU friendly Optimality ℎ " log ℎ EMD/WMD Very high No Yes (ℎ ' log ℎ) /𝜗 ' Within 𝜗 Sinkhorn Very high Yes ℎ RWMD High Yes No Our Work Very high ℎ𝑙 Yes No 2

  3. Our Solution: Iterative Constrained Transfers (ICT) Algorithm • Relaxed in-flow constraints • Edge capacity constraints • Sort the edges in the increasing order of costs • Iterative mass transfers under capacity constraints in-flow constraints • Approximate ICT (ACT) algorithm: only k iterations • ICT & ACT are tighter lower bounds than RWMD: RWMD ≤ ACT ≤ ICT ≤ EMD 3

  4. Experiments: Runtime vs Nearest-Neighbors-Search Accuracy Ø ACT effective on sparse as well as dense, low- as well as high-dimensional datasets Ø 20’000 faster than WMD and matches its search accuracy on 20 Newsgroups Ø 10’000 faster and offers a slightly higher search accuracy than Sinkhorn on MNIST 20News : high-dimensional, sparse histograms MNIST : two-dimensional, dense histograms WCD : Word centroid distance (Euclidean) BoW : Bag-of-Words (Cosine similarity) WMD : Word Mover’s Distance (Kusner et al.) RWMD : Relaxed Word Mover’s Distance OMR and ACT-k : the new algorithms 4

  5. Linear-Complexity Data-Parallel Earth Mover’s Distance Approximations Thank You! Check-out our poster #218 in Pacific Ballroom!

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend