Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
Dimension Independent Matrix Square Introduction using MapReduce - - PowerPoint PPT Presentation
Dimension Independent Matrix Square Reza Zadeh Dimension Independent Matrix Square Introduction using MapReduce The Problem Why Bother MapReduce First Pass Naive Reza Bosagh Zadeh Analysis DIMSUM Algorithm Shuffle Size Correctness
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
1
2
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
1
2
3
4
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
1
2
3
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
0.1 0.2 0.3 0.4 0.5 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2
similarity threshold avg relative err
DISCO Cosine Similarity
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
2 4 6 8 10 12 14 16 18 0.2 0.4 0.6 0.8 1
log(p / ε)
DISCO Cosine shuffle size vs accuracy tradeoff DISCO Shuffle / Naive Shuffle 2 4 6 8 10 12 14 16 18 0.5 1 1.5 2 2.5 avg relative err DISCO Shuffle / Naive Shuffle avg relative err
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
#(x,y)
#(x)√ #(y)
#(x,y) #(x)+#(y)−#(x,y)
#(x,y) min(#(x),#(y))
2#(x,y) #(x)+#(y)
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results
Dimension Independent Matrix Square Reza Zadeh Introduction
The Problem Why Bother MapReduce
First Pass
Naive Analysis
DIMSUM
Algorithm Shuffle Size Correctness Singular values Similarities
Experiments
Large Small
More Results