Distributed Data Classification
Chih-Jen Lin
Department of Computer Science National Taiwan University
Talk at ICML workshop on New Learning Frameworks and Models for Big Data, June 25, 2014
Chih-Jen Lin (National Taiwan Univ.) 1 / 37
Distributed Data Classification Chih-Jen Lin Department of Computer - - PowerPoint PPT Presentation
Distributed Data Classification Chih-Jen Lin Department of Computer Science National Taiwan University Talk at ICML workshop on New Learning Frameworks and Models for Big Data, June 25, 2014 Chih-Jen Lin (National Taiwan Univ.) 1 / 37
Chih-Jen Lin (National Taiwan Univ.) 1 / 37
1
2
3
4
Chih-Jen Lin (National Taiwan Univ.) 2 / 37
Introduction: why distributed classification
1
2
3
4
Chih-Jen Lin (National Taiwan Univ.) 3 / 37
Introduction: why distributed classification
Chih-Jen Lin (National Taiwan Univ.) 4 / 37
Introduction: why distributed classification
Chih-Jen Lin (National Taiwan Univ.) 5 / 37
Introduction: why distributed classification
Chih-Jen Lin (National Taiwan Univ.) 6 / 37
Introduction: why distributed classification
Chih-Jen Lin (National Taiwan Univ.) 7 / 37
Introduction: why distributed classification
Chih-Jen Lin (National Taiwan Univ.) 8 / 37
Introduction: why distributed classification
Chih-Jen Lin (National Taiwan Univ.) 9 / 37
Introduction: why distributed classification
Chih-Jen Lin (National Taiwan Univ.) 10 / 37
Introduction: why distributed classification
Chih-Jen Lin (National Taiwan Univ.) 11 / 37
Introduction: why distributed classification
Chih-Jen Lin (National Taiwan Univ.) 12 / 37
Example: a distributed Newton method for logistic regression
1
2
3
4
Chih-Jen Lin (National Taiwan Univ.) 13 / 37
Example: a distributed Newton method for logistic regression
w f (w),
l
Chih-Jen Lin (National Taiwan Univ.) 14 / 37
Example: a distributed Newton method for logistic regression
s
Chih-Jen Lin (National Taiwan Univ.) 15 / 37
Example: a distributed Newton method for logistic regression
Chih-Jen Lin (National Taiwan Univ.) 16 / 37
Example: a distributed Newton method for logistic regression
1 D1X1s + · · · + X T p DpXps
Chih-Jen Lin (National Taiwan Univ.) 17 / 37
Example: a distributed Newton method for logistic regression
1 D1X1s
2 D2X2s
3 D3X3s
i DiXix, ∀i) to a single
Chih-Jen Lin (National Taiwan Univ.) 18 / 37
Example: a distributed Newton method for logistic regression
Chih-Jen Lin (National Taiwan Univ.) 19 / 37
Example: a distributed Newton method for logistic regression
fw,1D(Xfw,1v1+· · ·+Xfw,pvp)
Chih-Jen Lin (National Taiwan Univ.) 20 / 37
Example: a distributed Newton method for logistic regression
Chih-Jen Lin (National Taiwan Univ.) 21 / 37
Example: a distributed Newton method for logistic regression
Chih-Jen Lin (National Taiwan Univ.) 22 / 37
Example: a distributed Newton method for logistic regression
20 40 60 10
−5
10 Time (s)
Relative function value difference
ADMM−IW ADMM−FW TRON−IW TRON−FW 200 400 600 800 10
−5
10 Time (s)
Relative function value difference
ADMM−IW ADMM−FW TRON−IW TRON−FW
Chih-Jen Lin (National Taiwan Univ.) 23 / 37
Example: a distributed Newton method for logistic regression
Chih-Jen Lin (National Taiwan Univ.) 24 / 37
Discussion from the viewpoint of the application workflow
1
2
3
4
Chih-Jen Lin (National Taiwan Univ.) 25 / 37
Discussion from the viewpoint of the application workflow
Chih-Jen Lin (National Taiwan Univ.) 26 / 37
Discussion from the viewpoint of the application workflow
Chih-Jen Lin (National Taiwan Univ.) 27 / 37
Discussion from the viewpoint of the application workflow
Chih-Jen Lin (National Taiwan Univ.) 28 / 37
Discussion from the viewpoint of the application workflow
Chih-Jen Lin (National Taiwan Univ.) 29 / 37
Discussion from the viewpoint of the application workflow
Chih-Jen Lin (National Taiwan Univ.) 30 / 37
Discussion from the viewpoint of the application workflow
Chih-Jen Lin (National Taiwan Univ.) 31 / 37
Discussion from the viewpoint of the application workflow
Chih-Jen Lin (National Taiwan Univ.) 32 / 37
Discussion from the viewpoint of the application workflow
20 40 60 80 −8 −6 −4 −2 2 Training time (seconds) Relative function value difference (log)
spark−one−core spark−multi−cores spark−one−core−coalesce spark−multi−cores−coalesce mpi
10 20 30 40 −8 −6 −4 −2 2 Training time (seconds) Relative function value difference (log)
spark−one−core spark−multi−cores spark−one−core−coalesce spark−multi−cores−coalesce mpi
Chih-Jen Lin (National Taiwan Univ.) 33 / 37
Discussion from the viewpoint of the application workflow
Chih-Jen Lin (National Taiwan Univ.) 34 / 37
Discussion from the viewpoint of the application workflow
Chih-Jen Lin (National Taiwan Univ.) 35 / 37
Conclusions
1
2
3
4
Chih-Jen Lin (National Taiwan Univ.) 36 / 37
Conclusions
Chih-Jen Lin (National Taiwan Univ.) 37 / 37