PASSCoDe: Parallel ASynchronous Stochastic dual Co-ordinate Descent
Cho-Jui Hsieh Department of Computer Science University of Texas at Austin
Joint work with H.-F. Yu and I. S. Dhillon
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 1 / 29
PASSCoDe : P arallel AS ynchronous S tochastic dual Co -ordinate De - - PowerPoint PPT Presentation
PASSCoDe : P arallel AS ynchronous S tochastic dual Co -ordinate De scent Cho-Jui Hsieh Department of Computer Science University of Texas at Austin Joint work with H.-F. Yu and I. S. Dhillon Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 1
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 1 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 2 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 3 / 29
i (·): the conjugate of ℓi(·)
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 4 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 5 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 6 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 7 / 29
1 Randomly pick an index i 2 Compute w Txi 3 Update αi ← αi + δ∗ where δ∗ = Ti(w Txi, αi) 4 Update w ← w + δ∗xi.
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 8 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 9 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 9 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 9 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 9 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 9 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 9 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 9 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 9 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 9 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 9 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 9 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 9 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 9 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 9 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 9 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 9 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 9 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 9 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 9 / 29
1
2
3
4
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 10 / 29
1
2
3
4
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 11 / 29
1
2
3
4
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 11 / 29
1
2
3
4
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 11 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 12 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 13 / 29
CPU1: CPU2: w = w + 0.2 w = w + 0.5 OP R1 w OP R2 0.0 1.0 0.0 1 load w 1.0 1.0 load w 1.0 2 add 0.2 1.2 1.0 add 0.5 1.5 3 save w 1.2 1.2 1.5 4 1.2 1.5 save w 1.5 Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 14 / 29
0.1 0.2 0.3 0.4 0.5 0.6
0.4 0.5
0.2 0.4
0.2 0.4
0.2 0.5
0.1 0.6
0.2 0.4 1 R1 R2 R3 R4 R5 R6 R7 R8
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 15 / 29
0.1 0.2 0.3 0.4 0.5 0.6
0.4 0.5
0.2 0.4
0.2 0.4
0.2 0.5
0.1 0.6
0.2 0.4 1 R1 R2 R3 2 δ∗ R5 R6 R7 R8
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 15 / 29
0.1 0.2 0.3 0.4 0.5 0.6
0.4 0.5
0.2 0.4
0.2 0.4
0.2 0.5
0.1 0.6
0.2 0.4 1 R1 0.1 x11 R3 2 δ∗ R5 R6 R7 R8
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 15 / 29
0.1 0.2 0.3 0.4 0.5 0.6
0.4 0.5
0.2 0.4
0.2 0.4
0.2 0.5
0.1 0.6
0.2 0.4 1 w1 0.1 x11 R3 2 δ∗ R5 R6 R7 R8
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 15 / 29
0.1 0.2 0.3 0.4 0.5 0.6
0.4 0.5
0.2 0.4
0.2 0.4
0.2 0.5
0.1 0.6
0.2 0.4 1 0.2 w1 0.1 x11 R3 2 δ∗ R5 0.1 x61 R7 R8
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 15 / 29
0.1 0.2 0.3 0.4 0.5 0.6
0.4 0.5
0.2 0.4
0.2 0.4
0.2 0.5
0.1 0.6
0.2 0.2 0.4 1 0.2 w1 0.1 x11 R3 2 δ∗ 0.2 w1 0.1 x61 R7 R8
0.2
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 15 / 29
0.1 0.2 0.3 0.4 0.5 0.6
0.4 0.5
0.2 0.4
0.2 0.4
0.2 0.5
0.1 0.6
0.2 0.2 0.4 1 0.2 w1 0.2 x12 R3 2 δ∗ 0.2 w1 0.1 x61 0.02 wTxi R8
0.2
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 15 / 29
0.1 0.2 0.3 0.4 0.5 0.6
0.4 0.5
0.2 0.4
0.2 0.4
0.2 0.5
0.1 0.6
0.2 0.2 0.4 1 0.2 w2 0.2 x12 R3 2 δ∗ 0.2 w1 0.6 x66 0.02 wTxi R8
0.2
0.2
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 15 / 29
0.1 0.2 0.3 0.4 0.5 0.6
0.4 0.5
0.2 0.4
0.2 0.4
0.2 0.5
0.1 0.6
0.2 0.2 0.4 1 0.6 w2 0.2 x12 R3 2 δ∗ w6 0.6 x66 0.02 wTxi R8
0.2
0.2
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 15 / 29
0.1 0.2 0.3 0.4 0.5 0.6
0.4 0.5
0.2 0.4
0.2 0.4
0.2 0.5
0.1 0.6
0.2 0.6 0.4 1 0.6 w2 0.2 x12 R3 2 δ∗ w6 0.6 x66 0.02 wTxi R8
0.2
0.2
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 15 / 29
0.1 0.2 0.3 0.4 0.5 0.6
0.4 0.5
0.2 0.4
0.2 0.4
0.2 0.5
0.1 0.6
0.6 0.4 1 0.6 w2 0.3 x13 R3 2 δ∗ w6 α6 0.02 wTxi R8
0.2
0.2
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 15 / 29
0.1 0.2 0.3 0.4 0.5 0.6
0.4 0.5
0.2 0.4
0.2 0.4
0.2 0.5
0.1 0.6
0.6 0.4 1 w3 0.3 x13 R3 2 δ∗ w6 α6 0.02 wTxi 5 δ∗
0.2
0.2
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 15 / 29
0.1 0.2 0.3 0.4 0.5 0.6
0.4 0.5
0.2 0.4
0.2 0.4
0.2 0.5
0.1 0.6
0.6 0.4 1 0.6 w3 0.3 x13 R3 2 δ∗ w6 α6 0.02 wTxi 5 δ∗
0.2
0.2
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 15 / 29
0.1 0.2 0.3 0.4 0.5 0.6
0.4 0.5
0.2 0.4
0.2 0.4
0.2 0.5
0.1 0.6
0.6 0.6 0.4 1 0.6 w3 0.3 x13 R3 2 δ∗ w6 α6 0.02 wTxi 5 δ∗
0.2
0.2
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 15 / 29
0.1 0.2 0.3 0.4 0.5 0.6
0.4 0.5
0.2 0.4
0.2 0.4
0.2 0.5
0.1 0.6
0.6 0.6 0.4 1 0.6 w3 0.4 x14 R3 2 δ∗ w6 α6 0.02 wTxi 5 δ∗
0.2
0.2
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 15 / 29
0.1 0.2 0.3 0.4 0.5 0.6
0.4 0.5
0.2 0.4
0.2 0.4
0.2 0.5
0.1 0.6
0.2 0.6 0.6 0.4 1 0.4 w4 0.4 x14 R3 2 δ∗ w6 5 α6 0.02 wTxi 5 δ∗
0.2 0.4
0.2
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 15 / 29
0.1 0.2 0.3 0.4 0.5 0.6
0.4 0.5
0.2 0.4
0.2 0.4
0.2 0.5
0.1 0.6
0.2 0.6 0.6 0.4 1 5 1.2 w4 0.4 x14 R3 2 δ∗ w6 5 α6 0.02 wTxi 5 δ∗
0.2 0.4
0.2
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 15 / 29
0.1 0.2 0.3 0.4 0.5 0.6
0.4 0.5
0.2 0.4
0.2 0.4
0.2 0.5
0.1 0.6
0.2 0.6 0.6 1.2 1 5 1.2 w4 0.4 x14 R3 2 δ∗ w6 0.1 x61 0.02 wTxi 5 δ∗
0.2 0.4
0.2
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 15 / 29
0.1 0.2 0.3 0.4 0.5 0.6
0.4 0.5
0.2 0.4
0.2 0.4
0.2 0.5
0.1 0.6
0.2 0.6 0.6 1.2 1 5 1.2 w4 0.5 x15 R3 2 δ∗ 0.2 w1 0.1 x61 0.02 wTxi 5 δ∗
0.2 0.4
0.2
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 15 / 29
0.1 0.2 0.3 0.4 0.5 0.6
0.4 0.5
0.2 0.4
0.2 0.4
0.2 0.5
0.1 0.6
0.2 0.6 0.6 1.2 1 5 w5 0.5 x15 R3 2 δ∗ 0.7 w1 0.1 x61 0.02 wTxi 5 δ∗
0.2 0.4
0.2
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 15 / 29
0.1 0.2 0.3 0.4 0.5 0.6
0.4 0.5
0.2 0.4
0.2 0.4
0.2 0.5
0.1 0.6
0.7 0.6 0.6 1.2 1 5 1 w5 0.5 x15 R3 2 δ∗ 0.7 w1 0.1 x61 0.02 wTxi 5 δ∗
0.2 0.4
0.2
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 15 / 29
0.1 0.2 0.3 0.4 0.5 0.6
0.4 0.5
0.2 0.4
0.2 0.4
0.2 0.5
0.1 0.6
0.7 0.6 0.6 1.2 1 1 5 1 w5 0.5 x15 R3 2 δ∗ 0.7 w1 0.6 x66 0.02 wTxi 5 δ∗
0.2 0.4
0.2
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 15 / 29
0.1 0.2 0.3 0.4 0.5 0.6
0.4 0.5
0.2 0.4
0.2 0.4
0.2 0.5
0.1 0.6
0.7 0.6 0.6 1.2 1 1 5 1 w5 0.6 x16 R3 2 δ∗ w6 0.6 x66 0.02 wTxi 5 δ∗
0.2 0.4
0.2
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 15 / 29
0.1 0.2 0.3 0.4 0.5 0.6
0.4 0.5
0.2 0.4
0.2 0.4
0.2 0.5
0.1 0.6
0.7 0.6 0.6 1.2 1 1 5 w6 0.6 x16 R3 2 δ∗ 3 w6 0.6 x66 0.02 wTxi 5 δ∗
0.2 0.4
0.2
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 15 / 29
0.1 0.2 0.3 0.4 0.5 0.6
0.4 0.5
0.2 0.4
0.2 0.4
0.2 0.5
0.1 0.6
0.7 0.6 0.6 1.2 1 3 1 5 1.2 w6 0.6 x16 R3 2 δ∗ 3 w6 0.6 x66 0.02 wTxi 5 δ∗
0.2 0.4
0.2
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 15 / 29
0.1 0.2 0.3 0.4 0.5 0.6
0.4 0.5
0.2 0.4
0.2 0.4
0.2 0.5
0.1 0.6
0.7 0.6 0.6 1.2 1 1.2 1 5 1.2 w6 0.6 x16 R3 2 δ∗ 3 w6 0.6 x66 0.02 wTxi 5 δ∗
0.2 0.4
0.2
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 15 / 29
0.1 0.2 0.3 0.4 0.5 0.6
0.4 0.5
0.2 0.4
0.2 0.4
0.2 0.5
0.1 0.6
0.7 0.6 0.6 1.2 1 1.2 1 5 1.2 w6 0.6 x16 R3 2 δ∗ 3 w6 0.6 x66 0.02 wTxi 5 δ∗
0.2 0.4
0.2
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 15 / 29
1 Randomly pick an index i 2 Lock {wj | (xi)j = 0} 3 Compute w Txi 4 Update αi ← αi + δ∗ where δ∗ = Ti(w Txi, αi) 5 Update w ← w + δ∗xi. 6 Unlock the variables. Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 16 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 17 / 29
1 Randomly pick an index i 2 Compute w Txi 3 Update αi ← αi + δ∗ where δ∗ = Ti(w Txi, αi) 4 For each j ∈ N(i) 5
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 18 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 19 / 29
i=1 ˆ
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 20 / 29
1 Randomly pick an index i 2 Compute w Txi 3 Update αi ← αi + δ∗ where δ∗ = Ti(w Txi, αi) 4 Update w ← w + δ∗xi Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 21 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 22 / 29
n
n
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 23 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 24 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 25 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 26 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 27 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 28 / 29
Cho-Jui Hsieh (UT Austin) PASSCoDe July 7, 2015 29 / 29