Part 2 SUVRIT SRA & STEFANIE JEGELKA
Laboratory for Information and Decision Systems
Massachusetts Institute of Technology
ml.mit.edu
Neural information Processing Systems, 2018
Negative Dependence, Stable Polynomials etc in ML Part 2 SUVRIT - - PowerPoint PPT Presentation
Negative Dependence, Stable Polynomials etc in ML Part 2 SUVRIT SRA & STEFANIE JEGELKA Laboratory for Information and Decision Systems Massachusetts Institute of Technology Neural information Processing Systems, 2018 ml.mit.edu
Laboratory for Information and Decision Systems
Massachusetts Institute of Technology
ml.mit.edu
Neural information Processing Systems, 2018
Negative dependence, stable polynomials etc. in ML - part 1
Stefanie Jegelka (stefje@mit.edu)
Theory & Applications
Learning a DPP (and some variants) Approximating partition functions Applications Perspectives and wrap-up
Intro & Theory
Introduction
Prominent example: Determinantal Point Processes
Stronger notions of negative dependence Implications: Sampling
Recommender systems, Nyström method,
negative mining, anomaly detection, etc.
Negative dependence, stable polynomials etc. in ML - part 2
Suvrit Sra (suvrit@mit.edu)
Negative dependence, stable polynomials etc. in ML - part 2
Suvrit Sra (suvrit@mit.edu) 4
(exponential number of terms to sum over, or evaluation of high-dimensional integrals / volumes)
Negative dependence, stable polynomials etc. in ML - part 2
Suvrit Sra (suvrit@mit.edu) 5
S⊆[n]
det(LS)
<latexit sha1_base64="xVYHoL9gl7pPdmi1xQPxjMtBCA=">ADTnicfVJNb9QwEHVTPpblo1s4crFYgYqEqs0KCS5IFXDgUEpR2XbLOocZ7Jr1XaC7ZSurPwDfg1X+BFc+SPcEDi7OeymiJGSeXrvTJjT1Ibuxg8HMj2Lxy9dr1zo3uzVu372z1tu8em7zUDEYsF7keJ9SA4ApGlsB40IDlYmAk+TsVa2fnIM2PFcf7LyASNKp4hln1Hoq7j36GO/jF5iYUsbuqM6JAQuf8ERFSYp2J39+Ohx3OsPdgeLwJdB2IA+auIw3g56JM1ZKUFZJqgxk3BQ2MhRbTkTUHVJaCg7IxOYeKhohJM5BYDVfihZ1Kc5do/yuIFu1rhqDRmLhPvlNTOTFuryX9qiVz7szNWUj3Xasfmz2PHFdFaUGxZTtZKbDNcX2GOUamBVzDyjT3E+E2Yxqyqw/6S5R8JnlUlKVOsKLajKMHBGQWSKomgrA/fAJ7g8x0Xw6s0QvyGq97PyiciSR7qJqCUr4m/KafxOVCy65NS3LAfN6PTqjwh20PzBeVcdt9XRVPa3V1+CvT8Nbz70rQFOba0eonkrqW2zyf1xcLV0+x0K2xtzGRwPd0OP3z/t71stqmD7qMHaAeF6BnaQ2/QIRohr6gr+gb+h78CH4Fv4M/S2uw0dTcQ2ux2fkLHv0ZhA=</latexit><latexit sha1_base64="xVYHoL9gl7pPdmi1xQPxjMtBCA=">ADTnicfVJNb9QwEHVTPpblo1s4crFYgYqEqs0KCS5IFXDgUEpR2XbLOocZ7Jr1XaC7ZSurPwDfg1X+BFc+SPcEDi7OeymiJGSeXrvTJjT1Ibuxg8HMj2Lxy9dr1zo3uzVu372z1tu8em7zUDEYsF7keJ9SA4ApGlsB40IDlYmAk+TsVa2fnIM2PFcf7LyASNKp4hln1Hoq7j36GO/jF5iYUsbuqM6JAQuf8ERFSYp2J39+Ohx3OsPdgeLwJdB2IA+auIw3g56JM1ZKUFZJqgxk3BQ2MhRbTkTUHVJaCg7IxOYeKhohJM5BYDVfihZ1Kc5do/yuIFu1rhqDRmLhPvlNTOTFuryX9qiVz7szNWUj3Xasfmz2PHFdFaUGxZTtZKbDNcX2GOUamBVzDyjT3E+E2Yxqyqw/6S5R8JnlUlKVOsKLajKMHBGQWSKomgrA/fAJ7g8x0Xw6s0QvyGq97PyiciSR7qJqCUr4m/KafxOVCy65NS3LAfN6PTqjwh20PzBeVcdt9XRVPa3V1+CvT8Nbz70rQFOba0eonkrqW2zyf1xcLV0+x0K2xtzGRwPd0OP3z/t71stqmD7qMHaAeF6BnaQ2/QIRohr6gr+gb+h78CH4Fv4M/S2uw0dTcQ2ux2fkLHv0ZhA=</latexit><latexit sha1_base64="xVYHoL9gl7pPdmi1xQPxjMtBCA=">ADTnicfVJNb9QwEHVTPpblo1s4crFYgYqEqs0KCS5IFXDgUEpR2XbLOocZ7Jr1XaC7ZSurPwDfg1X+BFc+SPcEDi7OeymiJGSeXrvTJjT1Ibuxg8HMj2Lxy9dr1zo3uzVu372z1tu8em7zUDEYsF7keJ9SA4ApGlsB40IDlYmAk+TsVa2fnIM2PFcf7LyASNKp4hln1Hoq7j36GO/jF5iYUsbuqM6JAQuf8ERFSYp2J39+Ohx3OsPdgeLwJdB2IA+auIw3g56JM1ZKUFZJqgxk3BQ2MhRbTkTUHVJaCg7IxOYeKhohJM5BYDVfihZ1Kc5do/yuIFu1rhqDRmLhPvlNTOTFuryX9qiVz7szNWUj3Xasfmz2PHFdFaUGxZTtZKbDNcX2GOUamBVzDyjT3E+E2Yxqyqw/6S5R8JnlUlKVOsKLajKMHBGQWSKomgrA/fAJ7g8x0Xw6s0QvyGq97PyiciSR7qJqCUr4m/KafxOVCy65NS3LAfN6PTqjwh20PzBeVcdt9XRVPa3V1+CvT8Nbz70rQFOba0eonkrqW2zyf1xcLV0+x0K2xtzGRwPd0OP3z/t71stqmD7qMHaAeF6BnaQ2/QIRohr6gr+gb+h78CH4Fv4M/S2uw0dTcQ2ux2fkLHv0ZhA=</latexit><latexit sha1_base64="xVYHoL9gl7pPdmi1xQPxjMtBCA=">ADTnicfVJNb9QwEHVTPpblo1s4crFYgYqEqs0KCS5IFXDgUEpR2XbLOocZ7Jr1XaC7ZSurPwDfg1X+BFc+SPcEDi7OeymiJGSeXrvTJjT1Ibuxg8HMj2Lxy9dr1zo3uzVu372z1tu8em7zUDEYsF7keJ9SA4ApGlsB40IDlYmAk+TsVa2fnIM2PFcf7LyASNKp4hln1Hoq7j36GO/jF5iYUsbuqM6JAQuf8ERFSYp2J39+Ohx3OsPdgeLwJdB2IA+auIw3g56JM1ZKUFZJqgxk3BQ2MhRbTkTUHVJaCg7IxOYeKhohJM5BYDVfihZ1Kc5do/yuIFu1rhqDRmLhPvlNTOTFuryX9qiVz7szNWUj3Xasfmz2PHFdFaUGxZTtZKbDNcX2GOUamBVzDyjT3E+E2Yxqyqw/6S5R8JnlUlKVOsKLajKMHBGQWSKomgrA/fAJ7g8x0Xw6s0QvyGq97PyiciSR7qJqCUr4m/KafxOVCy65NS3LAfN6PTqjwh20PzBeVcdt9XRVPa3V1+CvT8Nbz70rQFOba0eonkrqW2zyf1xcLV0+x0K2xtzGRwPd0OP3z/t71stqmD7qMHaAeF6BnaQ2/QIRohr6gr+gb+h78CH4Fv4M/S2uw0dTcQ2ux2fkLHv0ZhA=</latexit>S⊆[n]
S⊆[n]
Negative dependence, stable polynomials etc. in ML - part 2
Suvrit Sra (suvrit@mit.edu) 6
Using properties of stable polynomials, these can be approximated within factor en (ek for k-homogeneous, e.g., k-DPP): [Straszak, Vishnoi, 2016; Nikolov, Singh, 2016; Anari, Gharan,
Saberi, Singh, 2016; Anari, Gharan 2017]
using convex relaxation afforded by stable polynomials
S⊆[n]
S⊆[n]
z>0
z=exp(y): yields convex optim. (a geometric program - GP)
Negative dependence, stable polynomials etc. in ML - part 2
Suvrit Sra (suvrit@mit.edu) 7
σ∈Sn n
i=1
z>0
p(z1, . . . , zn) z1z2 · · · zn
<latexit sha1_base64="TdPbfM0CAKwVqeAje1lclMmNaHI=">ADpnicfVJb9MwFHYXLqNc1jHeDFUSEWqTi+oIm4IEXypDo2qnuiuM4qTXbiWxnrLXC/+SdH4KTRlMbEdyfPJ95zs+9jlhxpk2v+rtefduHnr9v6d9t179x8cdA4fnuo0V4SOScpTNQ2xpxJOjbMcDrNFMUi5HQSXnwo+cklVZql8ptZXQucCJZzAg2Dlp0fqJYWJRhpVhmJ9LmBXf3C9CBCJUqO3EFnANko3Ajlk8LKc1kgJuOFXcN30C9qKus5eR/xUt93ueFdcB6MdykrFItOl1/4FcG/cFLP3j7KoDXSFA7XVDbyeJwr4OilOSCSkM41noW+JmZ27I8wmnRrmGSYXOKEz50osqJ7b6qEK+MwhEYxT5ZY0sEK3FRYLrVcidJECm6VuciX4Ty4UOydbQRWKxU16jHxm7lMsNlWRTpxzaFJY9gZGTFi+Mo5mCjmbgTJEru3NK6DbSTpD5IKgWVkEcuK2XBuEaexQRzLhFPYDfqwO4RIsWRpkKrAYld2eW6Gwp7VTQIyXUuHOe+SKacCWZ0I2REHF9enWBuR80E0212mTPtmzkv1IXfsU/eywLxlV2KTKIqwSgV2J9f6fKCY3UayaoaA5MX87p8NB4PyvL7rH7+tp2gePwVPQAwF4DY7BJ3ACxoCA36126j1yOt5I2/sTahe61acwR2zPv+B/dPOEs=</latexit><latexit sha1_base64="TdPbfM0CAKwVqeAje1lclMmNaHI=">ADpnicfVJb9MwFHYXLqNc1jHeDFUSEWqTi+oIm4IEXypDo2qnuiuM4qTXbiWxnrLXC/+SdH4KTRlMbEdyfPJ95zs+9jlhxpk2v+rtefduHnr9v6d9t179x8cdA4fnuo0V4SOScpTNQ2xpxJOjbMcDrNFMUi5HQSXnwo+cklVZql8ptZXQucCJZzAg2Dlp0fqJYWJRhpVhmJ9LmBXf3C9CBCJUqO3EFnANko3Ajlk8LKc1kgJuOFXcN30C9qKus5eR/xUt93ueFdcB6MdykrFItOl1/4FcG/cFLP3j7KoDXSFA7XVDbyeJwr4OilOSCSkM41noW+JmZ27I8wmnRrmGSYXOKEz50osqJ7b6qEK+MwhEYxT5ZY0sEK3FRYLrVcidJECm6VuciX4Ty4UOydbQRWKxU16jHxm7lMsNlWRTpxzaFJY9gZGTFi+Mo5mCjmbgTJEru3NK6DbSTpD5IKgWVkEcuK2XBuEaexQRzLhFPYDfqwO4RIsWRpkKrAYld2eW6Gwp7VTQIyXUuHOe+SKacCWZ0I2REHF9enWBuR80E0212mTPtmzkv1IXfsU/eywLxlV2KTKIqwSgV2J9f6fKCY3UayaoaA5MX87p8NB4PyvL7rH7+tp2gePwVPQAwF4DY7BJ3ACxoCA36126j1yOt5I2/sTahe61acwR2zPv+B/dPOEs=</latexit><latexit sha1_base64="TdPbfM0CAKwVqeAje1lclMmNaHI=">ADpnicfVJb9MwFHYXLqNc1jHeDFUSEWqTi+oIm4IEXypDo2qnuiuM4qTXbiWxnrLXC/+SdH4KTRlMbEdyfPJ95zs+9jlhxpk2v+rtefduHnr9v6d9t179x8cdA4fnuo0V4SOScpTNQ2xpxJOjbMcDrNFMUi5HQSXnwo+cklVZql8ptZXQucCJZzAg2Dlp0fqJYWJRhpVhmJ9LmBXf3C9CBCJUqO3EFnANko3Ajlk8LKc1kgJuOFXcN30C9qKus5eR/xUt93ueFdcB6MdykrFItOl1/4FcG/cFLP3j7KoDXSFA7XVDbyeJwr4OilOSCSkM41noW+JmZ27I8wmnRrmGSYXOKEz50osqJ7b6qEK+MwhEYxT5ZY0sEK3FRYLrVcidJECm6VuciX4Ty4UOydbQRWKxU16jHxm7lMsNlWRTpxzaFJY9gZGTFi+Mo5mCjmbgTJEru3NK6DbSTpD5IKgWVkEcuK2XBuEaexQRzLhFPYDfqwO4RIsWRpkKrAYld2eW6Gwp7VTQIyXUuHOe+SKacCWZ0I2REHF9enWBuR80E0212mTPtmzkv1IXfsU/eywLxlV2KTKIqwSgV2J9f6fKCY3UayaoaA5MX87p8NB4PyvL7rH7+tp2gePwVPQAwF4DY7BJ3ACxoCA36126j1yOt5I2/sTahe61acwR2zPv+B/dPOEs=</latexit><latexit sha1_base64="TdPbfM0CAKwVqeAje1lclMmNaHI=">ADpnicfVJb9MwFHYXLqNc1jHeDFUSEWqTi+oIm4IEXypDo2qnuiuM4qTXbiWxnrLXC/+SdH4KTRlMbEdyfPJ95zs+9jlhxpk2v+rtefduHnr9v6d9t179x8cdA4fnuo0V4SOScpTNQ2xpxJOjbMcDrNFMUi5HQSXnwo+cklVZql8ptZXQucCJZzAg2Dlp0fqJYWJRhpVhmJ9LmBXf3C9CBCJUqO3EFnANko3Ajlk8LKc1kgJuOFXcN30C9qKus5eR/xUt93ueFdcB6MdykrFItOl1/4FcG/cFLP3j7KoDXSFA7XVDbyeJwr4OilOSCSkM41noW+JmZ27I8wmnRrmGSYXOKEz50osqJ7b6qEK+MwhEYxT5ZY0sEK3FRYLrVcidJECm6VuciX4Ty4UOydbQRWKxU16jHxm7lMsNlWRTpxzaFJY9gZGTFi+Mo5mCjmbgTJEru3NK6DbSTpD5IKgWVkEcuK2XBuEaexQRzLhFPYDfqwO4RIsWRpkKrAYld2eW6Gwp7VTQIyXUuHOe+SKacCWZ0I2REHF9enWBuR80E0212mTPtmzkv1IXfsU/eywLxlV2KTKIqwSgV2J9f6fKCY3UayaoaA5MX87p8NB4PyvL7rH7+tp2gePwVPQAwF4DY7BJ3ACxoCA36126j1yOt5I2/sTahe61acwR2zPv+B/dPOEs=</latexit>per(A) = ∂np(0) ∂z1 · · · ∂zn p(z1, . . . , zn) =
n
Y
i=1
⇣Xn
j=1 aijzj
⌘
<latexit sha1_base64="NTFnPR6b0xinoKWsY0dhNdzWpc=">ADuHicfZJdb9MwFIbdho9Rvjq4hAuLalMrVNSIYGQJo3BTeMIdGtU90Fx3FSd7YT2e5oZ+U/8Pf4I1zjpBVqM4SlxEfvc45znPdEOWfa+P6vRtO7c/fe/Z0HrYePHj952t59dqazuSJ0SDKeqVGENeVM0qFhtNRrigWEafn0dWHkp9fU6VZJr+ZU4nAqeSJYxg46Sw/RMZujA2p6rovu/B/UOIEoWJRTlWhmF+KWHe9XvFXwHehAEicWb0hiILhFp516E+4iXrO21Wq6yOLTsMCguJTpmKe8iyfVchHZWaRA7OituwlJVS9sd/wDv1rwdhCsgw5Yr9Nwt9lGcUbmgkpDONZ6HPi5mdiyOcJp0UJzTXNMrnBKxy6UWFA9sdWvK+CeU2KYZMo90sBK3aywWGi9FJHLFNhMdZ2V4j9ZJLa+bLURWC1VXOvHJG8nlsl8bqgkq3aSOYcmg6VbMGaKEsOXLsBEMXcjSKbYGWScpy0k6Q+SCYFlbBHLi/FgYhGniUEcy5RT2An6sDOASLF0apCqxGK7HrhvI2EXRQ1ULnkmHsjmXEmNG1lBPieHl1grk9qR8w2qSjOr3YpBcl/UidfYp+dtoXN4/YZMoirFKBXYvr/T9ZTK6y3O5mKhPzO3gbHAQuPjr687R8XqadsAL8Ap0QDegCPwCZyCISDgd+NlY6+x73zvnupx1apzca65jnYWp76A41HPbY=</latexit><latexit sha1_base64="NTFnPR6b0xinoKWsY0dhNdzWpc=">ADuHicfZJdb9MwFIbdho9Rvjq4hAuLalMrVNSIYGQJo3BTeMIdGtU90Fx3FSd7YT2e5oZ+U/8Pf4I1zjpBVqM4SlxEfvc45znPdEOWfa+P6vRtO7c/fe/Z0HrYePHj952t59dqazuSJ0SDKeqVGENeVM0qFhtNRrigWEafn0dWHkp9fU6VZJr+ZU4nAqeSJYxg46Sw/RMZujA2p6rovu/B/UOIEoWJRTlWhmF+KWHe9XvFXwHehAEicWb0hiILhFp516E+4iXrO21Wq6yOLTsMCguJTpmKe8iyfVchHZWaRA7OituwlJVS9sd/wDv1rwdhCsgw5Yr9Nwt9lGcUbmgkpDONZ6HPi5mdiyOcJp0UJzTXNMrnBKxy6UWFA9sdWvK+CeU2KYZMo90sBK3aywWGi9FJHLFNhMdZ2V4j9ZJLa+bLURWC1VXOvHJG8nlsl8bqgkq3aSOYcmg6VbMGaKEsOXLsBEMXcjSKbYGWScpy0k6Q+SCYFlbBHLi/FgYhGniUEcy5RT2An6sDOASLF0apCqxGK7HrhvI2EXRQ1ULnkmHsjmXEmNG1lBPieHl1grk9qR8w2qSjOr3YpBcl/UidfYp+dtoXN4/YZMoirFKBXYvr/T9ZTK6y3O5mKhPzO3gbHAQuPjr687R8XqadsAL8Ap0QDegCPwCZyCISDgd+NlY6+x73zvnupx1apzca65jnYWp76A41HPbY=</latexit><latexit sha1_base64="NTFnPR6b0xinoKWsY0dhNdzWpc=">ADuHicfZJdb9MwFIbdho9Rvjq4hAuLalMrVNSIYGQJo3BTeMIdGtU90Fx3FSd7YT2e5oZ+U/8Pf4I1zjpBVqM4SlxEfvc45znPdEOWfa+P6vRtO7c/fe/Z0HrYePHj952t59dqazuSJ0SDKeqVGENeVM0qFhtNRrigWEafn0dWHkp9fU6VZJr+ZU4nAqeSJYxg46Sw/RMZujA2p6rovu/B/UOIEoWJRTlWhmF+KWHe9XvFXwHehAEicWb0hiILhFp516E+4iXrO21Wq6yOLTsMCguJTpmKe8iyfVchHZWaRA7OituwlJVS9sd/wDv1rwdhCsgw5Yr9Nwt9lGcUbmgkpDONZ6HPi5mdiyOcJp0UJzTXNMrnBKxy6UWFA9sdWvK+CeU2KYZMo90sBK3aywWGi9FJHLFNhMdZ2V4j9ZJLa+bLURWC1VXOvHJG8nlsl8bqgkq3aSOYcmg6VbMGaKEsOXLsBEMXcjSKbYGWScpy0k6Q+SCYFlbBHLi/FgYhGniUEcy5RT2An6sDOASLF0apCqxGK7HrhvI2EXRQ1ULnkmHsjmXEmNG1lBPieHl1grk9qR8w2qSjOr3YpBcl/UidfYp+dtoXN4/YZMoirFKBXYvr/T9ZTK6y3O5mKhPzO3gbHAQuPjr687R8XqadsAL8Ap0QDegCPwCZyCISDgd+NlY6+x73zvnupx1apzca65jnYWp76A41HPbY=</latexit><latexit sha1_base64="NTFnPR6b0xinoKWsY0dhNdzWpc=">ADuHicfZJdb9MwFIbdho9Rvjq4hAuLalMrVNSIYGQJo3BTeMIdGtU90Fx3FSd7YT2e5oZ+U/8Pf4I1zjpBVqM4SlxEfvc45znPdEOWfa+P6vRtO7c/fe/Z0HrYePHj952t59dqazuSJ0SDKeqVGENeVM0qFhtNRrigWEafn0dWHkp9fU6VZJr+ZU4nAqeSJYxg46Sw/RMZujA2p6rovu/B/UOIEoWJRTlWhmF+KWHe9XvFXwHehAEicWb0hiILhFp516E+4iXrO21Wq6yOLTsMCguJTpmKe8iyfVchHZWaRA7OituwlJVS9sd/wDv1rwdhCsgw5Yr9Nwt9lGcUbmgkpDONZ6HPi5mdiyOcJp0UJzTXNMrnBKxy6UWFA9sdWvK+CeU2KYZMo90sBK3aywWGi9FJHLFNhMdZ2V4j9ZJLa+bLURWC1VXOvHJG8nlsl8bqgkq3aSOYcmg6VbMGaKEsOXLsBEMXcjSKbYGWScpy0k6Q+SCYFlbBHLi/FgYhGniUEcy5RT2An6sDOASLF0apCqxGK7HrhvI2EXRQ1ULnkmHsjmXEmNG1lBPieHl1grk9qR8w2qSjOr3YpBcl/UidfYp+dtoXN4/YZMoirFKBXYvr/T9ZTK6y3O5mKhPzO3gbHAQuPjr687R8XqadsAL8Ap0QDegCPwCZyCISDgd+NlY6+x73zvnupx1apzca65jnYWp76A41HPbY=</latexit>A is
doubly stochastic
Permanents via stable polynomials (Gurvits 2006)
Negative dependence, stable polynomials etc. in ML - part 2
Suvrit Sra (suvrit@mit.edu)
Negative dependence, stable polynomials etc. in ML - part 2
Suvrit Sra (suvrit@mit.edu) 9
Kulesza, Taskar ICML 2011, UAI 2011 Affandi, Fox, Adams, Taskar, ICML 2014 Gillenwater, Kulesza, Fox, Taskar, NIPS 2014
Negative dependence, stable polynomials etc. in ML - part 2
Suvrit Sra (suvrit@mit.edu) 10
L0 φ(L) := N
X
i=1
log Pr(Yi) =
N
X
i=1
log det(LYi) det(I + L)
<latexit sha1_base64="stmjghIiJuwsighZLra2u38VaU=">ADk3icfVJdb9MwFHUbPkb5WAfiReLCqkVaEoqJBCo0jT2AFJXhkS3VnWJHMdprdpOZDujlZUfySO/hFectg9tirhS4pNznXse2+UcaN7/+u1b07d+/dP3rQePjo8ZPj5snTa53mitAhSXmqRhHWlDNJh4YZTkeZolhEnN5Ei0+lfnNLlWap/G5WGZ0KPJMsYQbR4XNBRJ4Gdo+RDonBPoFyuas3e/AD72SEqFlvaD4MYCIpzNk6NLYK1W0xyHr9A71RGFiUxNux9a5yk6xebzy+t+pwibLf/UXwc8BMEWtMA2rsKTehPFKckFlYZwrPUk8DMztVgZRjgtGijXNMNkgWd04qDEguqpXVelgK8cE8MkVe6RBq7Z3QyLhdYrETmnwGauq1pJ/lOLxN6frTYCq5WK+cxyfupZTLDZVkc5wk59CksGwEjJmixPCVA5go5m4EyRy7AhrXrgaS9CdJhcAytohlxaQ7tYjTxCO5YxT2ArewFYXIsVmc4PUmiz2026XrvqRsMuiIkjuOuc090Yy5UwoyuWAXF6eXWCuR1UNxjtqOqOt5Vx6V6QV37FL103NeMKmxSZRFWMzd7hd2u/3ExuXG51c1QUJ2YQ3DdPQ0c/va2dXa+naYj8AK8BG0QgHfgDHwGV2AICPgF/tRAreY9z56597FxlqvbXOegb3wLv8CjUgvNQ=</latexit><latexit sha1_base64="stmjghIiJuwsighZLra2u38VaU=">ADk3icfVJdb9MwFHUbPkb5WAfiReLCqkVaEoqJBCo0jT2AFJXhkS3VnWJHMdprdpOZDujlZUfySO/hFectg9tirhS4pNznXse2+UcaN7/+u1b07d+/dP3rQePjo8ZPj5snTa53mitAhSXmqRhHWlDNJh4YZTkeZolhEnN5Ei0+lfnNLlWap/G5WGZ0KPJMsYQbR4XNBRJ4Gdo+RDonBPoFyuas3e/AD72SEqFlvaD4MYCIpzNk6NLYK1W0xyHr9A71RGFiUxNux9a5yk6xebzy+t+pwibLf/UXwc8BMEWtMA2rsKTehPFKckFlYZwrPUk8DMztVgZRjgtGijXNMNkgWd04qDEguqpXVelgK8cE8MkVe6RBq7Z3QyLhdYrETmnwGauq1pJ/lOLxN6frTYCq5WK+cxyfupZTLDZVkc5wk59CksGwEjJmixPCVA5go5m4EyRy7AhrXrgaS9CdJhcAytohlxaQ7tYjTxCO5YxT2ArewFYXIsVmc4PUmiz2026XrvqRsMuiIkjuOuc090Yy5UwoyuWAXF6eXWCuR1UNxjtqOqOt5Vx6V6QV37FL103NeMKmxSZRFWMzd7hd2u/3ExuXG51c1QUJ2YQ3DdPQ0c/va2dXa+naYj8AK8BG0QgHfgDHwGV2AICPgF/tRAreY9z56597FxlqvbXOegb3wLv8CjUgvNQ=</latexit><latexit sha1_base64="stmjghIiJuwsighZLra2u38VaU=">ADk3icfVJdb9MwFHUbPkb5WAfiReLCqkVaEoqJBCo0jT2AFJXhkS3VnWJHMdprdpOZDujlZUfySO/hFectg9tirhS4pNznXse2+UcaN7/+u1b07d+/dP3rQePjo8ZPj5snTa53mitAhSXmqRhHWlDNJh4YZTkeZolhEnN5Ei0+lfnNLlWap/G5WGZ0KPJMsYQbR4XNBRJ4Gdo+RDonBPoFyuas3e/AD72SEqFlvaD4MYCIpzNk6NLYK1W0xyHr9A71RGFiUxNux9a5yk6xebzy+t+pwibLf/UXwc8BMEWtMA2rsKTehPFKckFlYZwrPUk8DMztVgZRjgtGijXNMNkgWd04qDEguqpXVelgK8cE8MkVe6RBq7Z3QyLhdYrETmnwGauq1pJ/lOLxN6frTYCq5WK+cxyfupZTLDZVkc5wk59CksGwEjJmixPCVA5go5m4EyRy7AhrXrgaS9CdJhcAytohlxaQ7tYjTxCO5YxT2ArewFYXIsVmc4PUmiz2026XrvqRsMuiIkjuOuc090Yy5UwoyuWAXF6eXWCuR1UNxjtqOqOt5Vx6V6QV37FL103NeMKmxSZRFWMzd7hd2u/3ExuXG51c1QUJ2YQ3DdPQ0c/va2dXa+naYj8AK8BG0QgHfgDHwGV2AICPgF/tRAreY9z56597FxlqvbXOegb3wLv8CjUgvNQ=</latexit><latexit sha1_base64="stmjghIiJuwsighZLra2u38VaU=">ADk3icfVJdb9MwFHUbPkb5WAfiReLCqkVaEoqJBCo0jT2AFJXhkS3VnWJHMdprdpOZDujlZUfySO/hFectg9tirhS4pNznXse2+UcaN7/+u1b07d+/dP3rQePjo8ZPj5snTa53mitAhSXmqRhHWlDNJh4YZTkeZolhEnN5Ei0+lfnNLlWap/G5WGZ0KPJMsYQbR4XNBRJ4Gdo+RDonBPoFyuas3e/AD72SEqFlvaD4MYCIpzNk6NLYK1W0xyHr9A71RGFiUxNux9a5yk6xebzy+t+pwibLf/UXwc8BMEWtMA2rsKTehPFKckFlYZwrPUk8DMztVgZRjgtGijXNMNkgWd04qDEguqpXVelgK8cE8MkVe6RBq7Z3QyLhdYrETmnwGauq1pJ/lOLxN6frTYCq5WK+cxyfupZTLDZVkc5wk59CksGwEjJmixPCVA5go5m4EyRy7AhrXrgaS9CdJhcAytohlxaQ7tYjTxCO5YxT2ArewFYXIsVmc4PUmiz2026XrvqRsMuiIkjuOuc090Yy5UwoyuWAXF6eXWCuR1UNxjtqOqOt5Vx6V6QV37FL103NeMKmxSZRFWMzd7hd2u/3ExuXG51c1QUJ2YQ3DdPQ0c/va2dXa+naYj8AK8BG0QgHfgDHwGV2AICPgF/tRAreY9z56597FxlqvbXOegb3wLv8CjUgvNQ=</latexit>complexity: [Urschel, Brunel, Moitra, Rigollet, ICML 2017]
Related recent work
Negative dependence, stable polynomials etc. in ML - part 2
Suvrit Sra (suvrit@mit.edu) 11
LR-DPP: Write L=VVT for low-rank V (can sample size ≤ k) [Gartrell, Paquet, Koenigstein, 2017] k-DPP: Restrict DPP to subsets of size exactly ‘k’ [Kulesza, Taskar, 2011] Kron-DPP: Write (can sample any size) [Mariet, Sra, 2017]
L = L1 ⊗ L2
<latexit sha1_base64="Mcj0ZdFzc8oxIU6DHnJRzqBGpM4=">ADO3icfZJNbxMxEIbd5aMlfKX0yMVihcQBVdkICS6VKuDAoZQikTZVHEWziSx6o+V7S2NrP0tXOFH8EM4c0NcueNcki2iJF2/ep9ZtZj7+SFM53Oj+2khs3b93e3rnTunv/oOH7d1Hp86UlmOPG2lsPweHUmjseEl9guLoHKJZ/nFm5qfXaJ1wuhPfl7gUMFUi4ng4KM1au8d0QN6NMoM14odF3R+20s9ZBL0uspVIySpORrtJm40NLxVqzyU4N8g6hR8GsF5wiVWLlQ4L4BcwxUGUGuJOw7DovqJPozOmE2Pjoz1duOsVAZRzc5XHTAV+5pqsNv/JcrWxc3BegZ3bcaMfP3k1DEIXpUfNl+1MSkm9ofWF0bGwyL2cRwHcingiymdgft4rS2m8TM3SoEeByaKatAdBiZx4pkEPZVI0+w5TbuUWTGdeWYXZrVZdnlVBZarcFU1gJauVJHFN9NGCiW8a6Qc8jro3OQ4bj5gf467Tfp+To9r+lbjL/P4vofSjQgjc2MLBTBbHF1fqfLKGXWXGNM5Q1J+a6O3uZ1F/fJEevl5N0w5TJ6QZyQjL8kheUdOSI9wMidfyFfyLfme/Ex+Jb+XqcnWqmaPbETy5y8XphJh</latexit><latexit sha1_base64="Mcj0ZdFzc8oxIU6DHnJRzqBGpM4=">ADO3icfZJNbxMxEIbd5aMlfKX0yMVihcQBVdkICS6VKuDAoZQikTZVHEWziSx6o+V7S2NrP0tXOFH8EM4c0NcueNcki2iJF2/ep9ZtZj7+SFM53Oj+2khs3b93e3rnTunv/oOH7d1Hp86UlmOPG2lsPweHUmjseEl9guLoHKJZ/nFm5qfXaJ1wuhPfl7gUMFUi4ng4KM1au8d0QN6NMoM14odF3R+20s9ZBL0uspVIySpORrtJm40NLxVqzyU4N8g6hR8GsF5wiVWLlQ4L4BcwxUGUGuJOw7DovqJPozOmE2Pjoz1duOsVAZRzc5XHTAV+5pqsNv/JcrWxc3BegZ3bcaMfP3k1DEIXpUfNl+1MSkm9ofWF0bGwyL2cRwHcingiymdgft4rS2m8TM3SoEeByaKatAdBiZx4pkEPZVI0+w5TbuUWTGdeWYXZrVZdnlVBZarcFU1gJauVJHFN9NGCiW8a6Qc8jro3OQ4bj5gf467Tfp+To9r+lbjL/P4vofSjQgjc2MLBTBbHF1fqfLKGXWXGNM5Q1J+a6O3uZ1F/fJEevl5N0w5TJ6QZyQjL8kheUdOSI9wMidfyFfyLfme/Ex+Jb+XqcnWqmaPbETy5y8XphJh</latexit><latexit sha1_base64="Mcj0ZdFzc8oxIU6DHnJRzqBGpM4=">ADO3icfZJNbxMxEIbd5aMlfKX0yMVihcQBVdkICS6VKuDAoZQikTZVHEWziSx6o+V7S2NrP0tXOFH8EM4c0NcueNcki2iJF2/ep9ZtZj7+SFM53Oj+2khs3b93e3rnTunv/oOH7d1Hp86UlmOPG2lsPweHUmjseEl9guLoHKJZ/nFm5qfXaJ1wuhPfl7gUMFUi4ng4KM1au8d0QN6NMoM14odF3R+20s9ZBL0uspVIySpORrtJm40NLxVqzyU4N8g6hR8GsF5wiVWLlQ4L4BcwxUGUGuJOw7DovqJPozOmE2Pjoz1duOsVAZRzc5XHTAV+5pqsNv/JcrWxc3BegZ3bcaMfP3k1DEIXpUfNl+1MSkm9ofWF0bGwyL2cRwHcingiymdgft4rS2m8TM3SoEeByaKatAdBiZx4pkEPZVI0+w5TbuUWTGdeWYXZrVZdnlVBZarcFU1gJauVJHFN9NGCiW8a6Qc8jro3OQ4bj5gf467Tfp+To9r+lbjL/P4vofSjQgjc2MLBTBbHF1fqfLKGXWXGNM5Q1J+a6O3uZ1F/fJEevl5N0w5TJ6QZyQjL8kheUdOSI9wMidfyFfyLfme/Ex+Jb+XqcnWqmaPbETy5y8XphJh</latexit><latexit sha1_base64="Mcj0ZdFzc8oxIU6DHnJRzqBGpM4=">ADO3icfZJNbxMxEIbd5aMlfKX0yMVihcQBVdkICS6VKuDAoZQikTZVHEWziSx6o+V7S2NrP0tXOFH8EM4c0NcueNcki2iJF2/ep9ZtZj7+SFM53Oj+2khs3b93e3rnTunv/oOH7d1Hp86UlmOPG2lsPweHUmjseEl9guLoHKJZ/nFm5qfXaJ1wuhPfl7gUMFUi4ng4KM1au8d0QN6NMoM14odF3R+20s9ZBL0uspVIySpORrtJm40NLxVqzyU4N8g6hR8GsF5wiVWLlQ4L4BcwxUGUGuJOw7DovqJPozOmE2Pjoz1duOsVAZRzc5XHTAV+5pqsNv/JcrWxc3BegZ3bcaMfP3k1DEIXpUfNl+1MSkm9ofWF0bGwyL2cRwHcingiymdgft4rS2m8TM3SoEeByaKatAdBiZx4pkEPZVI0+w5TbuUWTGdeWYXZrVZdnlVBZarcFU1gJauVJHFN9NGCiW8a6Qc8jro3OQ4bj5gf467Tfp+To9r+lbjL/P4vofSjQgjc2MLBTBbHF1fqfLKGXWXGNM5Q1J+a6O3uZ1F/fJEevl5N0w5TJ6QZyQjL8kheUdOSI9wMidfyFfyLfme/Ex+Jb+XqcnWqmaPbETy5y8XphJh</latexit>Negative dependence, stable polynomials etc. in ML - part 2
Suvrit Sra (suvrit@mit.edu) 12
Problem 2: Efficiently learn a “Power-DPP”, i.e., µ(S)=det(LS)
p
Problem 1: Learning parametrized classes of
Problem 3: Learn the diversity tuning parameter ‘p’ in Power-DPPs
and more generally in Exponentiated SR measures
Problem 4: Explore other learning models; e.g. Deep-DPP to learn
nonlinear features for a DPP [Gartrell, Dohmatob, 2018], or “negative mining” for reducing overfitting [Mariet, Gartrell, Sra, 2018]
Negative dependence, stable polynomials etc. in ML - part 2
Suvrit Sra (suvrit@mit.edu)
Negative dependence, stable polynomials etc. in ML - part 2
Suvrit Sra (suvrit@mit.edu)
Practical Diversified Recommendations on YouTube with Determinantal Point Processes
Mark Wilhelm, Ajith Ramanathan, Alexander Bonomo, Sagar Jain, Ed H. Chi, Jennifer Gillenwater
Google Inc. {wilhelm,ajith,bonomo,sagarj,edchi,jengi}@google.com
CIKM 2018
14
Challenges: • Handling mismatch between model’s notion of diversity
versus user’s perception of diversity (true for other applications too)
(e.g. existing pointwise recommenders vs DPP’s setwise!)
Negative dependence, stable polynomials etc. in ML - part 2
Suvrit Sra (suvrit@mit.edu) 15
(Williams & Seeger 01, Zhang et al 08, Belabbas & Wolfe 09, Gittens & Mahoney 13, Alaoui & Mahoney 15, Deshpande et al 06, Smola & Schölkopf 00, Drineas & Mahoney 05, Drineas et al 06, …)
Negative dependence, stable polynomials etc. in ML - part 2
Suvrit Sra (suvrit@mit.edu) 16
ratio of elementary symm. polynomials
Approx quality c ≥ k landmarks Expected risk kernel ridge regression
Negative dependence, stable polynomials etc. in ML - part 2
Suvrit Sra (suvrit@mit.edu) 17
ratio of elementary symm. polynomials
Approx quality c ≥ k landmarks Expected risk kernel ridge regression Theorems.
(Li, Jegelka, Sra 2016)
error time
Negative dependence, stable polynomials etc. in ML - part 2
Suvrit Sra (suvrit@mit.edu) 18
100 200 300 400 500 0.2 0.4 0.6 0.8 1 size of first hidden layer test error
random importance pruning DIVNET
(Mariet, Sra 2016)
Negative dependence, stable polynomials etc. in ML - part 2
Suvrit Sra (suvrit@mit.edu)
19
Negative dependence, stable polynomials etc. in ML - part 2
Suvrit Sra (suvrit@mit.edu) 20
p = 1/2 p = 2 p = 1
uniform distribution
[Mariet, Sra, Jegelka, 2018]
Negative dependence, stable polynomials etc. in ML - part 2
Suvrit Sra (suvrit@mit.edu) 21
img: ise.inf.eth.ch
Negative dependence, stable polynomials etc. in ML - part 2
Suvrit Sra (suvrit@mit.edu) 22
S⊆[m],|S|=k
i∈S
i
Negative dependence, stable polynomials etc. in ML - part 2
Suvrit Sra (suvrit@mit.edu) 23
S⊆[m],|S|=k
i∈S
i
(Wang, Yu, Singh, 2016)
(Bayesian A-opt: Golovin,Krause,Ray, 2013) (Chamon, Ribeiro, 2017) (Chen, Hassani, Karbasi, 2018) (Singh, Xie, 2018) …and many more
(Mariet, Sra, 2017): Φ=Elemenetary Symmetric Polynomial
(recovers A- and D-optimal case extreme cases)
Greedy algo and convex relaxation both work. Success of greedy uses “Dual” volume sampling!
Negative dependence, stable polynomials etc. in ML - part 2
Suvrit Sra (suvrit@mit.edu) 24
S )
NOT a DPP …but SR
(Avron & Boutsidis 2013): approximation bounds on Frobenius
norms for A-/E-optimal experimental design from sampling.
(Mariet, Sra, 2017)
generalize to E-Symm. Polynomials
Note: (Derezinski, Warmuth, 2017) and (Li, Jegelka, Sra, 2017) provide
Negative dependence, stable polynomials etc. in ML - part 2
Suvrit Sra (suvrit@mit.edu) 25
Uncovers a connection between geometry, optimization, and
Negative dependence, stable polynomials etc. in ML - part 2
Suvrit Sra (suvrit@mit.edu) 26
See past tutorials on submodular models in ML (various authors) Reinforcement learning (diversity based exploration) https://arxiv.org/abs/1802.04564 Fairness and diversity https://arxiv.org/abs/1610.07183 Video Summarization https://arxiv.org/abs/1807.10957 Diversified minibatches for SGD https://arxiv.org/abs/1705.00607 Diverse sampling in Bayesian optimization
(Kathuria, Deshpande, Kohli, 2016; Wang, Li, Jegelka, Kohli, 2017)
and of course, many more (see tutorial website for more…)
Negative dependence, stable polynomials etc. in ML - part 2
Suvrit Sra (suvrit@mit.edu)
Derezinski, Warmuth, Hsu. Leveraged volume sampling for linear regression Zhang, Galley, Gao, Gan, Li, Brockett, Dolan. Generating Informative and Diverse Conversational Responses via Adversarial Information Maximization (based on MI) Chen, Zhang, Zhou. Fast Greedy MAP Inference for Determinantal Point Process to Improve Recommendation Diversity Zhou, Wang, Bilmes. Diverse Ensemble Evolution: Curriculum Data-Model Marriage Hong, Shann, Su, Chang, Fu, Lee. Diversity-Driven Exploration Strategy for Deep Reinforcement Learning (adds a distance based control) Gillenwater, Kulesza, Vassilvitskii, Mariet. Maximizing Induced Cardinality Under a Determinantal Point Process
Assignment Problem Mariet, Sra, Jegelka. Exponentiated Strongly Rayleigh Distributions Djolonga, Jegelka, Krause. Provable Variational Inference for Constrained Log- Submodular Models
Negative dependence, stable polynomials etc. in ML - part 2
Suvrit Sra (suvrit@mit.edu)
Negative dependence, stable polynomials etc. in ML - part 2
Suvrit Sra (suvrit@mit.edu) 29
Strongly log-concave (SLC) polynomials — introduced by Gurvits in 2009, many properties laid out. Aim: approximate partition functions over combinatorially large sample spaces Properties further developed by Anari, Gharan, Vinzant (Oct &
Nov 2018) and used to solve: Mason’s conjecture and more!
Matroid Base Exchange Walk: Fast Mixing – so in particular, the SR property is not necessary for fast mixing. Exponentiated SR measures (Mariet, Sra, Jegelka, 2018), with an approximate mixing time analysis and few applications The ESR case 0 < α < 1 falls under the SLC framework, hence fast MCMC sampling (Anari, Liu, Gharan, Vinzant, Nov 2018)
Negative dependence, stable polynomials etc. in ML - part 2
Suvrit Sra (suvrit@mit.edu) 30
Negative dependence, stable polynomials etc. in ML - part 2
Suvrit Sra (suvrit@mit.edu) 31
Chengtao Li Zelda Mariet