Obtaining Adjustable Regularization for Free via Iterate Averaging
Jingfeng Wu, Vladimir Braverman, Lin F. Yang Johns Hopkins University & UCLA
June 2020
L(w) = \frac{1}{n} \sum_{i=1}^{n} \| w^\top x_i - y_i \|_2^2
\min_w L(w) + \lambda R(w)
p_k = (1 - p) p^k, \quad p = \frac{1}{1 + \lambda \eta}
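As a sanity check on the geometric weights p_k = (1 − p) p^k with p = 1/(1 + λη), the following NumPy sketch averages plain gradient-descent iterates on the least-squares loss and compares the result with the closed-form minimizer of L(w) + λR(w). Several choices here are assumptions not stated on the slide: R(w) = ½‖w‖², the iterates start at w_0 = 0, the index k starts at 0, and the infinite sum is truncated after `num_steps` iterations. All function names are illustrative.

```python
import numpy as np

def gd_iterates(X, y, eta, num_steps):
    """Plain (unregularized) gradient descent on L(w) = (1/n) * sum (w^T x_i - y_i)^2."""
    n, d = X.shape
    w = np.zeros(d)                      # assumption: start at w_0 = 0
    iterates = [w.copy()]
    for _ in range(num_steps):
        grad = (2.0 / n) * X.T @ (X @ w - y)
        w = w - eta * grad
        iterates.append(w.copy())
    return np.array(iterates)            # shape (num_steps + 1, d)

def geometric_average(iterates, lam, eta):
    """Weighted average with p_k = (1 - p) p^k, p = 1 / (1 + lam * eta), k = 0, 1, ..."""
    p = 1.0 / (1.0 + lam * eta)
    k = np.arange(len(iterates))
    weights = (1.0 - p) * p ** k         # truncated geometric weights
    return weights @ iterates

def ridge_solution(X, y, lam):
    """Closed-form minimizer of L(w) + (lam / 2) * ||w||^2 (assumed regularizer)."""
    n, d = X.shape
    H = (2.0 / n) * X.T @ X              # Hessian of L
    b = (2.0 / n) * X.T @ y
    return np.linalg.solve(H + lam * np.eye(d), b)

rng = np.random.default_rng(0)
X = rng.normal(size=(50, 5))
y = X @ rng.normal(size=5) + 0.1 * rng.normal(size=50)

# Step size 1 / (largest Hessian eigenvalue) keeps gradient descent stable.
eta = 1.0 / np.linalg.eigvalsh((2.0 / 50) * X.T @ X).max()
iterates = gd_iterates(X, y, eta, num_steps=3000)

# One training run, several regularization strengths chosen afterwards.
for lam in (0.1, 1.0):
    w_avg = geometric_average(iterates, lam, eta)
    w_ridge = ridge_solution(X, y, lam)
    print(lam, np.max(np.abs(w_avg - w_ridge)))
```

The same stored iterates are reused for both values of λ, which is the sense in which the regularization can be adjusted after training. Under the assumptions above, the printed gaps should sit at the level of floating-point error.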
p_k = \frac{\gamma}{\eta} \cdot \frac{\sqrt{\gamma(\alpha + \lambda)} - \sqrt{\eta\alpha}}{1 - \sqrt{\eta\alpha}} \left( \frac{1 - \sqrt{\gamma(\alpha + \lambda)}}{1 - \sqrt{\eta\alpha}} \right)^{k-2}
where
\gamma = \frac{\eta}{1 + \lambda \eta}
\sum_{k=1}^{\infty} (\cdots)
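The step size γ = η/(1 + λη) can be illustrated in a simpler setting than the accelerated weights above. The sketch below uses plain gradient descent on a quadratic loss, with the assumed regularizer R(w) = ½‖w‖² and both runs started at w_0 = 0. Under those assumptions, the k-th iterate of regularized gradient descent with step γ coincides with a finite weighted average of the unregularized iterates run with step η: weights (1 − p) p^i for i < k, and the remaining mass p^k on the last iterate, where p = 1/(1 + λη). The helper `run_gd` and all variable names are hypothetical.

```python
import numpy as np

def run_gd(grad_fn, step, num_steps, dim):
    """Gradient descent from w_0 = 0; returns all iterates, shape (num_steps + 1, dim)."""
    w = np.zeros(dim)
    path = [w.copy()]
    for _ in range(num_steps):
        w = w - step * grad_fn(w)
        path.append(w.copy())
    return np.array(path)

rng = np.random.default_rng(1)
n, d, lam, num_steps = 40, 4, 0.3, 25
X = rng.normal(size=(n, d))
y = rng.normal(size=n)

H = (2.0 / n) * X.T @ X                  # Hessian of the least-squares loss
b = (2.0 / n) * X.T @ y
eta = 0.5 / np.linalg.eigvalsh(H).max()  # step size for the unregularized run
gamma = eta / (1.0 + lam * eta)          # step size for the regularized run
p = 1.0 / (1.0 + lam * eta)

plain = run_gd(lambda w: H @ w - b, eta, num_steps, d)
regularized = run_gd(lambda w: H @ w - b + lam * w, gamma, num_steps, d)

k = num_steps
weights = (1.0 - p) * p ** np.arange(k + 1)  # geometric weights for i = 0, ..., k
weights[k] = p ** k                          # last iterate takes the leftover tail mass
averaged = weights @ plain

gap = np.max(np.abs(averaged - regularized[k]))
print(gap)
```

The identity rests on the relation I − γ(H + λI) = p(I − ηH), which holds for any quadratic loss with Hessian H. It is an illustration for gradient descent only and does not reproduce the accelerated scheme.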
Dataset                         CIFAR-10        CIFAR-10        CIFAR-100
Model                           VGG-16          ResNet-18       ResNet-18
Accuracy after training (%)     92.54 ± 0.22    94.54 ± 0.04    75.62 ± 0.16
Accuracy after averaging (%)    93.18 ± 0.06    94.72 ± 0.04    76.24 ± 0.05
Time of training                ~4.5 h          ~8.3 h          ~8.3 h
Time of averaging               ~47 s           ~56 s           ~58 s