On the Noisy Gradient Descent that Generalizes as SGD
Jingfeng Wu, Wenqing Hu, Haoyi Xiong, Jun Huan, Vladimir Braverman, Zhanxing Zhu
Johns Hopkins University, Missouri University of Science and Technology, Baidu Research, Styling AI, Peking University