CS535 Big Data 3/25/2020 Week 8-B Sangmi Lee Pallickara http://www.cs.colostate.edu/~cs535 Spring 2020 Colorado State University, page 1
CS535 BIG DATA
PART B. GEAR SESSIONS
SESSION 2: MACHINE LEARNING FOR BIG DATA
Sangmi Lee Pallickara Computer Science, Colorado State University http://www.cs.colostate.edu/~cs535
FAQs
- CS535 Online
- Please read announcement on Canvas
- If you have any questions, please post on Piazza
CS535 Big Data | Computer Science | Colorado State University
Topics of Todays Class
- Distributed PyTorch
- Some common advanced optimizations
- You will use it for your term project
- Automatic Differentiation with Backpropagation
- Computation Graph
- Distributed PyTorch Application
CS535 Big Data | Computer Science | Colorado State University
GEAR Session 2. Machine Learning for Big Data
Lecture 4. Distributed Neural Networks-PyTorch
PyTorch: Introduction
CS535 Big Data | Computer Science | Colorado State University
This material is built based on
- Paszke, A., Gross, S., Massa, F., Lerer, A., Bradbury, J., Chanan, G., Killeen, T., Lin, Z.,
Gimelshein, N., Antiga, L. and Desmaison, A., 2019. PyTorch: An imperative style, high- performance deep learning library. In Advances in Neural Information Processing Systems (pp. 8024-8035).
- Baydin, A.G., Pearlmutter, B.A., Radul, A.A. and Siskind, J.M., 2017. Automatic
differentiation in machine learning: a survey. The Journal of Machine Learning Research, 18(1), pp.5595-5637.
- Writing Distributed Applications with PyTorch,
https://pytorch.org/tutorials/intermediate/dist_tuto.html
- PyTorch vs TensorFlow — spotting the difference,
https://towardsdatascience.com/pytorch-vs-tensorflow-spotting-the-difference- 25c75777377b
CS535 Big Data | Computer Science | Colorado State University
Observations
- Array-based programming
- Multidimensional arrays (A.K.A. tensors) became critical mathematical data type
- Automatic differentiation enabled fully automated computing of derivatives
- Open-source Python ecosystem for numerical analysis
- NumPy, SciPy and Pandas
- Availability and commoditization of general-purpose massively parallel hardware
- GPUs
- Specialized libraries, cuDNN
- Caffe, Torch7, TensorFlow take advantage of these hardware accelerators
CS535 Big Data | Computer Science | Colorado State University