Neural Network Training: Old & New Tricks

SLIDE 1

Neural Network Training: Old & New Tricks
  • Old (1980s): Stochastic Gradient Descent, Momentum, “weight decay”
  • New (last 5-6 years): Dropout, ReLUs, Batch Normalization

SLIDE 2

Reminder: Overfitting, in images
  • Figures: classification and regression fits, from underfitting through “just right” to overfitting

SLIDES 3-4
Dropout
  • Each sample is processed by a ‘decimated’ neural net
  • Decimated nets: distinct classifiers
  • But: they should all do the same job
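To make the ‘decimation’ concrete, here is a minimal inverted-dropout sketch (plain NumPy; keep_prob=0.5 and the helper name are illustrative choices, not from the slides):

    import numpy as np

    def dropout(h, keep_prob=0.5, train=True):
        # Randomly 'decimate' a layer: each unit survives with probability keep_prob
        if not train:
            return h                        # at test time the full net is used
        mask = np.random.rand(*h.shape) < keep_prob
        return h * mask / keep_prob         # rescale so the expected activation is unchanged

    h = np.ones(10)
    print(dropout(h))  # e.g. [2. 0. 2. 0. 0. 2. 2. 0. 2. 0.]

Each minibatch sample sees a different mask, i.e. a different decimated net, and all of them are trained to do the same job.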

SLIDE 5

Dropout Performance

SLIDE 6
Neural Network Training: Old & New Tricks (agenda recap; next up: ReLUs)

SLIDE 7
‘Neuron’: Cascade of Linear and Nonlinear Function
  • Nonlinearity: sigmoidal (“logistic”) or Rectified Linear Unit (ReLU)

SLIDES 8-12
Reminder: a network in backward mode
  • Outputs sit at the top; during backpropagation the gradient signal arrives from above
  • Each sigmoid unit scales the gradient by a factor < 1 (in fact at most 0.25, the maximum of the sigmoid’s derivative)

SLIDES 13-14
Vanishing Gradients Problem
  • The gradient signal from above is scaled by < 1 (at most 0.25) at every sigmoid unit
  • Do this 10 times: the updates in the first layers become minimal
  • The top layer knows what to do, but the lower layers “don’t get it”
  • Sigmoidal unit: the signal is not getting through!
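A minimal numerical sketch of the effect (plain NumPy; the 10-layer chain with unit weights is an illustrative assumption): each sigmoid multiplies the backward signal by its derivative, which is at most 0.25, so ten layers shrink it by up to 0.25^10 ≈ 1e-6.

    import numpy as np

    def sigmoid(x):
        return 1.0 / (1.0 + np.exp(-x))

    x, grad = 0.5, 1.0                 # activation and gradient "from above"
    for layer in range(10):
        a = x                          # pre-activation (all weights = 1 here)
        x = sigmoid(a)
        grad *= sigmoid(a) * (1.0 - sigmoid(a))   # sigmoid derivative <= 0.25
        print(f"layer {layer + 1}: gradient factor {grad:.2e}")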

SLIDE 15
Vanishing Gradients Problem: ReLU Solves It
  • Gradient signal from above
  • Scaling is in {0, 1}: a ReLU either passes the gradient through unchanged or blocks it entirely

SLIDES 16-17
Activation Functions: ReLU & Co
  • Great! But… no gradient for the negative half-space
  • Lots of follow-up work: LeakyReLU, ELU, etc.
  • These can improve results, but typically only as fine-tuning
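For reference, a small sketch of the gradients of these activations (plain NumPy; the slopes a=0.01 for LeakyReLU and a=1.0 for ELU are the usual defaults, chosen here for illustration):

    import numpy as np

    def relu_grad(x):
        return (x > 0).astype(float)                 # gradient in {0, 1}

    def leaky_relu_grad(x, a=0.01):
        return np.where(x > 0, 1.0, a)               # small gradient for x < 0

    def elu_grad(x, a=1.0):
        return np.where(x > 0, 1.0, a * np.exp(x))   # smooth negative half-space

    x = np.array([-2.0, -0.5, 0.5, 2.0])
    print(relu_grad(x))        # [0. 0. 1. 1.] -> no signal for negative inputs
    print(leaky_relu_grad(x))  # [0.01 0.01 1. 1.]
    print(elu_grad(x))         # [0.14 0.61 1. 1.] (rounded)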

SLIDE 18
Neural Network Training: Old & New Tricks (agenda recap; next up: Batch Normalization)

SLIDE 19
External Covariate Shift: Your Input Changes
  • Example: the same scene photographed at 10 am, 2 pm, and 7 pm (figures)

SLIDES 20-24
“Whitening”: Set Mean = 0, Variance = 1
  • Photometric transformation: I → a·I + b
  • Make each patch have zero mean: x ← x − mean(x)
  • Then make it have unit variance: x ← x / std(x)
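A sketch of the two steps (NumPy; the eps guard is an implementation detail, not from the slides). Whitening exactly removes a photometric transformation I → a·I + b (for a > 0):

    import numpy as np

    def whiten(patch, eps=1e-8):
        patch = patch - patch.mean()           # make the patch have zero mean
        return patch / (patch.std() + eps)     # then make it have unit variance

    rng = np.random.default_rng(0)
    I = rng.random((10, 10))
    assert np.allclose(whiten(I), whiten(2.0 * I + 0.3), atol=1e-6)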

SLIDES 25-26
Batch Normalization
  • Whiten-as-you-go: normalize each layer’s activations over the current mini-batch
  • Used in all current systems
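A minimal forward-pass sketch of “whiten-as-you-go” (NumPy; the learned scale gamma, shift beta, and eps follow the standard batch-norm formulation of Ioffe & Szegedy, not these slides):

    import numpy as np

    def batch_norm_forward(x, gamma, beta, eps=1e-5):
        # x: (batch, features); whiten over the batch dimension
        mu, var = x.mean(axis=0), x.var(axis=0)
        x_hat = (x - mu) / np.sqrt(var + eps)   # mean 0, variance 1
        return gamma * x_hat + beta             # learned scale/shift

    x = np.random.randn(32, 4) * 5.0 + 2.0      # badly scaled activations
    y = batch_norm_forward(x, np.ones(4), np.zeros(4))
    print(y.mean(axis=0).round(6), y.std(axis=0).round(3))  # ~0 and ~1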

slide-27
SLIDE 27

Convolutional Neural Networks

16

SLIDE 28
Fully-connected Layer
  • Example: 200x200 image, 40K hidden units → ~1.6B parameters!!!
  • Spatial correlation is local
  • Waste of resources
  • We don’t have enough training samples anyway…

SLIDES 29-30
Locally-connected Layer
  • Example: 200x200 image, 40K hidden units, filter size 10x10 → 4M parameters
  • Note: this parameterization is good when the input image is registered (e.g., face recognition)

SLIDE 31
Convolutional Layer
  • Share the same parameters across different locations (assuming the input is stationary): convolutions with learned kernels

SLIDES 32-47
Convolutional Layer (animation frames: the learned kernel slides over the input)

SLIDE 48
Fully-connected layer
  • # of parameters: K² (every output unit connects to every input unit)

SLIDE 49
Convolutional layer
  • # of parameters: the size of the filter window

SLIDE 50
Convolutional layer
  • Example: convolving the image with a 3x3 vertical-edge kernel:

        1  0  −1
        1  0  −1
        1  0  −1

    image * kernel = edge map

SLIDE 51
Learning an edge filter

SLIDE 52
Convolutional layer
  • Learn multiple filters
  • E.g.: 200x200 image, 100 filters, filter size 10x10 → 10K parameters
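A quick check of these parameter counts (PyTorch for illustration; a single-channel input and bias-free layers are assumed):

    import torch.nn as nn

    # 100 filters of size 10x10 over a 1-channel 200x200 image
    conv = nn.Conv2d(in_channels=1, out_channels=100, kernel_size=10, bias=False)
    print(sum(p.numel() for p in conv.parameters()))  # 100 * 1 * 10 * 10 = 10000

    # A fully-connected layer from 200x200 inputs to 40K hidden units would need:
    print(200 * 200 * 40_000)                         # 1,600,000,000 (~1.6B) weights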

SLIDES 53-56
Convolutional layer with ReLU activation
  • Each output feature map is computed from all input feature maps:

    h_j^n = \max\left(0,\; \sum_k h_k^{n-1} * w_{jk}^n\right)

  • h_k^{n−1}: input feature maps; h_j^n: output feature maps; w_{jk}^n: learned kernels; *: convolution; max(0, ·): the ReLU activation
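A direct transcription of the formula (NumPy/SciPy; correlate2d plays the role of the learned convolution, and all sizes are illustrative):

    import numpy as np
    from scipy.signal import correlate2d

    def conv_relu_layer(h_prev, w):
        # h_prev: (K, H, W) input maps; w: (J, K, kh, kw) kernels -> (J, H', W')
        J, K = w.shape[0], w.shape[1]
        out = [np.maximum(0.0, sum(correlate2d(h_prev[k], w[j, k], mode="valid")
                                   for k in range(K)))     # sum over input maps, then ReLU
               for j in range(J)]
        return np.stack(out)

    h = np.random.randn(3, 8, 8)        # 3 input feature maps
    w = np.random.randn(2, 3, 3, 3)     # 2 output maps, 3x3 kernels
    print(conv_relu_layer(h, w).shape)  # (2, 6, 6)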

SLIDES 57-60
De-convolutional layer with ReLU activation
  • There is no real inverse, but convolutions can easily go the other way
  • “De-convolution” or “transposed convolution”: also a convolution, with the transposed weight tensor
  • The layer formula above still holds, same structure
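A small shape experiment (PyTorch; the 2x2/stride-2 sizes are illustrative) showing that the transposed convolution “goes the other way” spatially:

    import torch
    import torch.nn as nn

    x = torch.randn(1, 3, 8, 8)                             # (batch, C, H, W)
    down = nn.Conv2d(3, 8, kernel_size=2, stride=2)         # 8x8 -> 4x4
    up = nn.ConvTranspose2d(8, 3, kernel_size=2, stride=2)  # 4x4 -> 8x8

    y = down(x)
    print(y.shape)      # torch.Size([1, 8, 4, 4])
    print(up(y).shape)  # torch.Size([1, 3, 8, 8]): spatial size restored, values not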

SLIDES 61-64
Pooling layer
  • Pooling layer: receptive field size (figures)

SLIDES 65-73
Receptive field
  • Animation over layers 1-8: the receptive field grows with every layer

SLIDE 74
Modern Architectures

SLIDES 75-77
CNNs, late 1980s: LeNet
  • INPUT 32x32 → C1: feature maps 6@28x28 (convolutions) → S2: f. maps 6@14x14 (subsampling) → C3: f. maps 16@10x10 (convolutions) → S4: f. maps 16@5x5 (subsampling) → C5: layer 120 (full connection) → F6: layer 84 (full connection) → OUTPUT 10 (Gaussian connections)
  • Gradient-based learning applied to document recognition, Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, 1998.
  • https://www.youtube.com/watch?v=FwFduRA_L6Q

SLIDES 78-80
What happened in between?
  • deep learning = neural networks (+ big data + GPUs) + a few more recent tricks!

SLIDE 81
CNNs, 2012: AlexNet
  • Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton: ImageNet classification with deep convolutional neural networks. Commun. ACM 60(6): 84-90 (2017)

SLIDE 82
CNNs, 2014: VGG
  • Karen Simonyan, Andrew Zisserman (= Visual Geometry Group): Very Deep Convolutional Networks for Large-Scale Image Recognition, arXiv, 2014.

SLIDE 83
CNNs, 2015: ResNet
  • Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun: Deep Residual Learning for Image Recognition, CVPR 2016.

SLIDE 84
Going Deeper: The Deeper, the Better
  • Deeper networks can cover more complex problems
  • Increasingly large receptive field size & rich patterns

SLIDE 85
Going Deeper
  • From ~20 layers to 100/1000
  • Residual networks

SLIDE 86
Residual Network
  • Naïve solution: if the extra layers are an identity mapping, the training error cannot increase

SLIDE 87
Residual Modelling: Basic idea in image processing
  • Goal: estimate the update between an original image and a changed image
  • Some network estimates only the residual; by preserving the base information, it can treat just the perturbation

SLIDE 88
Residual Network
  • Plain block: difficult to make an identity mapping because of the multiple non-linear layers

SLIDE 89
Residual Network
  • Residual block: y = F(x) + x
  • If identity were optimal, it is easy to set the weights to 0
  • If the optimal mapping is close to identity, it is easier to find the small fluctuations
  • Appropriate for treating a perturbation while keeping the base information
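A minimal residual block sketch (PyTorch; the two-conv body and channel count follow the common ResNet pattern, details are illustrative): if the weights of the body go to 0, the block reduces to (roughly) the identity.

    import torch
    import torch.nn as nn

    class ResidualBlock(nn.Module):
        # y = x + F(x): the block only has to learn the residual F
        def __init__(self, channels):
            super().__init__()
            self.f = nn.Sequential(
                nn.Conv2d(channels, channels, 3, padding=1),
                nn.ReLU(),
                nn.Conv2d(channels, channels, 3, padding=1),
            )

        def forward(self, x):
            return torch.relu(x + self.f(x))   # identity shortcut + residual

    x = torch.randn(1, 16, 32, 32)
    print(ResidualBlock(16)(x).shape)          # torch.Size([1, 16, 32, 32])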

SLIDES 90-91
Residual Network: Deeper is better
  • Deeper ResNets have lower training error

SLIDE 92
CNNs, 2017: DenseNet
  • Densely Connected Convolutional Networks, CVPR 2017. Gao Huang, Zhuang Liu, Laurens van der Maaten, Kilian Q. Weinberger
  • Recently proposed; better performance/parameter ratio

SLIDE 93
Image-to-Image

SLIDE 94
Graphics: Multiresolution

SLIDE 95
Image-to-image
  • So far we mapped an image to a number or label
  • In graphics, the output is often “richer”:
    • An image
    • A volume
    • A 3D mesh
  • Note: “image” is just a placeholder name here for any Eulerian data
  • Architectures:
    • Fully convolutional
    • Encoder-decoder
    • Skip connections

SLIDES 96-101
FCNN: Fully-convolutional Neural Networks
  • Flexible: works with varying input sizes
  • Typically reduces the input by a fixed factor
  • In practice: e.g. 32-fold decimation, 224x224 down to 7x7

SLIDE 102
Encoder-Decoder
  • Space → Features → Space (figure)

SLIDE 103
Interpretation
  • Encoder: turns a data set (e.g. an image) into a vector
  • This vector is a very compact and abstract “code”
  • It lives in the “latent space” of the neural network
  • Decoder: turns the code back into an image

SLIDE 104
Encoder-decoder + Skip connections
  • 1st: reduce resolution as before
  • 2nd: increase resolution again (transposed convolutions)
  • Skip connections preserve information
  • But the network cannot be split into encoder and decoder anymore
  • U-Net: Convolutional Networks for Biomedical Image Segmentation, Ronneberger et al., 2015 (a toy sketch follows below)
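A toy encoder-decoder with one skip connection (PyTorch; far smaller than the real U-Net, all channel counts illustrative):

    import torch
    import torch.nn as nn

    class TinyUNet(nn.Module):
        def __init__(self):
            super().__init__()
            self.enc = nn.Conv2d(1, 8, 3, stride=2, padding=1)  # halve resolution
            self.dec = nn.ConvTranspose2d(8, 8, 2, stride=2)    # double it again
            self.out = nn.Conv2d(8 + 1, 1, 3, padding=1)        # after the skip

        def forward(self, x):
            z = torch.relu(self.enc(x))                  # encoder: space -> features
            up = torch.relu(self.dec(z))                 # decoder: features -> space
            return self.out(torch.cat([up, x], dim=1))   # skip connection

    x = torch.randn(1, 1, 64, 64)
    print(TinyUNet()(x).shape)  # torch.Size([1, 1, 64, 64])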

SLIDE 105
SIGGRAPH Asia Course CreativeAI: Deep Learning for Graphics
http://geometry.cs.ucl.ac.uk/creativeai/
Thank you!

SLIDE 106
Recurrent Neural Networks

SLIDE 107
Recurrent Neural Networks
  • Time-dependent problems: repeated evaluations with an internal “state”
  • The state x_t at time t depends on the previous times
  • Recurrent Neural Networks (RNNs)
  • Specialized back-prop is possible: back-propagation through time (BPTT)
  • Unrolled: the recurrence expands into a feed-forward chain, one copy per time step
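A minimal unrolled evaluation of a vanilla RNN cell (NumPy; the tanh cell and all sizes are illustrative):

    import numpy as np

    rng = np.random.default_rng(0)
    n_in, n_state = 4, 8
    W_x = 0.1 * rng.normal(size=(n_state, n_in))     # input -> state
    W_h = 0.1 * rng.normal(size=(n_state, n_state))  # state -> state (the recurrence)

    h = np.zeros(n_state)                            # internal state
    for x_t in rng.normal(size=(10, n_in)):          # 10 time steps, unrolled
        h = np.tanh(W_x @ x_t + W_h @ h)             # state depends on previous times
    print(h.shape)  # (8,)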

SLIDES 108-114
Common Building Block: LSTM Units
  • Long short-term memory (LSTM) networks
  • Three gates: input, output, forget
  • Input and candidate-state paths: standard neural-network layers
  • Cell: the history, i.e. the stored data
  • Input gate: weighs new vs. stored data
  • Forget gate: forgets stored data
  • Output gate: controls the amount of data output

SLIDE 115
Common Building Block: LSTM Units
  • Long short-term memory (LSTM) networks
  • In equation form (see below)
  • [Sutskever et al., “Sequence to Sequence Learning with Neural Networks”, 2014]
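The equations themselves were a figure on the slide; the standard formulation (as in e.g. Sutskever et al. 2014, with \sigma the logistic sigmoid and \odot elementwise multiplication) reads:

    i_t = \sigma(W_{xi} x_t + W_{hi} h_{t-1} + b_i)    % input gate: weigh new vs. stored
    f_t = \sigma(W_{xf} x_t + W_{hf} h_{t-1} + b_f)    % forget gate: forget stored data
    o_t = \sigma(W_{xo} x_t + W_{ho} h_{t-1} + b_o)    % output gate: control output
    c_t = f_t \odot c_{t-1} + i_t \odot \tanh(W_{xc} x_t + W_{hc} h_{t-1} + b_c)
    h_t = o_t \odot \tanh(c_t)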

SLIDE 116
Recurrent Neural Networks
  • LSTM networks are a powerful tool for sequences over time
  • Alternatives:
    • Gated Recurrent Units (GRUs) [Chung et al., “Empirical evaluation of gated recurrent neural networks on sequence modeling”, 2014]
    • Temporal convolutional networks (TCNs) [Bai et al., “An empirical evaluation of generic convolutional and recurrent networks for sequence modeling”, 2018]

SLIDE 117
Deep Learning Frameworks

SLIDE 118
  • Main frameworks (logos on the slide): PyTorch (Python), Tensorflow (Python, C++, Java), Caffe (C++, Python, Matlab), Keras (Python; backends support other languages)
  • Currently less frequently used: CNTK (Python, C++, C#), MXNet (Python, C++, and others), MatConvNet (Matlab), Deeplearning4j (Python, Java, Scala), Chainer (Python), Caffe2 (Python, C++), Theano (Python)

SLIDE 119
Popularity
  • Google Trends charts for the search terms “[name] tutorial” and “[name] github”

SLIDE 120
Typical Training Steps

    for i = 1 .. max_iterations
        input, ground_truth = load_minibatch(data, i)
        output = network_evaluate(input, parameters)
        loss = compute_loss(output, ground_truth)
        # gradients of loss with respect to parameters
        gradients = network_backpropagate(loss, parameters)
        parameters = optimizer_step(parameters, gradients)
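The same loop in runnable form (PyTorch; the toy linear model and synthetic minibatch stand in for the real network and load_minibatch):

    import torch
    import torch.nn as nn

    model = nn.Linear(4, 1)                      # stand-in for the network
    optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
    loss_fn = nn.MSELoss()

    for i in range(100):                         # max_iterations
        inp = torch.randn(8, 4)                  # load_minibatch(data, i)
        ground_truth = inp.sum(dim=1, keepdim=True)
        output = model(inp)                      # network_evaluate
        loss = loss_fn(output, ground_truth)     # compute_loss
        optimizer.zero_grad()
        loss.backward()                          # network_backpropagate
        optimizer.step()                         # optimizer_step
    print(loss.item())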

SLIDE 121
Tensors
  • Frameworks typically represent data as tensors
  • Examples:
    • 4D input data: B x C x H x W (batches B, feature channels C, spatial height H, spatial width W)
    • 4D convolution kernel: OC x IC x KH x KW (output channels OC, input channels IC, kernel height KH, kernel width KW)
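These layouts can be checked directly (PyTorch, which uses exactly this B x C x H x W convention; the sizes are arbitrary):

    import torch
    import torch.nn as nn

    x = torch.randn(16, 3, 32, 32)  # B x C x H x W
    conv = nn.Conv2d(in_channels=3, out_channels=8, kernel_size=5)
    print(conv.weight.shape)        # OC x IC x KH x KW: torch.Size([8, 3, 5, 5])
    print(conv(x).shape)            # torch.Size([16, 8, 28, 28])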

SLIDE 122
What Does a Deep Learning Framework Do?
  • Tensor math
  • Common network operations/layers
  • Gradients of common operations
  • Backpropagation
  • Optimizers
  • GPU implementations of the above
  • Usually: data loading, network parameter saving/loading
  • Sometimes: distributed computing
SLIDE 123
Automatic Differentiation & the Computation Graph

    parameters = (weight, bias)
    output = σ(weight * input + bias)
    loss = (output - ground_truth)^2
    # gradients of loss with respect to parameters
    gradients = backpropagate(loss, parameters)

  • Forward pass (figure): weight and input feed *, bias feeds +, σ gives output; output and ground_truth feed the squared difference, yielding loss
  • Backward pass (figure): ∂loss/∂output flows back through the same graph to ∂loss/∂weight and ∂loss/∂bias
  • Since loss is a scalar, the gradients are the same size as the parameters
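The same example with a real autodiff engine (PyTorch autograd; torch.sigmoid plays the role of σ):

    import torch

    weight = torch.tensor(0.5, requires_grad=True)
    bias = torch.tensor(0.1, requires_grad=True)
    inp, ground_truth = torch.tensor(2.0), torch.tensor(1.0)

    output = torch.sigmoid(weight * inp + bias)  # forward pass builds the graph
    loss = (output - ground_truth) ** 2
    loss.backward()                              # backward pass fills .grad

    print(weight.grad, bias.grad)  # same size as the parameters (scalars here)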

SLIDE 124
Automatic Differentiation & the Computation Graph
  • Each node g of the graph provides a forward and a backward function:

    outputs = forward(inputs, parameters)
    ∂loss/∂inputs, ∂loss/∂parameters = backward(∂loss/∂outputs)

SLIDE 125
Static vs Dynamic Computation Graphs
  • Static analysis allows optimizations and distributing the workload
  • Dynamic graphs make data-driven control flow easier
  • In static graphs, the graph is usually defined in a separate ‘language’
  • Static graphs have less support for debugging

  • Static: define the graph once, evaluate it during training

    x = Variable()
    loss = if_node(x < parameter[0], x + parameter[0], x - parameter[1])
    for i = 1 .. max_iterations
        x = data()
        run(loss)
        backpropagate(loss, parameters)

  • Dynamic: the graph is defined implicitly by running operations; a new graph is created in each evaluation

    for i = 1 .. max_iterations
        x = data()
        if x < parameter[0]
            loss = x + parameter[0]
        else
            loss = x - parameter[1]
        backpropagate(loss, parameters)

SLIDE 126
Tensorflow
  • Currently the largest community
  • Static graphs (dynamic graphs are in development: Eager Execution)
  • Good support for deployment
  • Good support for distributed computing
  • Typically slower than the other three main frameworks on a single GPU

SLIDE 127
PyTorch
  • Fast-growing community
  • Dynamic graphs
  • Distributed computing is in development (some support is already available)
  • Intuitive code; easy to debug, and good for experimenting with less traditional architectures thanks to dynamic graphs
  • Very fast
SLIDE 128
Keras
  • A high-level interface for various backends (Tensorflow, CNTK, Theano)
  • Intuitive high-level code
  • Focus on optimizing the time from idea to code
  • Static graphs
SLIDE 129
Caffe
  • Created earlier than Tensorflow, PyTorch, or Keras
  • Less flexible and less general than the other three frameworks
  • Static graphs
  • Legacy: to be replaced by Caffe2, whose focus is on performance and deployment
  • Facebook’s platform for Detectron (Mask-RCNN, DensePose, …)
SLIDE 130
Converting Between Frameworks
  • Example: develop in one framework, deploy in another
  • Currently: a large range of converters, but no clear standard
  • Standardized model formats are in development
  • (Table: pairwise converters between tensorflow, pytorch, keras, caffe, caffe2, CNTK, chainer, and mxnet, using tools such as MMdnn, ONNX, nn_tools, pytorch2keras, and caffe-tensorflow; many pairs have no converter)
  • From https://github.com/ysh329/deep-learning-model-convertor
SLIDE 131
MMdnn
  • Standard format for models
  • Native support in development for Pytorch, Caffe2, Chainer, CNTK, and MxNet
  • Converter in development for Tensorflow
  • Converters available for several frameworks
  • Common intermediate representation, but no clear standard

SLIDE 132
SIGGRAPH Asia Course CreativeAI: Deep Learning for Graphics
http://geometry.cs.ucl.ac.uk/creativeai/
Thank you!