AutoDiff: Reverse Mode v 0 v 5 v 2 v 3 ln x 1 v 0 v 5 v 2 v 1 - PowerPoint PPT Presentation

x 1 ¯ ¯ ¯ AutoDiff: Reverse Mode v 0 v 5 v 2 ¯ v 3 ln x 1 v 0 v 5 v 2 ¯ v 1 ¯ + x 2 v 4 ¯ y v 6 + v 3 − v 1 Traverse the original graph in the reverse topological v 4 x 2 y v 6 sin order and for each node in the original graph Forwar ard Eval valuat ation Trace: introduce an ad adjo join int node node , which computes f (2 , 5) derivative of the output with respect to the local node (using Chain rule): 2 v 0 = x 1 v 1 = x 2 5 ln(2) = 0.693 v 2 = ln( v 0 ) 2 x 5 = 10 v 3 = v 0 · v 1 sin(5) = -0.959 v 4 = sin ( v 1 ) v 5 = v 2 + v 3 0.693 + 10 = 10.693 v 6 = v 5 − v 4 10.693 + 0.959 = 11.652 "local cal" derivative y = v 6 11.652 104

AutoDiff: Reverse Mode ln x 1 v 0 v 5 v 2 + + v 3 Backwards Derivative Backw ackwar ards Derivat vative ve Trace: − v 1 v 4 x 2 y v 6 sin ∂ v 3 ∂ v 2 1 v 0 = ¯ ¯ + ¯ = ¯ v 3 v 1 + ¯ 5.5 v 3 v 2 v 2 Forwar ard Eval valuat ation Trace: ∂ v 0 ∂ v 0 v 0 ∂ v 1 ∂ v 0 ∂ v 3 ∂ v 4 f (2 , 5) = ¯ v 3 v 0 + ¯ v 4 cos ( v 1 ) 1.716 v 1 = ¯ ¯ 1 = ¯ + ¯ v 3 v 4 ∂ v 1 ∂ v 1 ∂ v 1 ∂ v 1 ∂ v ∂ v 5 2 v 0 = x 1 = ¯ v 5 · (1) 1x1 = 1 ¯ v 2 = ¯ v 5 ∂ v 2 v 1 = x 2 5 ∂ v 2 ∂ v 5 ln(2) = 0.693 v 2 = ln( v 0 ) = ¯ v 5 · (1) 1x1 = 1 v 3 = ¯ ¯ v 5 ∂ v 3 ∂ v 3 2 x 5 = 10 v 3 = v 0 · v 1 ∂ v 6 = ¯ v 6 · ( − 1) sin(5) = -0.959 v 4 = sin ( v 1 ) 1x-1 = -1 ¯ v 4 = ¯ v 6 ∂ v 4 ∂ v 4 v 5 = v 2 + v 3 0.693 + 10 = 10.693 ∂ v 6 = ¯ v 6 · 1 ¯ v 5 = ¯ 1x1 = 1 v 6 v 6 = v 5 − v 4 10.693 + 0.959 = 11.652 ∂ v 5 ∂ v 5 ∂ y v 6 = ∂ y y = v 6 11.652 1 ¯ ∂ v 6 105

AutoDiff: Reverse Mode ln x 1 v 0 v 5 v 2 + + v 3 Backwards Derivative Backw ackwar ards Derivat vative ve Trace: − v 1 v 4 x 2 y v 6 sin ∂ v 3 ∂ v 2 1 v 0 = ¯ ¯ + ¯ = ¯ v 3 v 1 + ¯ 5.5 v 3 v 2 v 2 Forwar ard Eval valuat ation Trace: ∂ v 0 ∂ v 0 v 0 ∂ v 1 ∂ v 0 ∂ v 3 ∂ v 4 f (2 , 5) = ¯ v 3 v 0 + ¯ v 4 cos ( v 1 ) 1.716 v 1 = ¯ ¯ 1 = ¯ + ¯ v 3 v 4 ∂ v 1 ∂ v 1 ∂ v 1 ∂ v 1 ∂ v ∂ v 5 2 v 0 = x 1 = ¯ v 5 · (1) 1x1 = 1 ¯ v 2 = ¯ v 5 ∂ v 2 v 1 = x 2 5 ∂ v 2 ∂ v 5 ln(2) = 0.693 v 2 = ln( v 0 ) = ¯ v 5 · (1) 1x1 = 1 v 3 = ¯ ¯ v 5 ∂ v 3 ∂ v 3 2 x 5 = 10 v 3 = v 0 · v 1 ∂ v 6 = ¯ v 6 · ( − 1) sin(5) = -0.959 v 4 = sin ( v 1 ) 1x-1 = -1 ¯ v 4 = ¯ v 6 ∂ v 4 ∂ v 4 v 5 = v 2 + v 3 0.693 + 10 = 10.693 ∂ v 6 = ¯ v 6 · 1 ¯ v 5 = ¯ 1x1 = 1 v 6 v 6 = v 5 − v 4 10.693 + 0.959 = 11.652 ∂ v 5 ∂ v 5 ∂ y v 6 = ∂ y y = v 6 11.652 1 ¯ ∂ v 6 106

AutoDiff: Reverse Mode ln x 1 v 0 v 5 v 2 + + v 3 Backwards Derivative Backw ackwar ards Derivat vative ve Trace: − v 1 v 4 x 2 y v 6 sin ∂ v 3 ∂ v 2 1 v 0 = ¯ ¯ + ¯ = ¯ v 3 v 1 + ¯ 5.5 v 3 v 2 v 2 Forwar ard Eval valuat ation Trace: ∂ v 0 ∂ v 0 v 0 ∂ v 1 ∂ v 0 ∂ v 3 ∂ v 4 f (2 , 5) = ¯ v 3 v 0 + ¯ v 4 cos ( v 1 ) 1.716 v 1 = ¯ ¯ 1 = ¯ + ¯ v 3 v 4 ∂ v 1 ∂ v 1 ∂ v 1 ∂ v 1 ∂ v ∂ v 5 2 v 0 = x 1 = ¯ v 5 · (1) 1x1 = 1 v 2 = ¯ ¯ v 5 ∂ v 2 v 1 = x 2 5 ∂ v 2 ∂ v 5 ln(2) = 0.693 v 2 = ln( v 0 ) = ¯ v 5 · (1) 1x1 = 1 v 3 = ¯ ¯ v 5 ∂ v 3 ∂ v 3 2 x 5 = 10 v 3 = v 0 · v 1 ∂ v 6 = ¯ v 6 · ( − 1) sin(5) = -0.959 v 4 = sin ( v 1 ) 1x-1 = -1 ¯ v 4 = ¯ v 6 ∂ v 4 ∂ v 4 v 5 = v 2 + v 3 0.693 + 10 = 10.693 ∂ v 6 = ¯ v 6 · 1 ¯ v 5 = ¯ 1x1 = 1 v 6 v 6 = v 5 − v 4 10.693 + 0.959 = 11.652 ∂ v 5 ∂ v 5 ∂ y v 6 = ∂ y y = v 6 11.652 1 1 ¯ ∂ v 6 107

AutoDiff: Reverse Mode ln x 1 v 0 v 5 v 2 + + v 3 Backwards Derivative Backw ackwar ards Derivat vative ve Trace: − v 1 v 4 x 2 y v 6 sin ∂ v 3 ∂ v 2 1 v 0 = ¯ ¯ + ¯ = ¯ v 3 v 1 + ¯ 5.5 v 3 v 2 v 2 Forwar ard Eval valuat ation Trace: ∂ v 0 ∂ v 0 v 0 ∂ v 1 ∂ v 0 ∂ v 3 ∂ v 4 f (2 , 5) = ¯ v 3 v 0 + ¯ v 4 cos ( v 1 ) 1.716 v 1 = ¯ ¯ 1 = ¯ + ¯ v 3 v 4 ∂ v 1 ∂ v 1 ∂ v 1 ∂ v 1 ∂ v ∂ v 5 2 v 0 = x 1 = ¯ v 5 · (1) 1x1 = 1 v 2 = ¯ ¯ v 5 ∂ v 2 v 1 = x 2 5 ∂ v 2 ∂ v 5 ln(2) = 0.693 v 2 = ln( v 0 ) = ¯ v 5 · (1) 1x1 = 1 v 3 = ¯ ¯ v 5 ∂ v 3 ∂ v 3 2 x 5 = 10 v 3 = v 0 · v 1 ∂ v 6 = ¯ v 6 · ( − 1) sin(5) = -0.959 v 4 = sin ( v 1 ) 1x-1 = -1 ¯ v 4 = ¯ v 6 ∂ v 4 ∂ v 4 v 5 = v 2 + v 3 0.693 + 10 = 10.693 ∂ v 6 = ¯ v 6 · 1 1x1 = 1 v 5 = ¯ ¯ 1x1 = 1 v 6 v 6 = v 5 − v 4 10.693 + 0.959 = 11.652 ∂ v 5 ∂ v 5 ∂ y v 6 = ∂ y y = v 6 11.652 1 1 ¯ ∂ v 6 111

AutoDiff: Reverse Mode v 0 v 5 v 2 v 3 ln x 1 v 0 v 5 v 2 v 1 - PowerPoint PPT Presentation

x 1 AutoDiff: Reverse Mode v 0 v 5 v 2 v 3 ln x 1 v 0 v 5 v 2 v 1 + x 2 v 4 y v 6 + v 3 v 1 Traverse the original graph in the reverse topological v 4 x 2 y v 6 sin order and for each node in the original

No more mini-languages: Autodiff in full-featured Python David Duvenaud, Dougal Maclaurin,

Control of switch-mode converters Current Programmed Mode control CPM Mor M. Peretz, Switch-Mode

Reverse Osmosis Reverse Osmosis Background to Market and to Market and Background Technology

NATIVE MODE PROGRAMMING Fiona Reid Overview What is native mode? What codes are suitable

Reverse Logistics Woodfield Distribution, LLC v081617 Reverse Logistics About Us Description

Next-Generation Debuggers For Reverse Engineering For Reverse Engineering The ERESI team

Reverse Mathematics. Antonio Montalb an. University of Chicago. September 2011 Antonio

Remanufacturing of Products Remanufacturing of Products and Reverse Logistics and Reverse

Reverse mathematics and Ramsey theorem for pairs Benoit Monin Universit e Paris-Est Cr

Reverse Traceroute Relaunch David Choffnes, Northeastern (joint work with USC) What is

Direct fibre excitation with a digital laser 1 Proof of principle Mode transfer Mode detection

CMF2012F Series Common Mode SMD Filter for Signal Line FEATURES This common mode filter is

The standard mode of DSQSS/DLA Standard mode files Input files of dla Simple mode file

Org-mode Nick Higham April 22, 2013 Nick Higham Org-mode 1 / 7 University of Manchester What

NATIVE MODE PROGRAMMING Adrian Jackson adrianj@epcc.ed.ac.uk @adrianjhpc Overview What is

switchport mode access switchport mode trunk switchport mode trunk

Factor Vocab Word 2 Fraction Division Its meaning (As it is used A whole number A whole

Lecture 3: Word and document embeddings Plan of the lecture Part 1 : Distributional semantics

Whole Numbers Jumping Jack Snap Game (numeral Cluedo Numerals! cards 0 to 20) Tell your child

Versatility of Singular Value Decomposition (SVD) January 7, 2015 Assumption : Data = Real Data +

CSE 490 Natural Language Processing Spring 2016 Introduction Yejin Choi Slides adapted

OTHER DATA CENTER SERVICES Lecture V Ken Birman Tier two and Inner Tiers 2 If tier one

Creating Tutorial Materials as Lecture Supplements by Integrating Drawing Tablet and Video

Bigtable: A Distributed Storage System for Structured Data Fay Chang, Jeffrey Dean, Sanjay