Algorithmic Differentiation of a Basket Option Code Results GPU - - PowerPoint PPT Presentation



SLIDE 1

AD on GPU. Jacques du Toit. Outline: Introduction, Local volatility FX basket option, Algorithmic Differentiation, dco, Basket Option Code, Results, Race Conditions

Algorithmic Differentiation of a GPU Accelerated Application

Jacques du Toit

Numerical Algorithms Group

SLIDE 2

Disclaimer

This is not a “speedup” talk: there won’t be any speed or hardware comparisons here. This is about what is possible and how to do it with the minimum of effort. This talk is aimed at people who:
- don’t know much about AD
- don’t know much about adjoints
- don’t know much about GPU computing
Apologies if you’re not in one of these groups ...

SLIDE 3

What is Algorithmic Differentiation?

It’s a way to compute

\[
\frac{\partial F}{\partial x_1}(x_1, x_2, x_3, \ldots), \quad
\frac{\partial F}{\partial x_2}(x_1, x_2, x_3, \ldots), \quad
\frac{\partial F}{\partial x_3}(x_1, x_2, x_3, \ldots), \quad \ldots
\]

where F is given by a computer program, e.g.

    if (x1 < x2) then
        F = x1*x1 + x2*x2 + x3*x3 + ...
    else
        F = x1 + x2 + x3 + ...
    endif

SLIDE 4

Why are Derivatives Useful?

If you have a computer program which computes something (e.g. the price of a contingent claim), AD can give you the derivatives of the output with respect to the inputs. The derivatives are exact up to machine precision (no approximations). Why is this interesting in finance?
- Risk management
- Obtaining exact derivatives for mathematical algorithms such as optimisation (gradient- and Hessian-based methods)
There are other uses as well, but these are the most common.

SLIDE 5

Local Volatility FX Basket Option

A while ago (with Isabel Ehrlich, then at Imperial College) we made a GPU accelerated pricer for a basket option:
- Option written on 10 FX rates driven by a 10-factor local volatility model, priced by Monte Carlo
- The implied vol surface for each FX rate has 7 different maturities with 5 quotes at each maturity
- All together the model has over 400 input parameters
Plan: compute the gradient of the price with respect to the model inputs, including the market implied volatility quotes. We want to differentiate through whatever procedure is used to turn the implied vol quotes into a local vol surface. Due to the large gradient we want to use adjoint algorithmic differentiation, and we also want to use the GPU for the heavy lifting.

SLIDE 6

Local Volatility FX Basket Option

If S^{(i)} denotes the i-th underlying FX rate then

\[
\frac{dS_t^{(i)}}{S_t^{(i)}} = \bigl(r_d - r_f^{(i)}\bigr)\,dt + \sigma^{(i)}\bigl(S_t^{(i)}, t\bigr)\,dW_t^{(i)}
\]

where \((W_t)_{t \ge 0}\) is a correlated N-dimensional Brownian motion with \(\langle W^{(i)}, W^{(j)} \rangle_t = \rho^{(i,j)} t\). The function \(\sigma^{(i)}\) is unknown and is calibrated from market implied volatility quotes according to the Dupire formula

\[
\sigma^2(K, T) = \frac{\theta^2 + 2T\theta\theta_T + 2(r_d - r_f)KT\theta\theta_K}
{\bigl(1 + K d_+ \sqrt{T}\,\theta_K\bigr)^2 + K^2 T \theta \bigl(\theta_{KK} - d_+ \sqrt{T}\,\theta_K^2\bigr)}
\]

where \(\theta\) is the market observed implied volatility surface. The basket call option price is then

\[
C = e^{-r_d T}\,\mathbb{E}\Biggl[\biggl(\sum_{i=1}^{N} w^{(i)} S_T^{(i)} - K\biggr)^{+}\Biggr]
\]

SLIDE 7

Crash Course in Algorithmic Differentiation

SLIDE 8

Algorithmic Differentiation in a Nutshell

- Computers can only add, subtract, multiply and divide floating point numbers. A computer program implementing a model is just many of these fundamental operations strung together.
- It’s elementary to compute the derivatives of these fundamental operations.
- So we can use the chain rule, and these fundamental derivatives, to get the derivative of the output of a computer program with respect to the inputs.
- Classes, templates and operator overloading give a way to do all this efficiently and non-intrusively.
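The chain-rule-plus-operator-overloading idea can be sketched in a few lines of C++. This is a minimal illustrative forward-mode type, not how a production tool such as dco is implemented; the example function f is hypothetical.

```cpp
#include <cmath>

// Minimal forward-mode AD type: each value carries a tangent, the
// derivative with respect to one chosen input.
struct Dual {
    double v;  // value
    double d;  // tangent (derivative)
};

// Overload the fundamental operations with their known derivatives.
Dual operator+(Dual a, Dual b) { return {a.v + b.v, a.d + b.d}; }
Dual operator*(Dual a, Dual b) { return {a.v * b.v, a.d * b.v + a.v * b.d}; }  // product rule
Dual sin(Dual a) { return {std::sin(a.v), std::cos(a.v) * a.d}; }              // chain rule

// Hypothetical example function: f(x1, x2) = x1*x2 + sin(x1)
Dual f(Dual x1, Dual x2) { return x1 * x2 + sin(x1); }
```

Calling `f({1.5, 1.0}, {2.0, 0.0})` seeds the tangent of x1, so the result’s `.d` field is the exact \(\partial f/\partial x_1 = x_2 + \cos(x_1)\); seeding x2 instead gives \(\partial f/\partial x_2\).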

SLIDE 9

Adjoints in a Nutshell

AD comes in two modes: forward (or tangent-linear) mode and reverse (or adjoint) mode. Consider \(f : \mathbb{R}^n \to \mathbb{R}\), take a vector \(x^{(1)} \in \mathbb{R}^n\) and define the function \(F^{(1)} : \mathbb{R}^{2n} \to \mathbb{R}\) by

\[
y^{(1)} = F^{(1)}(x, x^{(1)}) = \nabla f(x) \cdot x^{(1)} = \frac{\partial f}{\partial x} \cdot x^{(1)}
\]

where the dot is the regular dot product. \(F^{(1)}\) is the tangent-linear model of f and is the simplest form of AD. Let \(x^{(1)}\) range over the Cartesian basis vectors and call \(F^{(1)}\) repeatedly to get each partial derivative of f. To get the full gradient \(\nabla f\) we must evaluate the forward model n times, so the runtime to get the whole gradient will be roughly n times the cost of computing f.

SLIDE 10

Adjoints in a Nutshell

Take any \(y_{(1)} \in \mathbb{R}\) and consider \(F_{(1)} : \mathbb{R}^{n+1} \to \mathbb{R}^n\) given by

\[
x_{(1)} = F_{(1)}(x, y_{(1)}) = y_{(1)} \nabla f(x) = y_{(1)} \frac{\partial f}{\partial x}
\]

\(F_{(1)}\) is called the adjoint model of f. Setting \(y_{(1)} = 1\) and calling the adjoint model \(F_{(1)}\) once gives the full vector of partial derivatives of f. Furthermore, it can be proved that in general computing \(F_{(1)}\) requires no more than five times as many flops as computing f. Hence adjoints are extremely powerful, allowing one to obtain large gradients at potentially very low cost.

SLIDE 11

Adjoints in a Nutshell

So how do we construct a function which implements the adjoint model? Mathematically, adjoints are defined as partial derivatives of an auxiliary scalar variable t, so that

\[
y_{(1)} = \frac{\partial t}{\partial y} \quad \text{and} \quad x_{(1)} = \frac{\partial t}{\partial x}
\]

(note: the latter is a vector). Consider a computer program computing y from x through intermediate steps

x → α → β → γ → y

How do we compute the adjoint model of this calculation?

SLIDE 12

Adjoints in a Nutshell

x → α → β → γ → y. Using the definition of the adjoint we can write

\[
x_{(1)} = \frac{\partial t}{\partial x}
= \frac{\partial \alpha}{\partial x}\frac{\partial t}{\partial \alpha}
= \frac{\partial \alpha}{\partial x}\frac{\partial \beta}{\partial \alpha}\frac{\partial t}{\partial \beta}
= \frac{\partial \alpha}{\partial x}\frac{\partial \beta}{\partial \alpha}\frac{\partial \gamma}{\partial \beta}\frac{\partial t}{\partial \gamma}
= \frac{\partial \alpha}{\partial x}\frac{\partial \beta}{\partial \alpha}\frac{\partial \gamma}{\partial \beta}\frac{\partial y}{\partial \gamma}\frac{\partial t}{\partial y}
= \frac{\partial y}{\partial x}\, y_{(1)}
\]

which is the adjoint model we require.

SLIDE 13

Adjoints in a Nutshell

Note that \(y_{(1)}\) is an input to the adjoint model and that

\[
x_{(1)} = y_{(1)} \cdot \frac{\partial y}{\partial \gamma} \cdot \frac{\partial \gamma}{\partial \beta} \cdot \frac{\partial \beta}{\partial \alpha} \cdot \frac{\partial \alpha}{\partial x}
\]

Computing \(\partial y/\partial \gamma\) will probably require knowing γ (and/or β and α as well). This effectively means we have to run the computer program backwards. To run the program backwards we first have to run it forwards and store all the intermediate values needed to calculate the partial derivatives. In general, adjoint codes can require a huge amount of memory to keep all the required intermediate calculations.
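The forward-then-backward pattern can be made concrete with a hand-written adjoint of one such chain x → α → β → γ → y. The elementary steps below are hypothetical, chosen only for illustration: the forward sweep stores the intermediates, and the reverse sweep consumes them while propagating \(y_{(1)}\) back to \(x_{(1)}\).

```cpp
#include <cmath>

// Forward run: compute y from x, storing intermediates for the reverse sweep.
double forward(double x, double& alpha, double& beta, double& gamma) {
    alpha = x * x;            // stored: needed to evaluate cos(alpha) later
    beta  = std::sin(alpha);  // stored: needed to evaluate exp(beta) later
    gamma = std::exp(beta);   // stored: needed to evaluate 2*gamma later
    return gamma * gamma;     // y
}

// Reverse sweep: propagate the output adjoint y1 back through the chain.
double adjoint(double x, double alpha, double beta, double gamma, double y1) {
    double gamma1 = 2.0 * gamma * y1;         // dy/dgamma = 2*gamma
    double beta1  = std::exp(beta) * gamma1;  // dgamma/dbeta = exp(beta)
    double alpha1 = std::cos(alpha) * beta1;  // dbeta/dalpha = cos(alpha)
    return 2.0 * x * alpha1;                  // x_(1) = (dalpha/dx) * alpha_(1)
}
```

Seeding \(y_{(1)} = 1\) makes the returned value the exact derivative dy/dx, and it agrees with a central finite difference.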

SLIDE 14

Adjoints in Practice

So to do an adjoint calculation we need to:
- run the code forwards, storing intermediate calculations
- then run it backwards and compute the gradient
This is a complicated and error-prone task. It is difficult to do by hand: for large codes (a few 1000 lines) it is simply infeasible. It is also very difficult to automate this process efficiently, and in either case it can be tricky to do without running out of memory. This is not something you want to do yourself! Prof Uwe Naumann and his group at RWTH Aachen University produce a tool, dco, which takes care of all of this for you. dco = Derivative Computation through Overloading.

SLIDE 15

dco

Broadly speaking, dco works as follows:
- Replace all active datatypes in your code with dco datatypes
- Register the input variables with dco
- Run the calculation forwards: dco tracks all calculations depending on the input variables and stores intermediate values in a tape
- When the forward run is complete, set the adjoint of the output (price) to 1 and call dco::a1s::interpret_adjoint
- This runs the tape backwards and computes the adjoint (gradient) of the output (price) with respect to all inputs
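The record-then-interpret workflow can be sketched with a toy tape. This is only a schematic of the idea described above; it is emphatically not dco’s actual implementation or API (only the name interpret_adjoint is borrowed from the slide), and the recorded function here is a hand-linearised example.

```cpp
#include <cmath>
#include <vector>

// Toy tape: each entry says "lhs depends linearly on rhs1 and rhs2 with
// local partial derivatives p1 and p2" (rhs2 = -1 for unary operations).
struct TapeEntry { int lhs, rhs1, rhs2; double p1, p2; };

struct Tape {
    std::vector<TapeEntry> entries;
    std::vector<double> adj;  // one adjoint slot per recorded variable
    int new_var() { adj.push_back(0.0); return (int)adj.size() - 1; }
    int record(int r1, double p1, int r2, double p2) {
        int lhs = new_var();
        entries.push_back({lhs, r1, r2, p1, p2});
        return lhs;
    }
    // Seed the output adjoint with 1 and run the tape backwards.
    void interpret_adjoint(int output) {
        adj[output] = 1.0;
        for (auto it = entries.rbegin(); it != entries.rend(); ++it) {
            adj[it->rhs1] += it->p1 * adj[it->lhs];
            if (it->rhs2 >= 0) adj[it->rhs2] += it->p2 * adj[it->lhs];
        }
    }
};
```

Recording y = x0*x1 + sin(x0) and interpreting the tape once fills the adjoints of both inputs with the full gradient, which is exactly the one-sweep property of the adjoint model.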

SLIDE 16

dco

dco is one of the most efficient overloading AD tools:
- It has been used on huge codes (e.g. ocean modelling and shape optimisation)
- It supports checkpointing and user-defined “adjoint functions” (e.g. a hand-written adjoint)
- It supports the NAG Library: derivative calculations can be carried through calls to NAG Library routines
Unfortunately, dco doesn’t (yet) support accelerators. In fact I’m not aware of any AD tools that support accelerators.

SLIDE 17

Basket Option Code

SLIDE 18

Basket Option Code

The basket option code is broken into 3 stages:
- Stage 1: Setup (on CPU): process the market input implied vol quotes into local vol surfaces
- Stage 2: Monte Carlo (on GPU): copy the local vol surfaces to the GPU and create all the sample paths
- Stage 3: Payoff (on CPU): get the final values of the sample paths and compute the payoff
Doing the final step on the CPU was a deliberate decision to mimic banks’ codes and has nothing to do with performance.

SLIDE 19

Basket Option Code

Stage 1 is the longest in terms of lines of code: a series of interpolation and extrapolation steps (cubic splines and interpolating Hermite polynomials) with several calls to NAG Library routines. Stages 1 and 3 are the cheapest in terms of flops in the forward run. The Stage 2 GPU Monte Carlo code is pretty simple. It is the most expensive in terms of flops in the forward run, but it executes quickly on a GPU because it is highly parallel. Now dco can handle the CPU code no problem: but what about the GPU code?

SLIDE 20

External Function Interface

Recall that dco supports user-defined adjoint functions. These are called external functions. They effectively allow you to put “gaps” in dco’s tape; you then provide the code that fills those gaps. That code can be arbitrary, it just has to implement an adjoint model:
- take input adjoints from dco
- provide output adjoints to dco
How it does that is up to the user. So we can use external functions to handle the GPU. The adjoint of the Monte Carlo kernel will be the most expensive part of the entire calculation in terms of flops. We really want this on the GPU if at all possible (which means we’ll need a hand-written adjoint of the Monte Carlo kernel).

SLIDE 21

Monte Carlo Kernel: Forward Run

So what do we need to store from the Monte Carlo kernel? The Euler-Maruyama discretisation is

\[
S_{i+1} = S_i + S_i \Bigl( \bigl(r_d - r_f\bigr)\,\Delta t + \sigma(S_i, i\Delta t)\,\sqrt{\Delta t}\,Z_i \Bigr)
\]

At Monte Carlo time step i we need to know \(S_i\) to compute \(S_{i+1}\). \(S_i\) was computed at the previous time step; nothing else is carried over from the previous time step. To run this calculation backwards (i.e. start with \(S_{i+1}\) and compute \(S_i\)) we’ll need to know \(S_i\) to calculate \(\sigma(S_i, i\Delta t)\), since σ is not invertible.
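The forward run with per-step storage can be sketched as follows. This is a simplified single-asset CPU sketch with a flat placeholder volatility function; the real kernel is a CUDA kernel and evaluates a local-vol spline per asset and time step, so treat every name here as hypothetical.

```cpp
#include <cmath>
#include <vector>

// Placeholder local volatility: the production code evaluates a spline here.
double sigma(double /*S*/, double /*t*/) { return 0.2; }

// Euler-Maruyama forward run for one sample path, storing every S_i.
// Storing the whole path is exactly what the backward (adjoint) sweep needs,
// since sigma(S_i, i*dt) cannot be recovered by inverting the step.
std::vector<double> simulate_path(double S0, double rd, double rf,
                                  double dt, const std::vector<double>& Z) {
    std::vector<double> S(Z.size() + 1);
    S[0] = S0;
    for (size_t i = 0; i < Z.size(); ++i) {
        double t = i * dt;
        S[i + 1] = S[i] + S[i] * ((rd - rf) * dt
                                  + sigma(S[i], t) * std::sqrt(dt) * Z[i]);
    }
    return S;  // all intermediate states, kept for the reverse sweep
}
```

With all normal increments set to zero the path grows deterministically by the factor \(1 + (r_d - r_f)\Delta t\) per step, which gives a quick sanity check on the discretisation.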

SLIDE 22

Adjoint of Monte Carlo Kernel

So what does all this mean?
- In the forward run it is sufficient to store \(S_i\) for all sample paths and all Monte Carlo time steps (i.e. store each step from each path)
- From these, all other values which may be needed for the adjoint calculation can be recomputed
- To avoid having to recompute the local vol we store \(\sigma(S_i, i\Delta t)\) as well
The adjoint of the Monte Carlo kernel (currently) has to be written by hand. This is actually not too difficult: the local vol surfaces are stored as splines, so the most onerous part is writing an adjoint of a spline evaluation function. This is about 150 lines of code, so it is not that bad. The adjoint kernel is massively parallel and can be performed on the GPU as well.

SLIDE 23

SLIDE 24

Test Problem and Results

As a test problem we took 10 FX rates:
- For each rate we had market quotes at 5 different strikes at each of 7 different maturities
- We estimated the correlation structure from historical data, then obtained a nearest correlation matrix
- We used 360 Monte Carlo time steps and 10,000 Monte Carlo sample paths
- The full gradient consisted of 438 entries
We ran on an Intel Xeon E5-2670 with an NVIDIA K20X:
- Overall runtime: 522ms
- The forward run was 367ms (of which Monte Carlo was 14.5ms)
- Computation of adjoints was 155ms (of which the GPU adjoint kernel was 85ms)
- dco used 268MB of CPU RAM
- In total 420MB of GPU RAM was used (this includes the random numbers)

SLIDE 25

A Rather Simple Race Condition

When computing adjoints, dependencies between data are reversed:
- If r produced s, then \(s_{(1)}\) produces \(r_{(1)}\)
- If r produced \(s_1, s_2, s_3\), then \(s_{1(1)}, s_{2(1)}, s_{3(1)}\) all combine to produce \(r_{(1)}\)
- This combination is typically additive
Recall the Euler-Maruyama equation

\[
S_{i+1} = S_i + S_i \Bigl( \bigl(r_d - r_f\bigr)\,\Delta t + \sigma(S_i, i\Delta t)\,\sqrt{\Delta t}\,Z_i \Bigr)
\]

\(r_d\) (and \(r_f\)) feed into every sample path at every time step. Hence the adjoints of all sample paths will feed into \(r_{d(1)}\) at each time step. We parallelise the adjoint kernel across sample paths, so different threads will need to update \(r_{d(1)}\) at the same time: a race condition.

SLIDE 26

A Rather Simple Race Condition

This race is easy to handle:
- Each thread has its own private copy of \(r_{d(1)}\) which it updates as it works backwards from maturity to time 0
- When all threads reach time 0, these private copies are combined in a parallel reduction, which is thread safe
The same can be done for the adjoints of \(r_f\), \(\Delta t\) and the correlation coefficients.
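The private-copy-plus-reduction scheme can be sketched on the CPU with std::thread (the real code does this across CUDA threads, with the reduction also on the GPU). Every name below is hypothetical, and each path’s contribution to \(r_{d(1)}\) is replaced by a constant so the result is checkable.

```cpp
#include <numeric>
#include <thread>
#include <vector>

// Each thread accumulates into its own private copy of rd_(1) while sweeping
// its share of the sample paths backwards; no two threads ever write the same
// slot, so there is no race. The combine happens once, after all threads join.
double accumulate_rd_adjoint(int n_threads, int paths_per_thread) {
    std::vector<double> partial(n_threads, 0.0);  // one private rd_(1) per thread
    std::vector<std::thread> pool;
    for (int t = 0; t < n_threads; ++t)
        pool.emplace_back([&partial, t, paths_per_thread] {
            for (int p = 0; p < paths_per_thread; ++p)
                partial[t] += 1.0;  // stand-in for this path's adjoint contribution
        });
    for (auto& th : pool) th.join();
    // Thread-safe reduction of the private copies (a parallel tree on the GPU).
    return std::accumulate(partial.begin(), partial.end(), 0.0);
}
```

With 4 threads and 1000 unit contributions each, the reduced adjoint is exactly 4000.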

SLIDE 27

A Really Nasty Race Condition

Recall that the local volatility surfaces are stored as splines: a separate spline for each Monte Carlo time step, each with several (20+) knots and coefficients. To compute \(\sigma(S_i, i\Delta t)\), six knots and six coefficients are selected based on the value of \(S_i\). In the adjoint calculation, the adjoint of \(S_i\) will update the adjoints of those six knots and six coefficients. However, another thread processing another sample path could want to update (some of) those data as well: a race condition. So what makes this nasty? Scale: 40,000 threads with 10 assets and 360 1D splines per asset. That is over 21GB if each thread has its own copy, so you have to do something different. This nasty race is a peculiar feature of local volatility models.

SLIDE 28

A Really Nasty Race Condition

So what can we do about this?
- Give each thread its own copy of the spline data in shared memory: leads to low occupancy and poor performance
- Give each thread block a copy in shared memory: needs a lot of synchronisation, hence poor performance
- Give each thread block a copy in shared memory and use atomics: works, but is slow (at least 4x slower than the current code)
The point is, not all 40,000 threads are active at the same time. So if active blocks could grab some memory, use it and then release it, the memory problems go away. This is the approach we took: each thread block allocates some memory and gives each thread a private copy of the spline data, and when the block exits it releases the memory.
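The grab/use/release idea amounts to sizing scratch storage for the number of concurrently active workers rather than the total number of threads. Below is a CPU sketch of such a pool (all names hypothetical); in the real code the analogue is per-thread-block allocation in CUDA, not this mutex-based class.

```cpp
#include <condition_variable>
#include <mutex>
#include <vector>

// Pool of scratch buffers sized to the number of *concurrently active*
// workers. A worker acquires a buffer on entry, uses it as its private
// copy of the spline adjoints, and releases it on exit, so total memory
// is n_active buffers instead of one per thread (21GB in the talk's case).
class ScratchPool {
    std::vector<std::vector<double>> buffers;
    std::vector<bool> in_use;
    std::mutex m;
    std::condition_variable cv;
public:
    ScratchPool(int n_active, size_t scratch_len)
        : buffers(n_active, std::vector<double>(scratch_len)),
          in_use(n_active, false) {}
    int acquire() {  // blocks until a buffer is free
        std::unique_lock<std::mutex> lk(m);
        for (;;) {
            for (size_t i = 0; i < in_use.size(); ++i)
                if (!in_use[i]) { in_use[i] = true; return (int)i; }
            cv.wait(lk);
        }
    }
    std::vector<double>& get(int id) { return buffers[id]; }
    void release(int id) {
        std::lock_guard<std::mutex> lk(m);
        in_use[id] = false;
        cv.notify_one();
    }
};
```

A pool of two buffers serves any number of workers as long as at most two are active at once, which is exactly the occupancy observation the slide makes.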

SLIDE 29

Summary

By combining dco with hand-written adjoints, the full gradient of a GPU accelerated application can be computed very efficiently. In many financial models some benign race conditions arise when computing the adjoint; in local volatility-type models (such as SLV) a rather nasty race condition arises as well. These race conditions can be dealt with through judicious use of memory. Note that the race conditions are independent of the platform used (CPU or GPU), but on a GPU they are much more pronounced.

SLIDE 30

Summary

But what we really want is for dco to support CUDA directly. This is work in progress: watch this space!

SLIDE 31

Summary

Thank you
