Multifidelity modeling: Exploiting structure in high-dimensional - PowerPoint PPT Presentation

Multifidelity modeling: Exploiting structure in high-dimensional problems Karen Willcox Joint work with Tiangang Cui, Andrew March, Youssef Marzouk, Leo Ng Workshop on Numerical Methods for High-dimensional Problems Ecole des Ponts Paristech April 15, 2014

Collaborators and Acknowledgements • Andrew March: Multifidelity optimization • Leo Ng: Multifidelity uncertainty quantification • Tiangang Cui: Statistical inverse problems • Professor Youssef Marzouk • AFOSR Computational Mathematics Program: AFOSR MURI on Uncertainty Quantification (F. Fahroo) • DOE Applied Mathematics Program: DiaMonD Multifaceted Mathematics Integrated Capability Center (S. Landsberg)

Outline • What is multifidelity modeling? • Motivation • Multifidelity modeling approaches: – Optimization – Inverse problems – Uncertainty quantification 3

Multifidelity modeling Often have available several physical and/or numerical models that describe a system of interest. – Models may stem from different resolutions, different assumptions, surrogates, approximate models, etc. – Each model has its own “fidelity” and computational cost Today’s focus: – Multifidelity setup with two models: a “truth” full -order model and a reduced-order model – Want to use the reduced model to accelerate solution of optimization, uncertainty quantification, or inverse problem solution {opt, UQ, inverse} 4

Projection-based model reduction 5

Why use a multifidelity formulation? Reduced model Full model (approximate) (“ truth ”) 6

Why use a multifidelity formulation? Reduced model Full model (approximate) (“ truth ”) Computationally Computationally cheap(er) expensive 7

Why use a multifidelity formulation? Reduced model Full model (approximate) (“ truth ”) Certified? yes • Replace full model with reduced model and solve {opt, UQ, inverse} • Propagate error estimates on forward predictions to determine error in {opt, UQ, inverse} solutions (may be non-trivial) 8

Why use a multifidelity formulation? Reduced model Full model (approximate) (“ truth ”) Certified? no • Replace full model with reduced model and solve {opt, UQ, inverse} • Hope for the best 9

Why use a multifidelity formulation? Reduced model Full model (approximate) (“ truth ”) Certified? no • Use a multifidelity formulation that invokes both the reduced model and the full model • Trade computational cost for the ability to place guarantees on the solution of {opt, UQ, inverse} 10

Why use a multifidelity formulation? Reduced model Full model (approximate) (“ truth ”) Certified? no • Use a multifidelity formulation that invokes both the reduced model and the full model • Trade computational cost for the ability to place guarantees on the solution of {opt, UQ, inverse} • Certify the solution of {opt, UQ, inverse} even in the absence of guarantees on the reduced model itself 11

Multifidelity Strategies • For optimization: – adaptive model calibration (corrections) – combined with trust region model management • For statistical inverse problems: – adaptive delayed acceptance Markov chain Monte Carlo (MCMC) methods • For forward propagation of uncertainty: – control variates 12

m𝑗𝑜 𝑔(𝑦) 𝑦 s.t. 𝑕 𝑦 ≤ 0 ℎ 𝑦 = 0 OPTIMIZATION

Design optimization formulation min 𝑦 𝑔 𝑦 𝑦 Design variables 𝑔(𝑦) s.t. 𝑕 𝑦 ≤ 0 Objective 𝑕(𝑦) , h (𝑦) Constraints ℎ 𝑦 = 0 optimizer f hi x g hi h hi hi-fi model • Interested in optimization of systems governed by PDEs (constraints and objective evaluation is expensive) 14

Multifidelity optimization formulation min 𝑦 𝑔 𝑦 𝑦 Design variables 𝑔(𝑦) s.t. 𝑕 𝑦 ≤ 0 Objective 𝑕(𝑦) , h (𝑦) Constraints ℎ 𝑦 = 0 optimizer optimizer f lo + a f hi g lo + b x x g hi h lo + g h hi lo-fi correction hi-fi model model x j f hi g hi h hi hi-fi model 15

Multifidelity optimization: Surrogate definition Denote a surrogate model of f high ( 𝐲 ) as 𝑛 ( 𝐲 ) • • The surrogate model could be: 1. The low-fidelity function (reduced model) 2. The sum of the low-fidelity function and an additive correction where 𝑓 ( 𝐲 ) is calibrated to the difference f high ( 𝐲 )- f low ( 𝐲 ) 3. The product of a low-fidelity function and a multiplicative correction where 𝛾 𝑑 𝐲 is calibrated to the quotient f high ( 𝐲 ) / f low ( 𝐲 ) • Update the correction terms as the optimization algorithm proceeds and additional evaluations of f high ( 𝐲 ) become available 16

Multifidelity optimization: Trust-region model management • At iteration 𝑙 , define a trust region centered on iterate 𝐲 𝑙 with size Δ 𝑙 • 𝑛 𝑙 is the surrogate model on the 𝑙 th iteration • Determine a trial step 𝒕 𝑙 at iteration 𝑙 , by solving a subproblem of the form: (unconstrained case) 17

Multifidelity optimization: Trust-region model management • Evaluate the function at the trial point: f high ( 𝐲 𝑙 + 𝐭 𝑙 ) • Compute the ratio of the actual improvement in the function value to the improvement predicted by the surrogate model: • Accept or reject the trial point and update trust region size according to (typical parameters):       k k 1 k 0 Reject step 0 . 5    k     0 0 . 1 k 1 k Accept step 0 . 5      1   k k k 0 . 1 0 . 75 Accept step       k k 1 k 0 . 75 Accept step 2 18

Trust-Region Demonstration 20

Trust-region model management: Corrections and convergence • Provably convergent to local minimum of high-fidelity function if surrogate is first-order accurate at center of trust region [Alexandrov et al., 2001] • Additive correction: with surrogate constructed as • Multiplicative correction: with surrogate constructed as • Only first-order corrections required to guarantee convergence; quasi- second-order corrections accelerate convergence [Eldred et al., 2004] • Trust-region POD [Arian, Fahl, Sachs, 2000] 21

Trust-region model management: Derivative-free framework • Derivative-free trust region approaches [Conn, Scheinberg, and Vicente, 2009] • Provably convergent under appropriate conditions if the surrogate model is “fully linear”       ( ) ( ) f m x x high k g k     2 f ( ) m ( ) x x high k f k • Achieved through adaptive corrections or adaptive calibration e.g., radial basis function calibration with sample points chosen to make surrogate model fully linear by construction [Wild, Regis and Shoemaker, 2011; Wild and Shoemaker, 2013] Trust Regions and Calibration Points • Key: never need gradients wrt the x 2 x 2 high-fidelity model x 1 x 1 22

Multifidelity design optimization example: March PhD 2012; Aircraft wing (with black-box codes) March, W., 2012 min 𝑦 𝑔 𝑦 Design variables : wing geometry, structural members Objectives : weight, lift-to-drag ratio s.t. 𝑕 𝑦 ≤ 0 Disciplines : aerodynamics, structures ℎ 𝑦 = 0 Aerodynamics and structures exchange pressure loading and deflections, requiring an iterative solve for each analysis. Multifidelity models : Structures: Nastran (commercial finite element code; MSC) Beam model Aerodynamics: Panair (panel code for inviscid flows; NASA) FRICTION (skin friction and form factors; W. Mason) AVL (vortex-lattice model; M. Drela) 23 Kriging surrogate

Multifidelity design optimization example: Aircraft wing Multifidelity approach : • Trust region model management – Derivative free framework [Conn et al., 2009] • Adaptive calibration of surrogates – Radial basis function calibration to provide fully linear models [Wild et al., 2009] – Calibration applied to correction function (difference between high- and low-fidelity models) [Kennedy & O’Hagan, 2001] • Computational speed-up + robustness to code failures Low-Fidelity Model Nastran Evals. Panair Evals. Time* (days) None 7,425 7,425 4.73 AVL/Beam Model 5,412 5,412 3.45 Kriging Surrogate 3,232 3,232 2.06 * Time corresponds to average of 30s per Panair evaluation, 25s per Nastran 24 evaluation, and serial analysis of designs within a discipline.

𝜌 𝑦|𝑒 ~𝑀 𝑒|𝑦 𝜌 0 𝑦 INVERSE PROBLEMS

Large-scale statistical inverse problems State Parameters Data  u  d  C ( u , e ) A ( u , x ) PDE: Observation:  t • Data are limited in number, noisy, and indirect • State-space is high dimensional (PDE model) • Unknown parameters are high-dimensional 26

Large-scale statistical inverse problems State Parameters Data 𝜌 𝑦|𝑒 ~𝑀 𝑒|𝑦 𝜌 0 𝑦 Bayes rule: posterior likelihood prior 27

Large-scale statistical inverse problems: Exploiting low-rank structure State Parameters Data 𝜌 𝑦|𝑒 ~𝑀 𝑒|𝑦 𝜌 0 𝑦 Bayes rule: posterior likelihood prior • Low-rank structure in the state space: Data-driven model reduction [Cui, Marzouk, W., 2014] • Low-rank structure in the parameter space: Efficient posterior exploration (likelihood-induced subspace) 28 [Lieberman, W., 2010; Cui, Martin, Marzouk, 2014]

Multifidelity modeling: Exploiting structure in high-dimensional - PowerPoint PPT Presentation

Multifidelity modeling: Exploiting structure in high-dimensional problems Karen Willcox Joint work with Tiangang Cui, Andrew March, Youssef Marzouk, Leo Ng Workshop on Numerical Methods for High-dimensional Problems Ecole des Ponts Paristech

9.4 Local Perception Filters 9.4 Local Perception Filters Exploiting Exploiting Perceptual

Design and Analysis for Multifidelity Computer Experiments Ying Hung Department of Statistics

Data-Driven Multifidelity Methods for Monte Carlo Estimation Benjamin Peherstorfer Courant

Multifidelity importance sampling methods for rare event simulation Benjamin Peherstorfer

A Multifidelity Cross-entropy Method for Rare Event Simulation Benjamin Peherstorfer Courant

EXPLOITING STRUCTURE FOR META-LEARNING NeurIPS Metalearning Workshop | December 8, 2018 Lise

Exploiting Structure of Uncertainty for Efficient Matroid Semi-Bandits Pierre Perrault (INRIA

Di ff erentially-Private Batch Query Answering Exploiting the Workload vs. Exploiting the Data

Exploiting Private Local Exploiting Private Local Memories to Reduce the Memories to Reduce the

Exploiting carbon and nitrogen Exploiting carbon and nitrogen compounds for enhanced energy

Exploiting Extreme Processor Counts on the Cray Exploiting Extreme Processor Counts on the Cray

Visualization of Geant4 Data: Exploiting Component Visualization of Geant4 Data: Exploiting

Hacking Browser's DOM Exploiting Ajax and RIA Exploiting Ajax and RIA Shreeraj Shah

Register Packing Register Packing Exploiting Narrow- -Width Operands Width Operands Exploiting

Exploiting Level- Exploiting Level -of of- -Detail Perception Detail Perception Multiple

C3 B: Exploiting the Num erous C3 B: Exploiting the Num erous Possibilities W eb Technology

Can Deep Learning Be Interpreted with Kernel Methods ? Ben Edelman & Preetum Nakkiran

Verifying commuting quantum computations via fidelity estimation of weighted graph states

Pooling fidelity and phase recovery Joan Bruna, Arthur Szlam, and Yann LeCun Neural networks:

Expectations and the demand for domestic goods April 5, 2018 Contents 1 Introduction 1 2 The

Coaching with Intention Making the Most of the PBC Cycle September 2020 Presenters Kymberly

Fiduciary Services Philip Nicol-Gent, Director Fiona Crocker, Deputy Director Nic Cleveland,

and nd Em Emergin rging g Priv ivacy acy Laws Kaylee Cox Bankston on Counsel, Manatt,

Socialized Costs of Hard Forks Free Money Wasnt Free 2017.08.01 18:12 UTC Free Money!!! Or

Multifidelity modeling: Exploiting structure in high-dimensional - PowerPoint PPT Presentation

Multifidelity modeling: Exploiting structure in high-dimensional problems Karen Willcox Joint work with Tiangang Cui, Andrew March, Youssef Marzouk, Leo Ng Workshop on Numerical Methods for High-dimensional Problems Ecole des Ponts Paristech

9.4 Local Perception Filters 9.4 Local Perception Filters Exploiting Exploiting Perceptual

Design and Analysis for Multifidelity Computer Experiments Ying Hung Department of Statistics

Data-Driven Multifidelity Methods for Monte Carlo Estimation Benjamin Peherstorfer Courant

Multifidelity importance sampling methods for rare event simulation Benjamin Peherstorfer

A Multifidelity Cross-entropy Method for Rare Event Simulation Benjamin Peherstorfer Courant

EXPLOITING STRUCTURE FOR META-LEARNING NeurIPS Metalearning Workshop | December 8, 2018 Lise

Exploiting Structure of Uncertainty for Efficient Matroid Semi-Bandits Pierre Perrault (INRIA

Di ff erentially-Private Batch Query Answering Exploiting the Workload vs. Exploiting the Data

Exploiting Private Local Exploiting Private Local Memories to Reduce the Memories to Reduce the

Exploiting carbon and nitrogen Exploiting carbon and nitrogen compounds for enhanced energy

Exploiting Extreme Processor Counts on the Cray Exploiting Extreme Processor Counts on the Cray

Visualization of Geant4 Data: Exploiting Component Visualization of Geant4 Data: Exploiting

Hacking Browser's DOM Exploiting Ajax and RIA Exploiting Ajax and RIA Shreeraj Shah

Register Packing Register Packing Exploiting Narrow- -Width Operands Width Operands Exploiting

Exploiting Level- Exploiting Level -of of- -Detail Perception Detail Perception Multiple

C3 B: Exploiting the Num erous C3 B: Exploiting the Num erous Possibilities W eb Technology

Can Deep Learning Be Interpreted with Kernel Methods ? Ben Edelman &amp; Preetum Nakkiran

Verifying commuting quantum computations via fidelity estimation of weighted graph states

Pooling fidelity and phase recovery Joan Bruna, Arthur Szlam, and Yann LeCun Neural networks:

Expectations and the demand for domestic goods April 5, 2018 Contents 1 Introduction 1 2 The

Coaching with Intention Making the Most of the PBC Cycle September 2020 Presenters Kymberly

Fiduciary Services Philip Nicol-Gent, Director Fiona Crocker, Deputy Director Nic Cleveland,

and nd Em Emergin rging g Priv ivacy acy Laws Kaylee Cox Bankston on Counsel, Manatt,

Socialized Costs of Hard Forks Free Money Wasnt Free 2017.08.01 18:12 UTC Free Money!!! Or

Can Deep Learning Be Interpreted with Kernel Methods ? Ben Edelman & Preetum Nakkiran