machine learning techniques in predicting uncertainty of
play

Machine learning techniques in predicting uncertainty of - PowerPoint PPT Presentation

Machine learning techniques in predicting uncertainty of environmental models Dimitri Solomatine Professor of Hydroinformatics, IHE Delft Institute for Water Education Delft, The Netherlands 1 Outline Introduction: what are analysisng?


  1. Machine learning techniques in predicting uncertainty of environmental models Dimitri Solomatine Professor of Hydroinformatics, IHE Delft Institute for Water Education Delft, The Netherlands 1

  2. Outline  Introduction: what are analysisng?  Machine learning methods to (a) analyse and (b) predict the model uncertainty  Suggested approach: “escalation” of uncertainty  Examples 2 D.P. Solomatine. Escalation of uncertainty.

  3. Example for a quick start: deterministic forecasts and 90% uncertainty bounds 4000 3500 3000 Discharge(m3/s) 2500 2000 1500 1000 500 0 900 920 940 960 980 1000 1020 Time(days) 3 D.P. Solomatine. Escalation of uncertainty.

  4. Sources of model uncertainty: perceptual, structure, parameters, data y = M(x, p) + ε s + ε θ + ε x + ε y SF – Snow SF – Snow RF – Rain RF – Rain EA – Evapotranspiration EA – Evapotranspiration SP – Snow cover SP – Snow cover SF SF SF IN – Infiltration IN – Infiltration RF RF RF EA EA EA R – Recharge R – Recharge SM – Soil moisture SM – Soil moisture CFLUX – Capillary transport CFLUX – Capillary transport SP SP UZ – Storage in upper reservoir UZ – Storage in upper reservoir IN IN IN PERC – Percolation PERC – Percolation SM SM LZ – Storage in lower reservoir LZ – Storage in lower reservoir R R CFLUX CFLUX CFLUX Qo – Fast runoff component Qo – Fast runoff component Q1 – Slow runoff component Q1 – Slow runoff component Q0 Q0 Q0 Q – Total runoff Q – Total runoff UZ UZ PERC PERC PERC Q1 Q1 Q1 Q=Q0+Q1 Q=Q0+Q1 Transform Transform LZ LZ function function 4 D.P. Solomatine. Escalation of uncertainty.

  5. Traditional steps in uncertainty analysis of a calibrated model  Identification of sources of uncertainty (input, parameter, model structure)  Quantification of uncertainty (e.g. as distribution)  Studying propagation of uncertainty through the model (e.g. by Monte Carlo simulation)  Quantification of uncertainty in the model outputs (i.e. identification of output distribution (pdf) or its characteristics – mean, st.dev., quantiles)  If possible, reduction of uncertainty (e.g. model improvement, more accurate measurements, etc.)  Application of the uncertain information in decision making process 5 D.P. Solomatine. Escalation of uncertainty.

  6. Data uncertainty (input, parameters): propagation of uncertainty through the model  y^ = M (x, p)  x = input, p = parameters  Uncertainty in X and p propagates to output y  pdf of parameters  pdf of output pdf p  pdf y  pdf of inputs pdf x  pdf of output pdf x  pdf y 6 D.P. Solomatine. Escalation of uncertainty.

  7. Monte Carlo Simulation 7

  8. Mote Carlo casino: roulette wheel It is a random number generator – uses  uniform distribution with the range of [0, 36] 8 D.P. Solomatine. Escalation of uncertainty.

  9. Single model run (no uncertainty) Input (single SF – Snow SF – Snow RF – Rain RF – Rain EA – Evapotranspiration EA – Evapotranspiration Single parameter SP – Snow cover SP – Snow cover time series) SF SF SF RF RF RF IN – Infiltration IN – Infiltration EA EA EA R – Recharge R – Recharge SM – Soil moisture SM – Soil moisture vector P: CFLUX – Capillary transport CFLUX – Capillary transport SP SP UZ – Storage in upper reservoir UZ – Storage in upper reservoir IN IN IN PERC – Percolation PERC – Percolation SM SM FC, ALPHA, K, LZ – Storage in lower reservoir LZ – Storage in lower reservoir R R Qo – Fast runoff component Qo – Fast runoff component CFLUX CFLUX CFLUX Q1 – Slow runoff component Q1 – Slow runoff component Q0 Q0 Q0 Q – Total runoff Q – Total runoff UZ UZ MAXBAS, etc. PERC PERC PERC Q1 Q1 Q1 Q=Q0+Q1 Q=Q0+Q1 Transform Transform LZ LZ function function Run the model Output (single time series) 9 D.P. Solomatine. Escalation of uncertainty.

  10. Monte Carlo simulation in analysing parametric uncertainty 10 D.P. Solomatine. Escalation of uncertainty.

  11. Sampling parameters and multiple model runs Sample one Do this muliple times parameter vector from distributions Input (single SF – Snow SF – Snow RF – Rain RF – Rain EA – Evapotranspiration EA – Evapotranspiration SP – Snow cover SP – Snow cover time series) SF SF SF Single parameter RF RF RF IN – Infiltration IN – Infiltration EA EA EA R – Recharge R – Recharge SM – Soil moisture SM – Soil moisture CFLUX – Capillary transport CFLUX – Capillary transport vector P: SP SP UZ – Storage in upper reservoir UZ – Storage in upper reservoir IN IN IN PERC – Percolation PERC – Percolation SM SM LZ – Storage in lower reservoir LZ – Storage in lower reservoir FC, ALPHA, K, R R Qo – Fast runoff component Qo – Fast runoff component CFLUX CFLUX CFLUX Q1 – Slow runoff component Q1 – Slow runoff component Q0 Q0 Q0 Q – Total runoff Q – Total runoff UZ UZ MAXBAS, etc. PERC PERC PERC Q1 Q1 Q1 Q=Q0+Q1 Q=Q0+Q1 Transform Transform LZ LZ function function Single output Run the model Ensemble of multiple output time series 11 D.P. Solomatine. Escalation of uncertainty.

  12. Monte Carlo sampling: illustration y = M(x, s, θ ) + ε s + ε θ + ε x + ε y 12 D.P. Solomatine. Escalation of uncertainty.

  13. Sampling rainfall and multiple model runs Sample one input Do this muliple times time series from distributions SF – Snow SF – Snow RF – Rain RF – Rain Input (single EA – Evapotranspiration EA – Evapotranspiration Single parameter SP – Snow cover SP – Snow cover SF SF SF RF RF RF IN – Infiltration IN – Infiltration time series) EA EA EA R – Recharge R – Recharge SM – Soil moisture SM – Soil moisture vector P: CFLUX – Capillary transport CFLUX – Capillary transport SP SP UZ – Storage in upper reservoir UZ – Storage in upper reservoir IN IN IN PERC – Percolation PERC – Percolation SM SM FC, ALPHA, K, LZ – Storage in lower reservoir LZ – Storage in lower reservoir R R Qo – Fast runoff component Qo – Fast runoff component CFLUX CFLUX CFLUX Q1 – Slow runoff component Q1 – Slow runoff component Q0 Q0 Q0 Q – Total runoff Q – Total runoff UZ UZ MAXBAS, etc. PERC PERC PERC Q1 Q1 Q1 Q=Q0+Q1 Q=Q0+Q1 Transform Transform LZ LZ function function Single output Run the model Ensemble of multiple output time series 13 D.P. Solomatine. Escalation of uncertainty.

  14. Representing uncertainty of model output by the confidence bounds 400 350 300 Discharge (m3.s) 250 q 95 200 q 5 150 100 50 0 10 20 30 40 50 60 70 Time (hr) Instead of fitting a theoretical distribution, we can use mean, standard deviation, quantiles. E.g., 5% and 95% form the 90% confidence bounds 14 14 D.P. Solomatine. Escalation of uncertainty.

  15. Propagation of parameters/data uncertainty by Monte Carlo simulation is a typical practical approach. But is it the only one? 15 D.P. Solomatine. Escalation of uncertainty.

  16. QUESTION 1. On assumptions  We are assuming some known distributions of parameters or inputs. How safe is this?  Could we take a safer route and assume less?  Let’s make a step backwards and pose the QUESTION 1: what is the uncertainty of the calibrated model itself ? 16 D.P. Solomatine. Escalation of uncertainty.

  17. Residual uncertainty: uncertainty of a calibrated (“optimal”) model Output Y Model Model output y^ Measured Model error Measured value y Actual Observation error Actual value y* (unknown) time Uncertainty of an optimal model M (x, θ)   Model M is calibrated on measured data y  We say the model M uncertainty is manifested in the residual model error ε = y^ – y  This error incorporates all uncertainties due to: observational errors, inaccurately estimated parameters, inadequate model structure 17 D.P. Solomatine. Escalation of uncertainty.

  18. ESCALATION (“build up”) of model uncertainty [message 1]  1. Study the (residual) uncertainty of an optimal model M (p*)  2. Add and study (typically, by MC simulation)  A) uncertainty of M (p*) due to DATA uncertainty  B) uncertainty of M (p) due to PARAMETERS uncertainty  3. Add and study uncertainty of M (p) due to STRUCTURAL uncertainty  4. Study uncertainty of a model class M (p), given the probabilistic properties of parameters and data 18 D.P. Solomatine. Escalation of uncertainty.

  19. QUESTION 2. On what is analysed 400 350 300 Discharge (m3.s) 250 q 95 200 q 5 ? 150 100 50 0 10 20 30 40 50 60 70 Time (hr)  In UA we always use the past data, so Estimates of uncertainty are about the PAST.  QUESTION 2: how can we assess the model uncertainty for new inputs, i.e. for the future? - and this question we pose for all sources of uncertainty (and not only residual ) 19 D.P. Solomatine. Escalation of uncertainty.

  20. Models of Residual Uncertainty : Using Methods of Computational Intelligence 20

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend