Forecasting intraday-load curve using sparse learning methods (PowerPoint presentation)


SLIDE 1

Forecasting intraday-load curve using sparse learning methods

Dominique Picard

LPMA- Université Paris-Diderot-Paris 7

Collaborators : Mathilde Mougeot UPD, Vincent Lefieux RTE, Laurence Maillard RTE

Numerical methods for high dimensional problems

Dominique Picard, Forecasting intraday-load curve using sparse learning

SLIDE 2

Pre-big-data framework, towards streaming machine learning

SLIDE 3

Pre-streaming machine learning

  • Volume: moderate
  • Variety: moderate
  • Velocity: small

SLIDE 4

Pre-streaming machine learning

  • Volume: moderate
    • smart (data-driven) organisation of the information
    • methods allowing an increasing volume of data
  • Variety: moderate
    • multidimensional functional data
  • Velocity: small

We describe a forecasting pipeline, i.e. a chain of learning algorithms producing a final functional prediction.

SLIDE 5

Description of the problem

SLIDE 6

Intraday load curve during a week, Monday January 25th to Sunday January 31st

SLIDE 7

Intraday load curve forecasting (here 48 h)

[Figure: 48 h forecast for 2010-06-02 to 2010-06-03; observed signal Y, approximation Yapx, prediction start tpred]

SLIDE 8

Forecasting pipeline

1 Construction of a 'smart encyclopedia' of past scenarios out of a data basis, using different learning algorithms.

2 Build a set of prediction experts consulting the encyclopedia.

3 Aggregate the prediction experts.

SLIDE 9

The past data basis

  • Electrical consumption of the past
  • Other ’shape variables’: calendar data, functional bases
  • Meteorological input

SLIDE 10

Electrical consumptions of the past

  • Recorded every half hour from January 1st, 2003 to August 31st, 2010.
  • For this period, the global consumption signal is split into N = 2800 sub-signals (Y_1, . . . , Y_t, . . . , Y_N); Y_t ∈ R^n is the intraday load curve of the t-th day, of size n = 48.

SLIDE 11

Intraday load curve for seven days, Monday January 25th to Sunday January 31st

SLIDE 12

Shape and seasonal effects

Figure: Intraday load curves for various days. 2010-02-03, winter: black dash-dot line; 2010-05-21, spring: red dashed line; 2009-10-23, autumn: green solid line; 2010-08-19, summer: blue dotted line; 2010-01-01, public holiday: gray dotted line.

SLIDE 13

Calendar and functional effects (endogenous)

Figure: intraday load curves in autumn, winter, spring and summer.

SLIDE 14

Calendar and functional effects (shape description)

  • Consumption on day T can be explained by consumptions of past days t′ < T.
  • It can be explained by calendar values of the day T (Monday, ..., Sunday, months, seasons, ...).
  • It is a function of time and can be expressed in a standard dictionary of functions (wavelets, Fourier, ...).

SLIDE 15

Functional aspect : dictionary


Figure : Functions of the dictionary. Constant (black-solid line), cosine (blue-dotted line), sine (blue-dashdot line), Haar (red-dashed line) and temperature (green-solid line with points) functions.

SLIDE 16

Meteorological inputs: exogenous variables

  • A total of 371 (= 2×39 + 293) meteorological variables,
  • recorded half-hourly over the 2800 days of the same period of time.

Temperature: T^k, for k = 1, . . . , 39, measured at 39 weather stations scattered all over the French territory.

Cloud cover: N^k, for k = 1, . . . , 39, measured at the same 39 weather stations.

Wind: W^{k′}, for k′ = 1, . . . , 293, available at 293 network points scattered all over the territory.

SLIDE 17

Weather stations

Figure: Temperature and cloud-cover measurement stations; wind stations.

SLIDE 18

Brest, Lille, Marseille

Figure: Temperature (T), cloud cover (CC) and wind (W) at Brest (blue line), Lille (red line) and Marseille (green line).

SLIDE 19

Main issues

1 Large dimension.

2 Prediction requires explaining the load with a small number of predictable parameters.

3 Most of the potentially explanatory variables (load curves, meteo, functions of the dictionary) are highly correlated.

SLIDE 20

Reduced set of explanatory variables

For each index t of the day of interest, we register the daily electrical consumption signal Y_t and Z_t = [[C]_t [M]_t], where [C]_t is the concatenation of the calendar, functional and past-consumption variables, and [M]_t of the meteo variables.

SLIDE 21

Sparse approximation on the learning set

Sparse approximation of each consumption day on a learning set of days (2003-2010), using the set of potentially explanatory variables.

  • For each day t of the learning set, we build an approximation Ŷ_t of the (observed) signal Y_t with the help of the set of explanatory variables Z_t:

Ŷ_t = G_t(Z_t),  G_t(Z_t) = Z_t β̂_t

(∗) Sparse Approximation and Knowledge Extraction for Electrical Consumption Signals, 2012, M. Mougeot, D. P., K. Tribouley, V. Lefieux, L. Teyssier-Maillard

SLIDE 22

High dimensional linear models

Y = Xβ + ε

  • β ∈ R^k is the unknown parameter (to be estimated).
  • ε = (ε_1, . . . , ε_n)∗ is a (non-observed) vector of random errors, assumed i.i.d. N(0, σ^2).
  • X is a known n × k matrix.

High dimension: k ≫ n
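As a toy illustration of this regime, one can simulate the model with a sparse β; a minimal sketch where all sizes and names are illustrative, not those of the RTE data:

```python
import numpy as np

# Minimal simulation of Y = X beta + eps with k >> n and a sparse beta
# ("small number of big coefficients"); sizes are illustrative only.
rng = np.random.default_rng(0)
n, k, S = 100, 500, 5
X = rng.standard_normal((n, k))
beta = np.zeros(k)
beta[:S] = 3.0                      # S big coefficients, the rest are zero
Y = X @ beta + 0.5 * rng.standard_normal(n)
```

With k = 500 columns and only n = 100 observations, ordinary least squares is ill-posed, which is why the sparsity assumption below is needed.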

SLIDE 23

Forecasting procedure: forecasting using the encyclopedia

  • Construction of a set of forecasting experts.
  • Aggregation of the experts.

SLIDE 24

Forecasting experts: expert associated with a strategy M

  • Strategy: M is a function, data-dependent or not, from N to N such that for any d ∈ N, M(d) < d (purely non-anticipative).
  • Plug-in: to the strategy M we associate the expert Ỹ_t^M, the prediction of the signal of day t using the forecasting strategy M:

Ỹ_t^M = G_{M(t)}(Z_t) = Z_t β̂_{M(t)}

SLIDE 25

Examples of strategies: time-depending

tm1: refer to the day before (the coefficients used for prediction are those calculated the previous day):

M(d) = d − 1,  Ỹ_t^{tm1} = Z_t β̂_{t−1}

tm7: refer to one week before:

M(d) = d − 7,  Ỹ_t^{tm7} = Z_t β̂_{t−7}
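In code, such time-depending strategies are just index maps; a hypothetical sketch (the names tm1 and tm7 follow the slide, everything else is illustrative):

```python
import numpy as np

# Time-depending strategies: each maps a day index d to an earlier day
# M(d) < d (purely non-anticipative); the expert reuses the coefficients
# fitted on day M(d). Names follow the slide; the rest is illustrative.
def tm1(d):
    return d - 1      # refer to the day before

def tm7(d):
    return d - 7      # refer to one week before

def expert(Z_t, beta_hat_by_day, M, t):
    """Plug-in expert: prediction of day t with the coefficients of day M(t)."""
    return Z_t @ beta_hat_by_day[M(t)]
```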

SLIDE 26

Experts introducing meteorological scenarios

  • T: find the past day having the closest temperature indicators, in sup distance over the days and the indicators:

M(d) = ArgMin_t sup_{k∈{1,...,6}, i∈{1,...,48}} |T^k_d(i) − T^k_t(i)|

  • Tm: find the past day having the closest median temperature, in sup distance over the days:

M(d) = ArgMin_t sup_{i∈{1,...,48}} |T^3_d(i) − T^3_t(i)|
SLIDE 27

MAPE error

For day t, the prediction MAPE error over the interval [0, T] is defined by:

MAPE(Y, Ỹ_t^M)(T) = (1/T) Σ_{i=1}^T |Ỹ_t^M(i) − Y_t(i)| / Y_t(i)

MISE(Y, Ỹ_t^M)(T) = (1/T) Σ_{i=1}^T |Ỹ_t^M(i) − Y_t(i)|^2
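These two errors translate directly into code; a sketch with our own variable names:

```python
import numpy as np

# MAPE and MISE of a prediction over a horizon of T time steps,
# following the definitions above (variable names are ours).
def mape(y_obs, y_pred):
    return np.mean(np.abs(y_pred - y_obs) / y_obs)

def mise(y_obs, y_pred):
    return np.mean((y_pred - y_obs) ** 2)

y_obs = np.array([100.0, 200.0, 400.0])
y_pred = np.array([110.0, 190.0, 400.0])
print(round(mape(y_obs, y_pred), 4))   # 0.05
```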

SLIDE 28

Prediction evaluation

M        mean    med     min     max
Naive    0.0634  0.0415  0.0046  0.1982
Apx      0.0129  0.0104  0.0023  0.0786
tm1      0.0340  0.0281  0.0063  0.1490
tm7      0.0327  0.0258  0.0054  0.2297
T        0.0306  0.0263  0.0058  0.1085
Tm       0.0329  0.0275  0.0047  0.2020
T/N      0.0347  0.0293  0.0056  0.1916
Tm/N     0.0358  0.0300  0.0054  0.2156
T/G      0.0323  0.0271  0.0050  0.1916
T/d      0.0351  0.0278  0.0053  0.1916
T/c      0.0340  0.0259  0.0053  0.1937
Ns/G     0.0322  0.0251  0.0049  0.2078
N/d      0.0305  0.0239  0.0042  0.1449
N/c      0.0307  0.0237  0.0042  0.1990

Table: MAPE errors of the different strategies for intraday prediction.

SLIDE 29

Prediction evaluation-Comparing experts


Figure: Percentage of best predictor (tm1, tm7, Ts, Ts/N, Tm, Tm/N, T/g, N/g, T/j, N/j, T/c, N/c).

SLIDE 30

Prediction evaluation-Comparing experts on days


Figure: Percentage of best predictor per day (1: Monday, . . . , 7: Sunday).

SLIDE 31

Prediction evaluation-Comparing experts


Figure: Percentage of best predictor per month.

SLIDE 32

Aggregation of predictors: exponential weights

(Inspired by various theoretical results; see Lecué, Rigollet, Stoltz, Tsybakov, ...)

Ỹ_d^{wgt} = Σ_{m=1}^M w_d^m Ỹ_d^m / Σ_{m=1}^M w_d^m

with

w_d^M = exp( −(1/(Tθ)) Σ_{i=1}^T |Ỹ_d^M(i) − Y_d(i)|^2 )

θ is a parameter (often called temperature in physics applications; see the discussion below), T = T_pred.
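A minimal sketch of this exponential-weight aggregation (θ and all names are illustrative; T is the length of the window on which past errors are computed):

```python
import numpy as np

# Exponential-weight aggregation of M experts: expert m gets a weight
# decaying exponentially in its past quadratic error; theta is the
# temperature parameter.
def aggregate(preds_today, preds_past, y_past, theta):
    # preds_today: (M, n) predictions for the target day
    # preds_past:  (M, T) past predictions, y_past: (T,) past observations
    T = y_past.shape[0]
    errs = np.sum((preds_past - y_past) ** 2, axis=1)
    w = np.exp(-errs / (T * theta))
    w /= w.sum()
    return w @ preds_today, w
```

An expert with zero past error keeps the largest weight; a large θ flattens the weights toward a plain average.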

SLIDE 33

(MAPE = 0.7%.)

[Figure: 48 h signal for 2010-06-02 to 2010-06-03; observed Y, approximation Yapx, prediction start tpred]

SLIDE 34

Forecasting

(MAPE = 0.7%.)

[Figure: 48 h forecast for 2010-06-02 to 2010-06-03; observed Y, approximation Yapx, prediction start tpred]

SLIDE 35

Winter forecast


Figure: Forecast (solid blue line) and observed (dashed dark line) electrical consumption for a winter week, from Monday February 1st to Sunday February 7th, 2010.

SLIDE 36

Spring forecast


Figure: Forecast (solid blue line) and observed (dashed dark line) electrical consumption for a spring week, from Monday June 14th to Sunday June 21st, 2010.

SLIDE 37

Sparse methods, collinearity, structure

SLIDE 38

High dimensional linear models

Y = Xβ + ε

  • β ∈ R^k is the unknown parameter (to be estimated).
  • ε = (ε_1, . . . , ε_n)∗ is a (non-observed) vector of random errors, assumed i.i.d. N(0, σ^2).
  • X is a known n × k matrix.

High dimension: k ≫ n

(∗) M. Mougeot, D. P., K. Tribouley, JRSS B 2012, B Stat. Methodol., vol. 74

SLIDE 39

FBUND sparse reconstruction

[Figure: FBund price series, 2009-12-07, trading time]

(∗) M. Mougeot

SLIDE 40

FBUND sparse reconstruction

[Figure: FBund 2009-12-07, sparse reconstruction with S = 11, trading time]

(∗) M. Mougeot

SLIDE 41

Genomic example

Y = (1, . . . , 1)^t,  X = [gene-expression data matrix]

SLIDE 42

The matrix X : genomic

X = [gene-expression data matrix]

  • X: expression of different genes; behaves like n × p i.i.d. N(0, 1) random variables (large random matrices).

SLIDE 43

Signal denoising

Y =

[Figure: FBund price series, 2009-12-07, trading time]

What is X in this case ?

SLIDE 44
Statistical learning, regression estimation

Y_i = f(t_i) + ε_i + u_i,  i = 1, . . . , n

  • The ε_i's are i.i.d. N(0, 1).
  • The u_i's are possibly random, not necessarily random nor i.i.d., but 'small'.
  • The t_i are observation times (t_i = i/n).
  • f is the parameter to be estimated.

SLIDE 45

Using a dictionary

To estimate f, we consider a dictionary D of size #D = p, D = {g_1, . . . , g_p}, and assume that f can be well fitted by this dictionary:

f = Σ_{ℓ=1}^p β_ℓ g_ℓ + h   (1)

where hopefully h is a 'small' function (in absolute value).

SLIDE 46

Modeling

This coincides with the model Y = Xβ + u + ε if we put u_i = h(t_i) and

X = ( g_1(t_1) · · · g_p(t_1) ; . . . ; g_1(t_n) · · · g_p(t_n) )
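Concretely, X is obtained by evaluating the dictionary functions at the observation times; a tiny illustrative dictionary in the spirit of the one above (constant, cosine, sine, Haar-type step):

```python
import numpy as np

# Build the design matrix X[i, l] = g_l(t_i) for t_i = i/n from a small
# illustrative dictionary (constant, cosine, sine, Haar-type step).
n = 48
t = np.arange(1, n + 1) / n
dictionary = [
    lambda s: np.ones_like(s),
    lambda s: np.cos(2 * np.pi * s),
    lambda s: np.sin(2 * np.pi * s),
    lambda s: np.where(s <= 0.5, 1.0, -1.0),
]
X = np.column_stack([g(t) for g in dictionary])
print(X.shape)   # (48, 4)
```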

SLIDE 47

The dictionary problem

Of course sparsity is linked with the dictionary.

  • Fourier Basis
  • Wavelet basis
  • Needlets
  • Combination of ’bases’

SLIDE 48

Fourier basis


SLIDE 49

Haar wavelets


SLIDE 50

Conditions generally required to solve the problem

  • 'Sparsity': conditions on the vector β.
  • Conditions on the matrix X (not too high collinearities, RIP, ...).

SLIDE 51

Restricted isometry property

For C ⊂ {1, . . . , p}, denote by X_C the matrix X restricted to the columns in C, and the associated Gram matrix M(C) := (1/n) X_C^t X_C.

The restricted isometry property means that M(C) is almost the identity matrix for any C small enough.

SLIDE 52

Example 1: RIP

RIP(m_0, ν) assumes that there exist 0 ≤ ν < 1 and m_0 ≥ 1 such that, for any C with #C = m ≤ m_0 and all x ∈ R^m:

(1 − ν) ‖x‖²_{ℓ2(m)} ≤ x^t M(C) x ≤ (1 + ν) ‖x‖²_{ℓ2(m)}

SLIDE 53

Example 2: coherence condition

  • M := (1/n) X^t X.
  • M_jj = 1 for all j.
  • Coherence:

τ_n = sup_{ℓ≠m} |M_{ℓm}| = sup_{ℓ≠m} |(1/n) Σ_{i=1}^n X_{iℓ} X_{im}|

Coherence ⇒ RIP(⌊ν/τ_n⌋, ν)
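The coherence is straightforward to compute once the columns are rescaled so that M_jj = 1; a sketch, not tied to the talk's data:

```python
import numpy as np

# Coherence tau_n = max off-diagonal entry of M = (1/n) X^t X, with the
# columns rescaled so that M_jj = 1.
def coherence(X):
    n = X.shape[0]
    Xn = X / np.sqrt((X ** 2).mean(axis=0))   # enforce M_jj = 1
    M = Xn.T @ Xn / n
    return np.abs(M - np.eye(M.shape[1])).max()
```

Two perfectly collinear columns give coherence 1, while i.i.d. N(0, 1) columns give a small value, of order √(log k / n).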

SLIDE 54

Sparsity conditions

SLIDE 55

Sparsity conditions

#{ℓ ∈ {1, . . . , k} : |β_ℓ| ≠ 0} ≤ S

Σ_ℓ |β_ℓ|^q ≤ M^q, 0 < q < 1   (B_q(M))

SMALL NUMBER OF BIG COEFFICIENTS

SLIDE 56

Penalization for sparsity

Many penalizations have been introduced historically in the regression framework (to put identification constraints on β):

  • Ridge: E(β, λ) = ||Y − Xβ||² + λ Σ_j β_j²
  • Lasso: E(β, λ) = ||Y − Xβ||² + λ Σ_j |β_j|
  • SCAD: E(β, λ) = ||Y − Xβ||² + λ Σ_j w_j g(β_j)

Solutions are based on convex optimization for ℓ2 and ℓ1, and non-convex optimization for SCAD. Candès & Tao (2007), Fan & Lv (2008, 2010), and many others.
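As a generic illustration (not the procedure of the talk): ridge has a closed form, and the lasso can be solved by iterative soft-thresholding (ISTA) on the objective (1/2)||Y − Xβ||² + λ||β||_1:

```python
import numpy as np

# Ridge: closed form. Lasso: ISTA (proximal gradient with soft
# thresholding) on (1/2)||Y - X beta||^2 + lam * ||beta||_1.
def ridge(X, Y, lam):
    k = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(k), X.T @ Y)

def lasso_ista(X, Y, lam, n_iter=500):
    L = np.linalg.norm(X, 2) ** 2          # Lipschitz constant of the gradient
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        z = beta - X.T @ (X @ beta - Y) / L
        beta = np.sign(z) * np.maximum(np.abs(z) - lam / L, 0.0)
    return beta
```

For an orthonormal design, ISTA reduces to one soft-thresholding step, which makes the shrinkage effect of the ℓ1 penalty visible directly.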

SLIDE 57

Fast greedy methods: 2-step thresholding procedures

Y = Xβ + ε,  Y (n × 1), X (n × k)

Step 1 (pre-selection): find b leaders, X_b (n × b), with b < n ≪ k.
Least squares on the b leaders: β̃ = (X_b^∗ X_b)^{−1} X_b^∗ Y  (1 × b).
Step 2 (denoising the coefficients): β̂  (1 × Ŝ).

SLIDE 58

LOL, coefficient-wise: Step 1

B = {ℓ : K_ℓ ≥ λ_1},  K_ℓ = |(1/n) Σ_{i=1}^n X_{iℓ} Y_i|

[Figure: the K_ℓ and the threshold λ_1; n = 250, p = 1000, X i.i.d. N(0, 1), S = 10, card(B) = 170 ≫ S]

SLIDE 59

LOL: step 3

[Figure: estimated coefficients and second threshold λ_2; n = 250, S = 10, p = 1000, b = 2, SNR = 5]

SLIDE 60

Structuring

Y = Xβ + ε, X: N × k. We decide to re-arrange the k predictors into p (p ≤ k) groups of variables:

X = [X_{G_1}, . . . , X_{G_p}], where G_1, . . . , G_p is a partition of {1, . . . , k}; X_ℓ = X_{(j,t)}, X_{G_j} = [X_{(j,1)}, . . . , X_{(j,|G_j|)}]

  • j ∈ {1, . . . , p} is the index of the group G_j.
  • t is the altitude (height) of ℓ inside the group G_j.

SLIDE 61

Structured sparsity

Σ_{j=1}^p w_j ‖β_{G_j}‖_r^q = Σ_{j=1}^p w_j [Σ_{t=1}^T |β_{(j,t)}|^r]^{q/r} ≤ M^q.

If w_j = 1 and r ≥ q:

Σ_{j=1}^p ‖β_{G_j}‖_r^q = Σ_{j=1}^p [Σ_{t=1}^T |β_{(j,t)}|^r]^{q/r} ≤ Σ_{j,t} |β_{(j,t)}|^q

  • Structured sparsity is generally less stringent than the ordinary one.
  • It means we require a small number of 'big' groups.

SLIDE 62

Example of structure: wavelet grouping

  • Block thresholding (global blocks):

β = (β_{jk}),  G_j = {(j, k), 0 ≤ k < 2^j},  0 ≤ j ≤ p.  Size of G_j = 2^j.  Sparsity = Besov(s, r, q)(M).

SLIDE 63

GR-LOL, Step 1

The columns of X are normalized: (1/n) Σ_{i=1}^n X²_{i(j,t)} = 1 for all (j, t).

'Grouped correlation' search and thresholding:

K_{(j,t)} = |(1/n) Σ_{i=1}^n X_{i(j,t)} Y_i|,  for all (j, t), 1 ≤ j ≤ p, 1 ≤ t ≤ T

ρ_j² = Σ_{t=1,...,T} K²_{(j,t)},  T = max |G_j|
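These step-1 statistics translate directly into code (groups given as arrays of column indices; the whole setup is illustrative):

```python
import numpy as np

# Per-coefficient correlations K_(j,t) and grouped statistic rho_j^2,
# with groups given as arrays of column indices (illustrative setup).
def grouped_scores(X, Y, groups):
    n = X.shape[0]
    K = np.abs(X.T @ Y) / n
    return np.array([np.sum(K[g] ** 2) for g in groups])

rng = np.random.default_rng(2)
X = rng.standard_normal((100, 6))
Y = 4.0 * X[:, 0] + 0.1 * rng.standard_normal(100)
groups = [np.array([0, 1]), np.array([2, 3]), np.array([4, 5])]
rho2 = grouped_scores(X, Y, groups)
```

The group containing the informative column then dominates, and the step-1 threshold would keep it.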

SLIDE 64

GR-LOL, Step 1 (continued)

  • λ(1) is a tuning parameter.
  • B = {j = 1, . . . , p : ρ_j² ≥ λ(1)²}   (2)
  • G_B = ∪_{j∈B} G_j.

SLIDE 65

GR-LOL, Step 2

OLS on the block leaders, considering the new pseudo-linear model Y = X_{G_B} β_{G_B} + error:

β̂_{G_B} = β̂(B) and β̂_{G_B^c} = 0,

where β̂(B) = [X^t_{G_B} X_{G_B}]^{−1} X^t_{G_B} Y.

SLIDE 66

GR-LOL, Step 3: block thresholding

  • λ(2) is another tuning parameter.
  • We apply the second threshold to the estimated coefficients:

for all ℓ = (j, t) ∈ {1, . . . , k},  β̂*_ℓ = β̂_ℓ 1{ ‖β̂_{G_j}‖_2 ≥ λ(2) }

where ‖β̂_{G_j}‖²_2 := Σ_{0≤t≤T} β̂²_{(j,t)}.

SLIDE 67

Boosting the convergence by grouping

Y = Xβ + ε,  Y (N × 1), X (N × k), p groups.

  • Calculate the (internal) correlations of the columns of the matrix X, as well as their (external) correlation with the target Y.
  • Put columns which are highly correlated (internal correlation) in different groups.
  • Gather the columns with typically close correlation to the target (external correlation).
  • Make T (the maximal group size) as small as possible.
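One illustrative way to realize this grouping (our own simplification, not the talk's exact construction): rank columns by external correlation, deal the top ones out as group beginners so that highly correlated columns land in different groups, then fill the groups round-robin.

```python
import numpy as np

# Illustrative grouping heuristic: rank columns by |corr with Y|, use the
# top-p columns as "group beginners" (so highly correlated columns land
# in different groups), then fill the groups round-robin, which keeps the
# maximal group size T small.
def make_groups(X, Y, p):
    n, k = X.shape
    K = np.abs(X.T @ Y) / n
    order = np.argsort(K)[::-1]                    # columns by external correlation
    groups = [[int(order[j])] for j in range(p)]   # beginners: top-p columns
    for pos, col in enumerate(order[p:]):
        groups[pos % p].append(int(col))           # round-robin fill
    return groups
```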

SLIDE 68

Boosting the rates: cut-off

  • Divide the columns of X into two sets: S1, highly correlated; S2, weakly correlated.
  • Put the S1 columns as 'group beginners' (each of them has smallest altitude in its group) to separate them.
  • Choose the cut-off between S1 and S2.
  • Fill the groups by affinity with the delegate, in terms of K_ℓ = |(1/n) Σ_{i=1}^n X_{iℓ} Y_i|: gathering the columns with typically close correlation with the target.

SLIDE 69

Back to electrical consumption


Figure : French consumption

SLIDE 70

Temperatures

Figure : Temperature spots

SLIDE 71

Dictionary


Figure : Functions of the dictionary. Constant (black-solid line), cosine (blue-dotted line), sine (blue-dashdot line), Haar (red-dashed line) and temperature (green-solid line with points) functions.

SLIDE 72

Delegates

[Figure: 'correlation' between the consumption signal and the various dictionary functions (E, C, S, H, T); the chosen delegates are tagged with a red star.]

  • For LOL, E = 1.86%; the (×24) selected functions, T-T-C-T-H-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-S-C, are meaningful functions: (20: T), (2: C), (1: S), (1: H).
  • For Group LOL, E = 0.75%, with 24 regressors / 8 groups: THS-THH-THH-TCS-TCST-HHTC-STSH; meaningful functions: (8: T), (3: C), (5: S), (8: H).

SLIDE 73

Approximation


Figure : Model of the consumption signal (black-solid line) using GROL (red-dashed line) and LOL (blue dot dashed line).

SLIDE 74

Pre-processing the explanatory variables

SLIDE 75

Reduced dictionary, endogenous variables: patterns

  • Represent each day sparsely on the dictionary (H, S, C).
  • Use the K-means algorithm to cluster this representation: 8 groups.
  • Encode these groups as calendar boolean variables.
  • Define in each group the consumption 'pattern' of the group (simply the mean), mean_{G(t)}.
  • Z_t = [[C]_t [M]_t]
  • Put [C]_t = [mean_{G(t)}, Y_{t−7}]

SLIDE 76

Reduced dictionary, groups of patterns

Table: Groups 1, . . . , 8 are defined using a calendar interpretation of the clusters, from Monday (day 1) to Sunday (day 7) and from January (month 1) to December (month 12), computed from January 1st to August 31st.

Days \ Months   1  2  3  4  5  6  7  8  9  10 11 12
1               7  8  5  3  3  3  3  1  3  3  5  7
2               7  8  5  3  3  3  3  1  3  3  5  7
3               7  8  5  3  3  3  3  1  3  3  5  7
4               7  8  5  3  3  3  3  1  3  3  5  7
5               7  8  5  3  3  3  3  1  3  3  5  7
6               6  8  4  4  2  2  2  2  2  2  4  6
7               6  6  4  4  2  2  2  2  2  2  4  6

SLIDE 77

K-means algorithm

1 Place K points into the space represented by the objects being clustered; these points represent the initial group centroids.

2 Assign each object to the group that has the closest centroid.

3 When all objects have been assigned, recalculate the positions of the K centroids.

4 Repeat steps 2 and 3 until the centroids no longer move.
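The four steps above can be sketched in a few lines of numpy (an illustrative implementation, not the one used for the 8 consumption groups):

```python
import numpy as np

# Plain K-means following the four steps above (illustrative sketch).
def kmeans(data, K, n_iter=100, seed=0):
    rng = np.random.default_rng(seed)
    centroids = data[rng.choice(len(data), K, replace=False)]   # step 1
    labels = np.zeros(len(data), dtype=int)
    for _ in range(n_iter):
        d = np.linalg.norm(data[:, None, :] - centroids[None, :, :], axis=2)
        labels = d.argmin(axis=1)                               # step 2
        new = np.array([data[labels == j].mean(axis=0) if np.any(labels == j)
                        else centroids[j] for j in range(K)])   # step 3
        if np.allclose(new, centroids):                         # step 4
            break
        centroids = new
    return labels, centroids
```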

SLIDE 78

Important features

1 Number of clusters

2 Stability of the algorithm

SLIDE 79

Reduced dictionary : Meteo variables

  • Linear summary of the variables: PCA capturing 90% of the variance, each variable treated separately.
  • Non-linear summary: for each variable, (Max, Min, Med, Variance).

SLIDE 80

Convergence results

SLIDE 81

2-step thresholding procedures

Y = Xβ + ε,  Y (n × 1), X (n × p)

Step 1 (pre-selection): find b leaders, X_b (n × b), with b < n ≪ p.
Least squares on the b leaders: β̃ = (X_b^∗ X_b)^{−1} X_b^∗ Y  (1 × b).
Step 2 (denoising the coefficients): β̂  (1 × Ŝ).

SLIDE 82

GR-LOL, Step 1

The columns of X are again normalized: (1/n) Σ_{i=1}^n X²_{i(j,t)} = 1 for all (j, t).

'Grouped correlation' search and thresholding:

K_{(j,t)} = |(1/n) Σ_{i=1}^n X_{i(j,t)} Y_i|,  for all (j, t), 1 ≤ j ≤ p, 1 ≤ t ≤ T

ρ_j² = Σ_{t=1,...,T} K²_{(j,t)},  T = max |G_j|

SLIDE 83

GR-LOL, Step 1 (continued)

  • λ(1) is a tuning parameter.
  • B = {j = 1, . . . , p : ρ_j² ≥ λ(1)²}   (3)
  • G_B = ∪_{j∈B} G_j.

SLIDE 84

GR-LOL, Step 2

OLS on the block leaders, considering the new pseudo-linear model Y = X_{G_B} β_{G_B} + error:

β̂_{G_B} = β̂(B)  (hence β̂_{G_B^c} = 0),

where β̂(B) = [X^t_{G_B} X_{G_B}]^{−1} X^t_{G_B} Y.

SLIDE 85

GR-LOL, Step 3: block thresholding

  • λ(2) is another tuning parameter.
  • We apply the second threshold to the estimated coefficients:

for all ℓ = (j, t) ∈ {1, . . . , k},  β̂*_ℓ = β̂_ℓ 1{ ‖β̂_{G_j}‖_2 ≥ λ(2) }

where ‖β̂_{G_j}‖²_2 := Σ_{0≤t≤T} β̂²_{(j,t)}.

SLIDE 86

GR-LOL: tuning the thresholds

Choose:

  • Threshold λ_1 such that:
GR-LOL: λ(1) = κ_1 [ √((T ∨ log p)/n) ∨ τ* ]
LOL: λ_1 = κ_1 [ √((log p)/n) ∨ τ* ]
  • Threshold λ_2 such that:
GR-LOL: λ(2) = κ_2 [ √((T ∨ log p)/n) ∨ τ* ]
LOL: λ_2 = κ_2 [ √((log p)/n) ∨ τ* ]

with T := max_j |G_j|.

SLIDE 87

Concentration results

Loss function: d(β̂*, β)² = Σ_{ℓ=1}^k (β̂*_ℓ − β_ℓ)²

Assumptions:

  • Sparsity: Σ_{j=1}^p ‖β_{G_j}‖_1^q = Σ_{j=1}^p [Σ_{t=1}^T |β_{(j,t)}|]^q ≤ M^q   (β ∈ B_{1,q}(M))
  • Dimension: p ≤ exp(c′n)  (c′ constant)

SLIDE 88

Concentration results

sup_{B_{1,q}(M)} E d(β̂*, β)² ≤ D [ √((T ∨ log p)/n) ∨ τ* ]^{2−q}

sup_{B_{1,0}(S)} E d(β̂*, β)² ≤ D S [ √((T ∨ log p)/n) ∨ τ* ]²

for some positive constant D. What is τ*?

SLIDE 89

Coherence

  • Let M be the k × k Gram matrix M := (1/n) X^∗X,
  • and the coherence

τ_n = sup_{ℓ≠m} |M_{ℓm}| = sup_{ℓ≠m} |(1/n) Σ_{i=1}^n X_{iℓ} X_{im}| = sup_{(j,t)≠(j′,t′)} |M_{(j,t)(j′,t′)}| = sup_{(j,t)≠(j′,t′)} |(1/n) Σ_{i=1}^n X_{i(j,t)} X_{i(j′,t′)}|

SLIDE 90

Splitting the coherence: multitask inspiration

We split the coherence τ_n into γ_BG and γ_BA, where

γ_BG := sup_t sup_{j≠j′} |M_{(j,t)(j′,t)}|   (between groups, at a given altitude; sup over altitudes)

γ_BA := sup_{j,j′} sup_{t≠t′} |M_{(j,t)(j′,t′)}|   (small; different altitudes, no matter which groups)

SLIDE 91

τ*

Let us define τ* = T γ_BA + γ_BG, where T = max_{j=1,...,p} #{G_j}.

SLIDE 92

Example

ST coefficients, all equal to γ; γ_BA = 0, γ_BG = γ ≥ √((log k)/n).

RATES:
LOL: ST [γ² + (log k)/n]
GR-LOL (optimal grouping): S [γ² + T/n + (log(k/T))/n]
GR-LOL (worst grouping): ST [γ² + T/n + (log(k/T))/n]

SLIDE 93

Boosting the rates: strategies for grouping

Y = Xβ + ε,  Y (N × 1), X (N × k). Question: how to group to obtain better rates, when possible?

Σ_{j=1}^p [Σ_{t=1}^T |β_{(j,t)}|]^q        [ √((T ∨ log p)/n) ∨ {T γ_BA + γ_BG} ]
        ↓                                          ↓
    GATHERING                        WORKING on T γ_BA + γ_BG

SLIDE 94

Boosting the rates

  • Divide the columns of X into two sets: S1, highly correlated; S2, weakly correlated.
  • Put the S1 columns as 'group beginners' (each of them has smallest altitude in its group): γ_BA ≪ γ_BG = γ_max.
  • Realize a 'good' cut-off between S1 and S2, ensuring: T γ_BA ≤ γ_BG,  (log p)/n ≤ γ²_BG,  T/n ≤ γ²_BG.
  • Fill the groups by affinity with the delegate, in terms of K_ℓ = |(1/n) Σ_{i=1}^n X_{iℓ} Y_i|: make Σ_{j=1}^p [Σ_{t=1}^T |β_{(j,t)}|]^q as small as possible.
