summary of polson and sokolov 2018
play

Summary of Polson and Sokolov 2018 Deep Learning for Energy Markets - PowerPoint PPT Presentation

Summary of Polson and Sokolov 2018 Deep Learning for Energy Markets David Prentiss OR750-004 November 12, 2018 The PJM Interconnection The PennsylvaniaNew JerseyMaryland Interconnection (PTO) is a regional transmission organization


  1. Summary of Polson and Sokolov 2018 Deep Learning for Energy Markets David Prentiss OR750-004 November 12, 2018

  2. The PJM Interconnection ◮ The Pennsylvania–New Jersey–Maryland Interconnection (PTO) is a regional transmission organization (RTO). ◮ It implements a wholesale electricity market for a network of producers and consumers in the Mid-Atlantic. ◮ It’s primary purpose is to prevent outages or otherwise un-met demand. ◮ Obligations are exchanged in bilateral contracts, the day-ahead market, and the real-time market.

  3. Local marginal price data ◮ Local Marginal Prices (LMP) are price data aggregated for prices in various locations and interconnection services is the network. ◮ They reflect the cost of producing and transmitting electricity in the network. ◮ Prices are non-linear because electricity. ◮ This paper proposes a NN to model price extremes.

  4. Load vs. price

  5. Load vs. previous load

  6. RNN vs. long short-term memory Vanilla RNN � � h t − 1 �� h t = tanh W x t LSTM     i σ � h t − 1 � f σ      =  ◦ W     o σ x t   k tanh c t = f ⊙ c t − 1 + i ⊙ k h t = o ⊙ tanh ( c t )

  7. LTSM model     i σ � h t − 1 � f σ      =  ◦ W     o σ x t   k tanh c t = f ⊙ c t − 1 + i ⊙ k h t = o ⊙ tanh ( c t )

  8. Extreme value theory ◮ Extreme value analysis begins by filtering the data to select “extreme” values. ◮ Extreme values are selected by one of two methods. ◮ Block maxima: Select the peak values after dividing the series into periods. ◮ Peak over threshold: Select values larger than some threshold. ◮ Peak over threshold used in this paper.

  9. Peak over threshold ◮ Pickands–Balkema–de Hann (1974 and 1975) theorem characterizes the asymptotic tail distribution of an unknown distribution. ◮ Distribution of events that exceed a threshold are approximated with the generalized Pareto distribution. ◮ Low threshold increases bias. ◮ High threshold increases variance.

  10. Generalized Pareto distribution ◮ CDF � − 1 � 1 + ξ y − u ξ H ( y | σ, ξ ) = 1 − σ + ◮ PDF � − 1 ξ − 1 h ( y | σ, ξ ) = 1 − 1 � 1 + ξ y − u σ σ

  11. Parameters � − 1 ξ − 1 h ( y | σ, ξ ) = 1 − 1 � 1 + ξ y − u σ σ ◮ Location, u , is the threshold ◮ Scale, σ , is our learned parameter ◮ Shape, ξ = f ( u , σ )? EX [ y ] = σ + u = ⇒ ξ = 0?

  12. Fourier (ARIMA) model

  13. Fourier (ARIMA) model vs DL

  14. Demand forcasting DL–EVT ◮ DL–EVT Architecture � W (1) X + b (1) � → Z (1) → exp � � Z (1) � X → tanh tanh → σ ( X ) ◮ W (1) ∈ R p × 3 , x ∈ R p , p = 24 (one day) ◮ Threshold, u = 31 , 000

  15. Vanilla DL vs. DL-EVT

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend