Summary of Polson and Sokolov 2018 Deep Learning for Energy Markets - - PowerPoint PPT Presentation

summary of polson and sokolov 2018
SMART_READER_LITE
LIVE PREVIEW

Summary of Polson and Sokolov 2018 Deep Learning for Energy Markets - - PowerPoint PPT Presentation

Summary of Polson and Sokolov 2018 Deep Learning for Energy Markets David Prentiss OR750-004 November 12, 2018 The PJM Interconnection The PennsylvaniaNew JerseyMaryland Interconnection (PTO) is a regional transmission organization


slide-1
SLIDE 1

Summary of Polson and Sokolov 2018

Deep Learning for Energy Markets David Prentiss

OR750-004

November 12, 2018

slide-2
SLIDE 2

The PJM Interconnection

◮ The Pennsylvania–New Jersey–Maryland Interconnection (PTO) is a regional transmission organization (RTO). ◮ It implements a wholesale electricity market for a network of producers and consumers in the Mid-Atlantic. ◮ It’s primary purpose is to prevent outages or otherwise un-met demand. ◮ Obligations are exchanged in bilateral contracts, the day-ahead market, and the real-time market.

slide-3
SLIDE 3

Local marginal price data

◮ Local Marginal Prices (LMP) are price data aggregated for prices in various locations and interconnection services is the network. ◮ They reflect the cost of producing and transmitting electricity in the network. ◮ Prices are non-linear because electricity. ◮ This paper proposes a NN to model price extremes.

slide-4
SLIDE 4

Load vs. price

slide-5
SLIDE 5

Load vs. previous load

slide-6
SLIDE 6

RNN vs. long short-term memory

Vanilla RNN ht = tanh

  • W

ht−1 xt

  • LSTM

    i f

  • k

    =     σ σ σ tanh     ◦ W ht−1 xt

  • ct = f ⊙ ct−1 + i ⊙ k

ht = o ⊙ tanh (ct)

slide-7
SLIDE 7

LTSM model

    i f

  • k

    =     σ σ σ tanh     ◦ W ht−1 xt

  • ct = f ⊙ ct−1 + i ⊙ k

ht = o ⊙ tanh (ct)

slide-8
SLIDE 8

Extreme value theory

◮ Extreme value analysis begins by filtering the data to select “extreme” values. ◮ Extreme values are selected by one of two methods.

◮ Block maxima: Select the peak values after dividing the series into periods. ◮ Peak over threshold: Select values larger than some threshold.

◮ Peak over threshold used in this paper.

slide-9
SLIDE 9

Peak over threshold

◮ Pickands–Balkema–de Hann (1974 and 1975) theorem characterizes the asymptotic tail distribution of an unknown distribution. ◮ Distribution of events that exceed a threshold are approximated with the generalized Pareto distribution. ◮ Low threshold increases bias. ◮ High threshold increases variance.

slide-10
SLIDE 10

Generalized Pareto distribution

◮ CDF H(y | σ, ξ) = 1 −

  • 1 + ξ y − u

σ −1

ξ

+

◮ PDF h(y | σ, ξ) = 1 − 1 σ

  • 1 + ξ y − u

σ −1

ξ −1

slide-11
SLIDE 11

Parameters

h(y | σ, ξ) = 1 − 1 σ

  • 1 + ξ y − u

σ −1

ξ −1

◮ Location, u, is the threshold ◮ Scale, σ, is our learned parameter ◮ Shape, ξ = f (u, σ)? EX [y] = σ + u = ⇒ ξ = 0?

slide-12
SLIDE 12

Fourier (ARIMA) model

slide-13
SLIDE 13

Fourier (ARIMA) model vs DL

slide-14
SLIDE 14

Demand forcasting DL–EVT

◮ DL–EVT Architecture X → tanh

  • W (1)X + b(1)

→ Z (1) → exp

  • tanh
  • Z (1)

→ σ(X) ◮ W (1) ∈ Rp×3, x ∈ Rp, p = 24 (one day) ◮ Threshold, u = 31, 000

slide-15
SLIDE 15

Vanilla DL vs. DL-EVT