Behavior Analytics Providing insights that enable evidence-based, - - PowerPoint PPT Presentation

behavior analytics
SMART_READER_LITE
LIVE PREVIEW

Behavior Analytics Providing insights that enable evidence-based, - - PowerPoint PPT Presentation

Lawrence Berkeley National Laboratory Behavior Analytics Providing insights that enable evidence-based, data-driven decisions Taking advantage of smart meter data: combining behavioral economics with data science analytics Pete ter Cap


slide-1
SLIDE 1

Taking advantage of smart meter data:

combining behavioral economics with data science analytics

Pete ter Cap apper ers & Annika ka Todd, PhD

IRP Contempor ntemporary ary Issues Technica hnical Conf nfer eren ence ce April 24, 2018

Lawrence Berkeley National Laboratory

Behavior Analytics

Providing insights that enable evidence-based, data-driven decisions

slide-2
SLIDE 2

2

Data explosion in energy

  • AMI, thermostats, appliances, cars
  • Linked to other time and location-specific

information (temperature, census, satellite)

  • Provide vast, constantly growing streams of

rich data

slide-3
SLIDE 3
  • What can we do with all of this data?
  • Many possibilities!
  • Insights from the data  tremendous potential

value for a wide range of energy programs, policies, and overall grid integration.

3

Smart meter data enables many possibilities for cutting edge analyses

010101011110100100101010110010011011100110110101010111101001001010101100100110111 110011011011100110001111011010100101010011101101001110101010111101001001010101100 001010101100100110111001101101110011000111101101010010101001110011011011100110001 011011100110110111001100011110110101001101001110110101011011100110001111011010100

slide-4
SLIDE 4

Our solution: Combine behavioral economics with data science

Better understand:

  • Customers’ energy characteristics
  • Customers’ energy usage behaviors

Implications and uses for: 1. Load forecasting 2. Utility planning 3. Increasing cost effectiveness of rates and DSM programs (existing

  • r new)

Using only easily accessible data from smart meters and other sources

slide-5
SLIDE 5
  • 1. Lots of things you can do with smart meter

data (5 examples)

  • 2. Some can be really useful, and some aren’t

(insist on seeing results)

  • 3. Let’s just do a lot of quick A/B testing and

analysis – what actually works? What should we try next?

  • Test big things (program validity), small things (best

wording for marketing messages), test continuously

5

Main Takeaway:

slide-6
SLIDE 6

Dataset for these examples

  • Residential hourly electricity data
  • 100,000 households
  • A region with usage peaks in the summer time
  • Pilots for TOU and CPP rates
  • Randomized controlled trial of these new rates
  • Households are randomly placed in different treatment groups
  • Randomized control group to compare to
  • Over 3 years of data
  • One year prior to new rates
  • Two years once rates start

6

Examples that we have done

slide-7
SLIDE 7

Example 1

Cluster load shape patterns Form groups of households with similar load shapes

7

slide-8
SLIDE 8

8

What the grid sees – aggregate load shape from everyone on the system

slide-9
SLIDE 9

99 Load Shapes: Now What?

9

Wide variety of load patterns across customers (even customers who appeared to be similar)

slide-10
SLIDE 10
  • Let machine learning show you patterns of

energy usage characteristics

  • Cluster all of the various types of households’

daily load shape patterns, to form groups that are similar to each other

10

Cluster load shape patterns Form groups of similar load shapes

slide-11
SLIDE 11

11

Use algorithms to cluster load shapes.

99 cluster groups: these 16 are the biggest

slide-12
SLIDE 12

12

Look at “Representative Load shapes”

Better predictions of current/future energy use

slide-13
SLIDE 13

Example 2

Look at distribution of load shape clusters across…

Number of peaks When the peaks occur

13

slide-14
SLIDE 14

14

Group clusters based on when peaks occur

Different # of peaks at different times of the day

slide-15
SLIDE 15

15

75% of daily patterns are single peaking

Group clusters based on when peaks occur

Different # of peaks at different times of the day

slide-16
SLIDE 16

16

26% of the daily load patterns have a peak at the same time as the system peak.

Group clusters based on when peaks occur

Different # of peaks at different times of the day

System peak

slide-17
SLIDE 17

17

Group clusters based on when peaks occur

Different # of peaks at different times of the day If we mostly care about predicting daily demand during system peak hours, then we could focus on getting really good predictions for these clusters since they drive most of the demand during peak hours

System peak

slide-18
SLIDE 18

18

Group clusters based on when peaks occur

Different # of peaks at different times of the day We can target households with these clusters for peak hour DR programs (like TOU pricing programs)

System peak

slide-19
SLIDE 19

Example 3

Look at distribution of load shape clusters across…..

Outdoor temperatures Day of week Season of the year

19

slide-20
SLIDE 20

20

Group clusters by temperature and contribution to usage

What we are seeing: The blue dots are

  • nly 3 of the

clusters, and yellow is all of the others (96 other clusters) On hot days (where the red line is high), there are more blue dots than on other days. This means….

slide-21
SLIDE 21

21

Group clusters by temperature and contribution to usage

These 3 clusters: Cover 50% of electricity usage on hottest days

slide-22
SLIDE 22

22

Group clusters by temperature and contribution to usage

If we mostly care about predicting daily demand during hot days, could focus on getting really good predictions for these three clusters since they drive most of the demand on those days These 3 clusters: Cover 50% of electricity usage on hottest days

slide-23
SLIDE 23

23

Group clusters by temperature and contribution to usage

We can target households with these three clusters for event-driven DR programs (like CPP pricing programs) These 3 clusters: Cover 50% of electricity usage on hottest days

slide-24
SLIDE 24

Example 4

Identify energy characteristics and develop metrics to represent those characteristics Segment household enrollment & response by energy characteristics Apply segmentation for targeting, tailoring, and predicting to get better program outcomes

24

slide-25
SLIDE 25

Identified a set of behavioral energy characteristics that we hypothesized should influence a household’s willingness to enroll in and respond to time-varying pricing programs

  • Baseload usage
  • Metric: daily minimum usage
  • Flexibility of a household’s energy use schedule
  • More flexible households may be more able or willing to make changes
  • Metrics measuring variability in electricity usage patterns over time
  • Savings potential
  • Metric of load magnitude on hot days;
  • Occupancy behavior of a household
  • Presence of residents during times surrounding the peak periods may make them

more able to respond, represented by

  • Metrics of usage during non-typical hours,
  • “Structural winningness” for a particular type of program (e.g., new rate)
  • Structural winners are households that would receive lower bills on the new rate

if they didn’t make any changes in their energy usage relative to the prior year (while on the traditional time-invariant electricity rate)

25

Decide what characteristics are useful, draw these characteristics out of the data

slide-26
SLIDE 26

26

Prototypical Load Shapes Enrollment vs. Response

Berkeley Lab

0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 10% 15% 20% 25%

kWh Sacings (per household hourly savings during peak hours on event days) Enrollment probability Source: Borgeson et al. (Forthcoming) kWh Savings (per household hourly savings during peak hour on event days)

slide-27
SLIDE 27

27

Do customers who are more likely to enroll also provide greater load response?

Berkeley Lab

0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 10% 15% 20% 25%

kWh Sacings (per household hourly savings during peak hours on event days) Enrollment probability kWh Savings (per household hourly savings during peak hour on event days) Source: Borgeson et al. (Forthcoming)

slide-28
SLIDE 28

0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 10% 15% 20% 25%

kWh Sacings (per household hourly savings during peak hours on event days) Enrollment probability

28

Do customers who are more likely to enroll also provide greater load response?

Berkeley Lab

kWh Savings (per household hourly savings during peak hour on event days) Source: Borgeson et al. (Forthcoming)

No!

slide-29
SLIDE 29

0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 10% 15% 20% 25%

kWh Sacings (per household hourly savings during peak hours on event days) Enrollment probability

29

Planning Efforts Could Benefit from Knowing Types of Customers based on Enrollment and Responsiveness

Berkeley Lab

kWh Savings (per household hourly savings during peak hour on event days) Source: Borgeson et al. (Forthcoming)

slide-30
SLIDE 30

0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 10% 15% 20% 25%

kWh Sacings (per household hourly savings during peak hours on event days) Enrollment probability

30

Do customers who see greater bill savings (i.e., structural winners) provide less load response?

Berkeley Lab

kWh Savings (per household hourly savings during peak hour on event days) Source: Borgeson et al. (Forthcoming)

slide-31
SLIDE 31

0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 10% 15% 20% 25%

kWh Sacings (per household hourly savings during peak hours on event days) Enrollment probability

31

Do customers who see greater bill savings (i.e., structural winners) provide less load response?

Berkeley Lab

kWh Savings (per household hourly savings during peak hour on event days) Source: Borgeson et al. (Forthcoming)

slide-32
SLIDE 32

0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 10% 15% 20% 25%

kWh Sacings (per household hourly savings during peak hours on event days) Enrollment probability

32

Do customers who see greater bill savings (i.e., structural winners) provide less load response?

Berkeley Lab

kWh Savings (per household hourly savings during peak hour on event days) Source: Borgeson et al. (Forthcoming)

No!

slide-33
SLIDE 33

0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 10% 15% 20% 25%

kWh Sacings (per household hourly savings during peak hours on event days) Enrollment probability

33

Target market to the most responsive customers

Berkeley Lab

Target

 enrollment

kWh Savings (per household hourly savings during peak hour on event days) Source: Borgeson et al. (Forthcoming)

slide-34
SLIDE 34

0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 10% 15% 20% 25%

kWh Sacings (per household hourly savings during peak hours on event days) Enrollment probability

34

Can we identify customers who are highly likely to enroll and may be able to increase their responsiveness?

Berkeley Lab

kWh Savings (per household hourly savings during peak hour on event days) Source: Borgeson et al. (Forthcoming)

slide-35
SLIDE 35

0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 10% 15% 20% 25%

kWh Sacings (per household hourly savings during peak hours on event days) Enrollment probability

35

Can we identify customers who are highly likely to enroll and may be able to increase their responsiveness?

Berkeley Lab

kWh Savings (per household hourly savings during peak hour on event days) Source: Borgeson et al. (Forthcoming)

Yes!

slide-36
SLIDE 36

0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 10% 15% 20% 25%

kWh Sacings (per household hourly savings during peak hours on event days) Enrollment probability

36

Tailor marketing and education material to better engage customers and increase their responsiveness

Berkeley Lab

Tailor

 savings

kWh Savings (per household hourly savings during peak hour on event days) Source: Borgeson et al. (Forthcoming)

slide-37
SLIDE 37

37

Which customers are more cost effective to pursue?

Berkeley Lab

Source: Borgeson et al. (Forthcoming)

slide-38
SLIDE 38

Example 5

Identify timing of load response Better understand its implications on other metrics of interest

38

slide-39
SLIDE 39

Customer response to CPP: Consistent with Expectations on Event Days

Participants reduce usage during CPP event hours on event days – as expected True for volunteers as well as those defaulted

  • nto CPP
slide-40
SLIDE 40

Do CPP Customers Alter Usage on Non- Event Days?

Participants reduce usage during CPP event hours on non- event days too!!! True for volunteers as well as those defaulted

  • nto CPP
slide-41
SLIDE 41
  • Unclear if this “spillover” effect would apply to

PTR or other event-based DR programs…. But if it did:

  • Adversely affect baseline calculations that rely on

previous non-event days usage

  • Adversely impact settlement calculations resulting in

customers getting more/less than they actually deserve

  • Adversely impact load and peak demand forecasting,

as well as allocation of coincident peak demand reductions for resource adequacy

Spillover can Undermine Lots of Other Metrics

slide-42
SLIDE 42
  • Improve prediction and forecasting
  • Improve program cost-effectiveness
  • Better EM&V methods

 All of these help with utility planning, both short term (day-ahead DR planning), and long term portfolio planning

42

Summary of Implications

slide-43
SLIDE 43
  • 1. Lots of things you can do with smart meter

data

  • 2. Some can be really useful, and some aren’t

(test with real insist on seeing results)

  • 3. Let’s just do a lot of quick A/B testing and

analysis – what actually works? What should we try next?

  • Test big things (program validity), small things (best

wording for marketing messages), test continuously

43

Main Takeaway:

slide-44
SLIDE 44

Contact:

Peter Cappers

pacappers@lbl.gov 315-637-0513

Annika Todd

atodd@lbl.gov 510-495-2165

Berkeley Lab - Behavior Analytics

Providing insights that enable evidence-based, data-driven decisions

slide-45
SLIDE 45

45

Define relevant household energy behavior characteristics that you think are important

Flexibility Metrics (Variability of Usage)

entropy Entropy, meant to characterize overall variability in daily household consumption patterns, generated by clustering daily baseload usage patterns and calculating the entropy in load shape assignment for a given customer across days pre-peak CV Coefficient of variation (CV) of consumption in the two hours prior to the peak period across days peak CV CV of consumption during the peak period across days post-peak CV CV of consumption in the two hours following the peak period across days

Savings Potential Metrics (Load Magnitude during the Hottest Days)

pre-peak mean (hot) Average consumption during the two hours prior to the peak period peak mean (hot) Average consumption during the peak period post-peak mean (hot) Average consumption during the two hours following the peak period

Occupancy Metrics (Load Magnitude during the Non-Hottest Days)

pre-peak mean Average consumption during the two hours prior to the peak period peak mean Average consumption during the peak period post-peak mean Average consumption during the two hours following the peak period

Baseload Usage Metrics

minimum Average daily minimum consumption across all days (i.e., base load)

Structural Winningness

Structural Winningness

The degree to which a household would get lower bills on the new rate if they didn’t make any energy behavior changes (the amount of money a household would have saved in the pre-treatment year if they had been on the new rate instead of the old rate)

slide-46
SLIDE 46

Appendix Example

Better baseline estimates  More accurate EM&V  More accurate customer settlement payments & penalties

46

slide-47
SLIDE 47

47

Better Baseline Methods are Possible

  • Green dotted

line is actual usage

  • Red line is a

typical prediction method

  • Blue and

purple lines are different machine learning “gradient tree” methods

slide-48
SLIDE 48

48

Better Baseline Methods are Possible

  • Machine

learning methods do a better job at predicting real usage

  • Better

prediction of usage  better baselines for EM&V and customer settlements