Staring at the Sun: Black-box Solar Analytics and their Privacy - - PowerPoint PPT Presentation

staring at the sun black box solar analytics and their
SMART_READER_LITE
LIVE PREVIEW

Staring at the Sun: Black-box Solar Analytics and their Privacy - - PowerPoint PPT Presentation

Staring at the Sun: Black-box Solar Analytics and their Privacy Implications David Irwin Electrical and Computer Engineering University of Massachusetts Amherst 1 Solar Energy is Rapidly Expanding Installed cost of solar continues to drop


slide-1
SLIDE 1

David Irwin

Electrical and Computer Engineering University of Massachusetts Amherst

Staring at the Sun: Black-box Solar Analytics and their Privacy Implications

1

slide-2
SLIDE 2

David Irwin — UMass Amherst Staring at the Sun

Solar Energy is Rapidly Expanding

  • Installed cost of solar continues to drop
  • Cost fell by 50% from 2008 to 2013
  • Led to 418% increase in solar capacity
  • Many implications to this rising solar penetration

2

slide-3
SLIDE 3

David Irwin — UMass Amherst Staring at the Sun

Implications of Solar to Grid

  • Utilities must actively control generation to balance grid
  • Individual homes exhibit highly stochastic demand profiles
  • However, aggregate demand profiles are smooth and highly predictable
  • Enables utilities to plan generator “dispatch” schedules in advance

50 100 150 200 250 300 2 4 6 8 10 12 14 16 18 20 22 24

Power (kW) Time (hours)

Aggregate

1000 2000 3000 4000 5000 6000 7000

2 4 6 8 10 12 14 16

Power (W) Time (hours)

Individual

1 home 194 homes

3

slide-4
SLIDE 4

David Irwin — UMass Amherst Staring at the Sun

Implications of Solar to Grid

  • Large-scale solar penetration fundamentally alters this paradigm
  • Increases stochasticity of demand profiles, even when aggregated
  • Solar output can change instantly, while generators take time to “ramp up”
  • Complicates controlling generation to balance supply and demand
  • May require more energy storage, spinning reserve, or demand response capacity
  • Accurate solar monitoring and forecasting is critical
  • Track solar penetration rates over time
  • Monitor real-time fluctuations in grid solar production
  • Inform advanced planning of generator dispatch schedules
  • Identify faults and anomalies in solar output

4

slide-5
SLIDE 5

David Irwin — UMass Amherst Staring at the Sun

Prior Work

  • Possible to develop highly accurate models of solar performance
  • Leverages detailed information on site characteristics

Figure from PV Performance Modeling Collaborative

  • However, detailed information not always available

5

slide-6
SLIDE 6

David Irwin — UMass Amherst Staring at the Sun

Black-box Solar Analytics

  • Assumes only access to solar energy data time-series
  • Without any detailed metadata
  • Motivating Scenarios
  • Utilities managing grid with thousands of small-scale solar sites
  • Might know location, but not deployment details
  • Third-party energy analytics companies
  • Often do not know location, or deployment details
  • Researchers accessing public datasets
  • Metadata is often scarce and unreliable

6

slide-7
SLIDE 7

David Irwin — UMass Amherst Staring at the Sun

This Talk – Discuss Two Black-box Techniques

  • 1. Solar Disaggregation
  • Turns out utilities often do not even have access to solar data
  • Residential “grid-tied” solar almost always “behind the meter”
  • Only directly monitor the net of consumption and generation
  • Prevents wide-range of learning-based data analytics

7

slide-8
SLIDE 8

David Irwin — UMass Amherst Staring at the Sun

This Talk – Discuss Two Black-box Techniques

  • 2. Solar Localization
  • Determine location from “anonymous” solar energy data
  • Both a privacy threat and/or a potentially useful tool
  • Location is highly useful contextual information when analyzing energy data

8

slide-9
SLIDE 9

David Irwin — UMass Amherst Staring at the Sun

SunDance - Solar Disaggregation

  • Given meter location, separate “net” meter data into solar

generation and consumption at each time t

  • Pnet(t) = Ps(t) + Pc(t), where Pc(t) > 0, Ps(t) < 0, ∨ t > 0
  • Challenges
  • 1. Do not have access to already-separated historical data
  • 2. Cannot individually model solar generation or energy consumption
  • 4
  • 2

2 4

Power (kW) Time (Hours) Net Meter

2 4

Power (kW) Time (Hours) Consumption

= +

9

  • 4
  • 2

Power (kW) Time (Hours) Solar

slide-10
SLIDE 10

David Irwin — UMass Amherst Staring at the Sun

SunDance Design Overview

  • 1. Build a custom model of maximum solar generation
  • Find “best” fitting valid solar curve to the data using a small amount of data
  • Can find accurately even on noisy net meter data
  • 2. Build a general model of weather’s effect on irradiance
  • Train model that maps weather metrics to fraction of clear sky irradiance
  • Use to infer fraction of clear sky irradiance at site based on weather
  • Can train model using data from any solar sites where it is available
  • 3. Apply two models to disaggregate solar
  • Solar generation Ps(t) = Product of (1) and (2) at every time t
  • Energy consumption Pc(t) = Pnet(t) – Ps(t)

10

slide-11
SLIDE 11

David Irwin — UMass Amherst Staring at the Sun

SunDance Design Overview

  • 1. Build a custom model of maximum solar generation
  • Find “best” fitting valid solar curve to the data using a small amount of data
  • Can find accurately even on noisy net meter data
  • 2. Build a general model of weather’s effect on irradiance
  • Train model that maps weather metrics to fraction of clear sky irradiance
  • Use to infer fraction of clear sky irradiance at any site based on weather
  • Can train model using data from any solar sites where it is available
  • 3. Apply two models to disaggregate solar
  • Solar generation Ps(t) = Product of (1) and (2) at every time t
  • Energy consumption Pc(t) = Pnet(t) – Ps(t)

11

slide-12
SLIDE 12

David Irwin — UMass Amherst Staring at the Sun

Apply Physical Solar Performance Model

  • Use clear sky model to compute maximum irradiance
  • Search for size, efficiency, tilt, and orientation that yields the

tightest upper bound on the data

  • Apply linear temperature adjustment to data
  • Find linear constant c (~0.4%/C) that yields the tightest upper bound

12

2 4 6 8 12am 3am 6am 9am 12pm 3pm 6pm 9pm

Power (kW) Time (Hours)

Solar Data Ground Truth Best Fit 2 4 6 8 10 12 14 Winter Spring Summer Fall Power (kW) Time Solar Data Clear Sky Model 2 4 6 8 10 12 14 Winter Spring Summer Fall Power (kW) Time Solar Data Clear Sky Model(temp)

slide-13
SLIDE 13

David Irwin — UMass Amherst Staring at the Sun

  • Issues with modeling “noisy” net energy meter data
  • Power Consumption Floor – do not know zero point of solar
  • SunDance estimates based on minimum power consumption at night,

where solar power is known to be zero, to adjust the model

  • 4
  • 2

2 4 6 8 10 12

12am 3am 6am 9am 12pm 3pm 6pm 9pm

Power (kW) Time (Hours)

Net Meter Data Ground Truth Power Floor

Modeling Net Meter Data

13

power floor

slide-14
SLIDE 14

David Irwin — UMass Amherst Staring at the Sun

Modeling Net Meter Data

  • Issues with modeling “noisy” net energy meter data
  • Consumption “Noise” – reduces solar generation like weather
  • SunDance robust as long as at least one datapoint exists where solar

generation is near its maximum potential and energy consumption is low

14

  • 4
  • 2

2 4 6 8 10 12

12am 3am 6am 9am 12pm 3pm 6pm 9pm

Power (kW) Time (Hours)

Net Meter Data Ground Truth

slide-15
SLIDE 15

David Irwin — UMass Amherst Staring at the Sun

  • Issues with modeling “noisy” net energy meter data
  • Consumption “Noise” – reduces solar generation like weather
  • SunDance robust as long as at least one datapoint exists where solar

generation is near its maximum potential and energy consumption is low

Modeling Net Meter Data

15

best fit model

  • 4
  • 2

2 4 6 8 10 12

12am 3am 6am 9am 12pm 3pm 6pm 9pm

Power (kW) Time (Hours)

Net Meter Data Ground Truth Best Fit

slide-16
SLIDE 16

David Irwin — UMass Amherst Staring at the Sun

  • Can build highly accurate and custom maximum generation

models with a minimal amount of net meter data

  • In the limit, we only need the “right” two datapoints
  • Solar generation is near maximum, energy consumption is near minimum
  • There is a significant temperature difference
  • Accuracy changes little when using pure solar or net meter data

Building a Maximum Generation Model

16

slide-17
SLIDE 17

David Irwin — UMass Amherst Staring at the Sun

Modeling Net Meter Data

  • Verify using data from 10 more solar sites
  • Manually measure module tilt and orientation
  • Find values close to ground-truth using minimal data
  • Tilt slightly less accurate – difficult to distinguish between different

tilts and different module areas/efficiencies

17

10 20 30 40 50 60 1 2 3 4 5 6 7 8 9 10

Tilt (°) Solar Sites

SunDance Ground Truth SunDance(2 days) 30 60 90 120 150 180 210 240 270 300 1 2 3 4 5 6 7 8 9 10

Orientation (°) Solar Sites

SunDance Ground Truth SunDance(2 days)

slide-18
SLIDE 18

David Irwin — UMass Amherst Staring at the Sun

SunDance Design Overview

  • 1. Build a custom model of maximum solar generation
  • Find “best” fitting valid solar curve to the data
  • Can find accurately even on noisy net meter data
  • 2. Build a general model of weather’s effect on irradiance
  • Train model that maps weather metrics to fraction of clear sky irradiance
  • Use to infer fraction of clear sky irradiance at any site based on weather
  • Can train model using data from any solar sites where it is available
  • 3. Apply two models to disaggregate solar
  • Solar generation Ps(t) = Product of (1) and (2) at every time t
  • Energy consumption Pc(t) – Pnet(t) – Ps(t)

18

slide-19
SLIDE 19

David Irwin — UMass Amherst Staring at the Sun

Building a General Weather Model

  • Weather effects – blocks solar irradiance from reaching module
  • Primarily clouds, but also humidity, dewpoint, wind, etc. increase particulates
  • Weather’s impact on blocking solar irradiance is complex
  • Data above from three locations at different times for three different

weather conditions varies widely

19

2 4 6 8 10 20 40 60 80 100

Raw Power (kW) Sample Hour-long Periods

Clear Overcast Light Clouds

slide-20
SLIDE 20

David Irwin — UMass Amherst Staring at the Sun

Building a General Weather Model

  • Key insight – same weather reduces the maximum solar

irradiance by the same fraction

  • Independent of location, time, magnitude, etc.
  • Can infer weather’s effect at one location from others

20

20 40 60 80 100 20 40 60 80 100

Percentage Max (%) Sample Hour-long Periods

Clear Overcast Light Clouds

Clear Light Clouds Overcast

slide-21
SLIDE 21

David Irwin — UMass Amherst Staring at the Sun

Building a General Weather Model

  • Build using supervised machine learning
  • Input – weather metrics, e.g., cloud cover, humidity, dew point, etc.
  • Output – the fraction of the clear sky solar generation
  • Construct single training set using data from many different solar sites

where “pure” solar training data is available

  • Use SVMs, but compatible with any machine learning model

10 20 30 40 50 60 70 80 SVM-Linear SVM-Poly SVM-RBF Linear-OLS Linear-LASSO Linear-Ridge Linear-Bayesian

MAPE Models

21

slide-22
SLIDE 22

David Irwin — UMass Amherst Staring at the Sun

Building a General Weather Model

  • Key insight – we can use our maximum generation model to

infer fraction of solar irradiance from solar power data

  • More solar power data available than pyranometer data
  • Physical effects “cancel out” in the equation below

Pactual(t) = Iactual(t)*k*(cos(90-Θ)*sin(β)*cos(Φ-α)+sin(90-Θ)*cos(β)) Psmax(t) = Imax(t)*k*(cos(90-Θ)*sin(β)*cos(Φ-α)+sin(90-Θ)*cos(β))

22

  • Ratio above is called the clear sky index
  • Widely used in solar forecasting
  • Enables us to estimate a clear sky index from solar power data
  • Potentially useful for estimating ground-level irradiance
slide-23
SLIDE 23

David Irwin — UMass Amherst Staring at the Sun

SunDance Design Overview

  • 1. Build a custom model of maximum solar generation
  • Find “best” fitting valid solar curve to the data
  • Can find accurately even on noisy net meter data
  • 2. Build a general model of weather’s effect on irradiance
  • Train model that maps weather metrics to fraction of clear sky irradiance
  • Use to infer fraction of clear sky irradiance at any site based on weather
  • Can train model using data from any solar sites where it is available
  • 3. Apply two models to disaggregate solar
  • Solar generation Ps(t) = Product of (1) and (2) at every time t
  • Energy consumption Pc(t) = Pnet(t) – Ps(t)

23

slide-24
SLIDE 24

David Irwin — UMass Amherst Staring at the Sun

Implementation and Evaluation

  • Implement SunDance in python
  • Use simple Bird clear sky generation model, weather data from Weather

Underground, and Scikit-learn machine learning library

  • Compare SunDance with a supervised approach for 100 sites
  • Supervised approach uses exact same method as SunDance
  • Supervised approach - builds and trains each site’s maximum generation

model and weather model on historical solar generation data from that site

  • SunDance – assumes no access to already disaggregated training data
  • Use MAPE from sunrise to sunset as the evaluation metric

24

slide-25
SLIDE 25

David Irwin — UMass Amherst Staring at the Sun

Disaggregating Solar

  • Illustration on a net-zero home
  • Top – net meter data
  • Bottom – disaggregated solar

25

  • 4
  • 3
  • 2
  • 1

1 2 3 4 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

Net Meter (kW) Time (Days)

Net Meter Data

1 2 3 4 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

Generation (kW) Time (Days)

MAPE: 26.178426732 SunDance Ground truth

slide-26
SLIDE 26

David Irwin — UMass Amherst Staring at the Sun

Evaluation

  • Accuracy close to that of the supervised approach
  • Improves as consumption-to-solar ratio decreases
  • MAPE highly sensitive to small errors near sunrise/sunset
  • SunDance very close to supervised approach over middle of the day

10 20 30 40 50 60 70 10 20 30 40 50 60 70 80 90 100

MAPE Building ID

Supervised SunDance(Mid-day)

1 3 8 9 13 20 21 23 24 24 25 26 26 27 28 30 30 31 32 33 34 36 37 38 39 39 41 42 43 43 44 45 46 46 49 51 52 53 55 56 56 56 56 58 58 61 61 61 62 62 64 64 65 66 67 72 73 74 76 76 78 80 82 86 86 88 90 92 92 93 98 98 99 99 100100102 103 104 104106109 109111 111 112 112113114119122 124 129 159 159160 164184 193

10 20 30 40 50 60 70 10 20 30 40 50 60 70 80 90 100

MAPE Building ID

Supervised SunDance

1 3 8 9 13 20 21 23 24 24 25 26 26 27 28 30 30 31 32 33 34 36 37 38 39 39 41 42 43 43 44 45 46 46 49 51 52 53 55 56 56 56 56 58 58 61 61 61 62 62 64 64 65 66 67 72 73 74 76 76 78 80 82 86 86 88 90 92 92 93 98 98 99 99 100 100 102 103104 104106109109 111 111 112112113114119122 124 129 159159160 164184 193

26

slide-27
SLIDE 27

David Irwin — UMass Amherst Staring at the Sun

Summary

  • SunDance - “Behind the Meter” Solar Disaggregation
  • Leverages multiple insights into fundamental relationships between

location, weather, physical characteristics, and solar generation

  • Achieves similar accuracy without access to solar training data as a

fully supervised approach with complete access to solar training data

  • Requires little historical net meter data to build model
  • Enables utilities to accurately monitor solar data
  • Provides training data to support other solar energy analytics

27

slide-28
SLIDE 28

David Irwin — UMass Amherst Staring at the Sun

This Talk – Discuss Two Black-box Techniques

  • 2. Solar Localization
  • Determine location from “anonymous” solar energy data
  • Both a privacy threat and/or a potentially useful tool
  • Location is highly useful contextual information when analyzing energy data

28

slide-29
SLIDE 29

David Irwin — UMass Amherst Staring at the Sun

Privacy Implications

  • Energy data routinely monitored by third-parties, including…
  • …utilities, solar installers, researchers, governments, etc.
  • Not treated as sensitive if “anonymized”
  • Found ~28k “anonymous” homes making data available over public Internet

29

slide-30
SLIDE 30

David Irwin — UMass Amherst Staring at the Sun

Exploiting Energy Data using Analytics

  • Many companies actively working to develop energy data analytics
  • Identify energy waste to improve energy-efficiency
  • May also provide deep insights into user behavior
  • What are a home’s occupancy patterns?
  • How often do occupants go out for vacations?
  • How often do occupants eat-in versus go out to eat?
  • Privacy implications are less concerning for anonymized data
  • Cannot associate behaviors with specific people

30

slide-31
SLIDE 31

David Irwin — UMass Amherst Staring at the Sun

This is Real

  • Real public job advertisement for an energy analytics startup

31

slide-32
SLIDE 32

David Irwin — UMass Amherst Staring at the Sun

Exploiting Energy Data using Analytics

  • Policies for handling energy data are still evolving
  • DOE’s Data Privacy and the Smart Grid: A Voluntary Code of Conduct
  • Finalized on January 8th, 2015
  • Does not require user consent to release “anonymized” energy data
  • Defined as user account information: name, address, SSN, etc.

32

’ Consent Not Required: Prior customer consent is not required to disclose Customer Data in the case of: ’ ’ ’ (4) Aggregated or Anonymized Data. Service Providers can share Aggregated or Anonymized data with Third Parties without first obtaining customer consent if the methodology used to aggregate or anonymize Customer Data strongly limits the likelihood of reidentification of individual customers or their Customer Data from the aggregated or Anonymized data set. ’ ’

slide-33
SLIDE 33

David Irwin — UMass Amherst Staring at the Sun

Key Insight

  • Solar energy data is not anonymous
  • Every location on Earth has a unique solar signature
  • Sun’s position in the sky is unique at each location at every moment
  • E.g., unique sunrise, sunset, and solar noon time each day
  • Solar data embeds detailed location information

33

  • 100

100 200 300 400 7 am 9 am 11am 1pm 3pm 5pm

Power (w) Time (hour)

sunrise sunrise sunset sunset solar noon solar noon

slide-34
SLIDE 34

David Irwin — UMass Amherst Staring at the Sun

Problem

  • How to localize the source of anonymous solar data?
  • Explore severity and threat of solar localization
  • Depending on perspective, could also be a useful tool
  • Significant prior work on estimating solar output based on location
  • No work on estimating location based on solar output
  • SunSpot – system for localizing anonymous solar-powered homes

based on their solar energy data

  • Inform evolving policies on handling energy data that includes solar
  • Reconsider current notions of anonymity in energy data

34

slide-35
SLIDE 35

David Irwin — UMass Amherst Staring at the Sun

Basic Approach

  • Location uniquely identified by a latitude and longitude
  • Latitude – uniquely identified by the daylength [sunrise->sunset]
  • Duration from first to last times of positive solar generation
  • Longitude – uniquely identified by time of solar noon
  • Maximum solar generation
slide-36
SLIDE 36

David Irwin — UMass Amherst Staring at the Sun

Basic Approach

  • Location uniquely identified by a latitude and longitude
  • Latitude – uniquely identified by the daylength [sunrise->sunset]
  • Duration from first to last times of positive solar generation
  • Longitude – uniquely identified by time of solar noon
  • Maximum solar generation
slide-37
SLIDE 37

David Irwin — UMass Amherst Staring at the Sun

Basic Approach

  • Location uniquely identified by a latitude and longitude
  • Latitude – uniquely identified by the daylength [sunrise->sunset]
  • Duration from first to last times of positive solar generation
  • Longitude – uniquely identified by time of solar noon
  • Maximum solar generation
slide-38
SLIDE 38

David Irwin — UMass Amherst Staring at the Sun

Deriving Location from the Sun

  • Algorithms for deriving location from the sun are obscure
  • Typically used for celestial navigation of primitive ships
  • No widely-used open-source libraries or online APIs
  • Algorithms for deriving sunrise/sunset for location are common
  • Highly accurate but not easily reversible
  • Many open-source libraries and online APIs available
  • Leverage existing APIs as a sub-routine to conduct a binary

search for location given sunrise/sunset times

  • (sunrise, sunset) == (daylength, solar noon)
slide-39
SLIDE 39

David Irwin — UMass Amherst Staring at the Sun

Deriving Latitude given Daylength

  • Note that….
  • …in winter, daylength decreases moving south to north
slide-40
SLIDE 40

David Irwin — UMass Amherst Staring at the Sun

Deriving Latitude given Daylength

  • Note that….
  • …in winter, daylength decreases moving south to north
  • …in summer, daylength increases moving south to north
slide-41
SLIDE 41

David Irwin — UMass Amherst Staring at the Sun

Deriving Latitude given Daylength

  • Note that….
  • …in winter, daylength decreases moving south to north
  • …in summer, daylength increases moving south to north
  • Binary Search using API

Latitude 0

slide-42
SLIDE 42

David Irwin — UMass Amherst Staring at the Sun

Deriving Latitude given Daylength

  • Note that….
  • …in winter, daylength decreases moving south to north
  • …in summer, daylength increases moving south to north
  • Binary Search using API

Latitude 0 Latitude 90 Latitude -90

slide-43
SLIDE 43

David Irwin — UMass Amherst Staring at the Sun

Deriving Latitude given Daylength

  • Note that….
  • …in winter, daylength decreases moving south to north
  • …in summer, daylength increases moving south to north
  • Binary Search using API

Latitude 0 Latitude 45 Latitude 90 Latitude -90

slide-44
SLIDE 44

David Irwin — UMass Amherst Staring at the Sun

Deriving Latitude given Daylength

  • Note that….
  • …in winter, daylength decreases moving south to north
  • …in summer, daylength increases moving south to north
  • Binary Search using API

Latitude 0 Latitude 45 Latitude 22.5 Latitude 90 Latitude -90

slide-45
SLIDE 45

David Irwin — UMass Amherst Staring at the Sun

Deriving Latitude given Daylength

  • Note that….
  • …in winter, daylength decreases moving south to north
  • …in summer, daylength increases moving south to north
  • Binary Search using API

Latitude 0 Latitude 45 Latitude 22.5 Latitude 33.75 Latitude 90 Latitude -90

slide-46
SLIDE 46

David Irwin — UMass Amherst Staring at the Sun

Deriving Latitude given Daylength

  • Note that….
  • …in winter, daylength decreases moving south to north
  • …in summer, daylength increases moving south to north
  • Binary Search using API
  • Accuracy below on June 21st (summer solstice)
slide-47
SLIDE 47

David Irwin — UMass Amherst Staring at the Sun

Deriving Latitude given Daylength

  • Note that….
  • …in winter, daylength decreases moving south to north
  • …in summer, daylength increases moving south to north
  • Binary Search using API

800 1600 2400 3200 4000 60 120 180 240 300 360

Accuracy (m) Day of Year

slide-48
SLIDE 48

David Irwin — UMass Amherst Staring at the Sun

Deriving Longitude given Time of Solar Noon

  • Binary Search using API
  • Use API to compute solar noon for 0° and ±180°
  • Pick any latitude value
  • Select region with desired solar noon time
  • Either [0°,180°] or [0°,-180°]
  • Divide selected interval by two ([0°,90°], [0°,-90°]) and repeat…
  • …until longitude does not change
  • 0° latitude
slide-49
SLIDE 49

David Irwin — UMass Amherst Staring at the Sun

Deriving Longitude given Time of Solar Noon

  • Binary Search using API
  • Use API to compute solar noon for 0° and ±180°
  • Pick any latitude value
  • Select region with desired solar noon time
  • Either [0°,180°] or [0°,-180°]
  • Divide selected interval by two ([0°,90°], [0°,-90°]) and repeat…
  • …until longitude does not change

200 400 600 800 1000 1200 1400

  • 90
  • 60
  • 30

30 60 90

10000 20000 30000 40000

Longitude Accuracy (m) Longitude Accuracy (m) Latitude (°)

Second Resolution Minute Resolution

slide-50
SLIDE 50

David Irwin — UMass Amherst Staring at the Sun

SunSpot Challenge

  • Ideally, take solar generation from one day
  • Extract precise sunrise, sunset, and solar noon time (to the second)
  • Directly compute latitude and longitude accurately
  • But, solar cells are highly imprecise sensors of the sun
  • Error translates to hundreds-to-thousands of miles
  • 100

100 200 300 400 7 am 9 am 11am 1pm 3pm 5pm

Power (w) Time (hour)

Sunrise First +Point Solar Noon Maximum Power Last +Point Sunset

slide-51
SLIDE 51

David Irwin — UMass Amherst Staring at the Sun

Solar Imprecision and Inefficiency

  • Many dimensions of imprecision
  • Solar cell inefficiency – sunrise/sunset detection lag
  • Variable weather – may be cloudy at sunrise/sunset/solar noon
  • Shading from obstructions – nearby buildings, trees
  • Non-optimal physical properties – tilt/orientation
  • Non-optimal electrical characteristics – variations in grid voltage
  • Meter inaccuracy – typically 0.5% to 2% off
  • Accurate localization challenging using one day’s data
  • Impossible if day is near the equinox
  • SunSpot leverages data across multiple days
slide-52
SLIDE 52

David Irwin — UMass Amherst Staring at the Sun

Inferring Longitude from Noisy Solar Data

  • Day-to-day changes in solar noon over the year are the same at

every location on Earth

  • 31 minutes of movement captured by the Equation of Time (EoT)
  • Solar noon should precisely track the EoT

44 88 132 176 220 60 120 180 240 300 360

∆Time(minutes) Day of Year

slide-53
SLIDE 53

David Irwin — UMass Amherst Staring at the Sun

Inferring Longitude from Noisy Solar Data

  • Day-to-day changes in solar noon over the year are the same at

every location on Earth

  • 31 minutes of movement captured by the Equation of Time (EoT)
  • Solar noon should precisely track the EoT

44 88 132 176 220 60 120 180 240 300 360

∆Time(minutes) Day of Year

slide-54
SLIDE 54

David Irwin — UMass Amherst Staring at the Sun

Inferring Longitude from Noisy Solar Data

  • Day-to-day changes in solar noon over the year are the same at

every location on Earth

  • To “fit” EoT, we move it up and down the y-axis
  • Stop where it overlaps the most absolute data points (within ±1m)

44 88 132 176 220 60 120 180 240 300 360

∆Time(minutes) Day of Year

slide-55
SLIDE 55

David Irwin — UMass Amherst Staring at the Sun

Inferring Longitude from Noisy Solar Data

  • Day-to-day changes in solar noon over the year are the same at

every location on Earth

  • To “fit” EoT, we move it up and down the y-axis
  • Stop where it overlaps the most absolute data points (within ±1m)

44 88 132 176 220 60 120 180 240 300 360

∆Time(minutes) Day of Year

slide-56
SLIDE 56

David Irwin — UMass Amherst Staring at the Sun

Inferring Latitude from Noisy Solar Data

  • Problem: sunrise/sunset always lags solar data detection
  • Again, recall that daylength varies with latitude
  • …in fall/winter, daylength shorter moving south to north
  • …in spring/summer, daylength longer moving south to north
  • In fall/winter, always infer location north of actual location
  • In spring/summer, always infer location south of actual location

200 400 600 800 1000 60 120 180 240 300 360

Daylength (minutes) Day of Year

slide-57
SLIDE 57

David Irwin — UMass Amherst Staring at the Sun

Inferring Latitude from Noisy Solar Data

  • Problem: sunrise/sunset always lags solar data detection
  • Again, recall that daylength varies with latitude
  • …in fall/winter, daylength shorter moving south to north
  • …in spring/summer, daylength longer moving south to north
  • In fall/winter, always infer location north of actual location
  • In spring/summer, always infer location south of actual location

200 400 600 800 1000 60 120 180 240 300 360

Daylength (minutes) Day of Year

slide-58
SLIDE 58

David Irwin — UMass Amherst Staring at the Sun

Inferring Latitude from Noisy Solar Data

  • Problem: sunrise/sunset always lags solar data detection
  • Again, recall that daylength varies with latitude
  • …in fall/winter, daylength shorter moving south to north
  • …in spring/summer, daylength longer moving south to north
  • In fall/winter, always infer location north of actual location
  • In spring/summer, always infer location south of actual location

200 400 600 800 1000 60 120 180 240 300 360

Daylength (minutes) Day of Year

slide-59
SLIDE 59

David Irwin — UMass Amherst Staring at the Sun

Inferring Latitude from Noisy Solar Data

  • Problem: sunrise/sunset always lags solar data detection
  • Again, recall that daylength varies with latitude
  • …in fall/winter, daylength shorter moving south to north
  • …in spring/summer, daylength longer moving south to north
  • In fall/winter, always infer location north of actual location
  • In spring/summer, always infer location south of actual location

200 400 600 800 1000 60 120 180 240 300 360

Daylength (minutes) Day of Year

SunSpot

slide-60
SLIDE 60

David Irwin — UMass Amherst Staring at the Sun

Inferring Latitude from Noisy Solar Data

  • Problem: sunrise/sunset always lags solar data detection
  • Again, recall that daylength varies with latitude
  • …in fall/winter, daylength shorter moving south to north
  • …in spring/summer, daylength longer moving south to north
  • In fall/winter, always infer location north of actual location
  • In spring/summer, always infer location south of actual location

200 400 600 800 1000 60 120 180 240 300 360

Daylength (minutes) Day of Year

SunSpot

slide-61
SLIDE 61

David Irwin — UMass Amherst Staring at the Sun

Localizing a Specific Home

  • Previous steps identify only a region of interest
  • Limited by data resolution, and other inaccuracies
  • Search satellite imagery for solar-powered homes within region
  • Filter out land area without man-made structures (>97%)
  • Apply image recognition (either manually or algorithmically)
  • Filter identified solar sites by size of deployment, physical properties, etc.
slide-62
SLIDE 62

David Irwin — UMass Amherst Staring at the Sun

Implementation

  • SunSpot implemented in python
  • Uses available online APIs for computing sunrise/sunset for locations
  • For latitude, use to derive daylength curves
  • For longitude, use to derive solar noon time
  • Uses public satellite imagery from Google Earth
  • Leverage Google Maps API to extract images with man-made structures
  • Apply OpenCV to remove images without >5% black pixels
  • Automatically identify solar sites from images
  • Feed images to Mechanical Turk to identify solar sites
  • Could also include adjustments for non-south facing sites
slide-63
SLIDE 63

David Irwin — UMass Amherst Staring at the Sun

Evaluation

  • Amazon Mechanical Turk
  • A crowdsourcing Internet marketplace
  • Leverages humans to perform routine tasks

Task: Is there a solar panel in the image? Task: Is there a solar panel in the image?

  • Yes. No.
  • Yes. No.
slide-64
SLIDE 64

David Irwin — UMass Amherst Staring at the Sun

Evaluation

  • Three homes with per-second data resolution
  • Maximum localization precision ~500m
  • Inaccuracy ranges from 10-20km
slide-65
SLIDE 65

David Irwin — UMass Amherst Staring at the Sun

Evaluation

  • Microbenchmarks of image processing using Mechanical Turk
  • Took random urban area with 2km radius (or 12.6km2)
  • 82% covered with man-made structures
  • Extract and filtered satellite images from Google Earth
  • Ground truth - manually checked these images for visible solar sites
  • Programmatically submitted images to Mechanical Turk
  • 99% categorized within 30m, with average time ~42 seconds
  • 93% accurate - identified all but 2 solar sites we identified manually
  • Total cost: $170.82 or $13.6/km2
  • Costs lower, the more rural the area
  • More privacy in urban areas – offers k-anonymity
slide-66
SLIDE 66

David Irwin — UMass Amherst Staring at the Sun

Prior Work

  • Estimating and predicting solar generation from location
  • Commonly done by solar installers
  • Variety of models have been proposed
  • SunSpot does the opposite – estimates location from generation
  • Energy analytics on smart meter data
  • Analytics represent a potential privacy threat
  • Not significant, as long as energy data is anonymous
  • SunSpot exposes a new and different vulnerability
  • Data most believe is anonymous may not be
slide-67
SLIDE 67

David Irwin — UMass Amherst Staring at the Sun

Summary

  • SunSpot first work to expose this localization threat
  • Some issues
  • Only uses 3 datapoints per day to infer location
  • Every datapoint provides identifying information
  • Only uses solar signature
  • Weather signature also provides identifying information
  • Requires data over many days
  • Can reduce using more datapoints and weather
  • Requires fine-grained data for precision (second or minute)
  • 5-minute to 1-hour resolution more typical
slide-68
SLIDE 68

David Irwin — UMass Amherst Staring at the Sun

Combining SunDance and SunSpot?

  • Can we accurately localize coarse net meter data?
  • Much more significant privacy threat
  • SunSpot requires second- or minute-level data to localize a region
  • SunDance uses 1-hour resolution, since weather archives are 1-hour
  • Current Work
  • Looking at adding weather signatures to solar signatures
  • Look to be much more accurate with coarser data
  • Leveraging irradiance estimates from visible satellite imagery
  • Highly accurate and updated every ~15 minutes
  • Spatial resolution of 1km2
slide-69
SLIDE 69

David Irwin — UMass Amherst Staring at the Sun

Preserving Privacy?

  • Many possible options with different tradeoffs
  • 1. Remove timestamps from data
  • Pro – cannot identify longitude from solar signature
  • Con – can still identify latitude from solar signature
  • Can probably identify longitude from weather signature
  • Timestamps are useful for well-intentioned analytics
  • 2. Obscure the time of sunrise, sunset, and solar noon
  • Use battery, or actively control solar output at inverter/optimizer
  • Pro – could likely mitigate SunSpot attack using little energy
  • Con – more sophisticated attacks that use whole signature still possible
  • Does not address weather-based attacks, potentially accurate at day-level
  • Less efficient, reduces our generation
slide-70
SLIDE 70

David Irwin — UMass Amherst Staring at the Sun

Questions

70

Questions?