NILMTK An Open Source Toolkit for Non-intrusive Load Monitoring - - PowerPoint PPT Presentation

nilmtk
SMART_READER_LITE
LIVE PREVIEW

NILMTK An Open Source Toolkit for Non-intrusive Load Monitoring - - PowerPoint PPT Presentation

1 NILMTK An Open Source Toolkit for Non-intrusive Load Monitoring NILMTK team 2 Haimonti Alex Rogers Dutta Nipun Batra Oliver Parson Amarjeet Singh Mani Srivastava William Jack Kelly Knottenbelt 3 Non-intrusive load monitoring


slide-1
SLIDE 1

NILMTK

An Open Source Toolkit for Non-intrusive Load Monitoring

1

slide-2
SLIDE 2

NILMTK team

Nipun Batra

Amarjeet Singh Mani Srivastava Jack Kelly William Knottenbelt Haimonti Dutta Oliver Parson Alex Rogers

2

slide-3
SLIDE 3

Non-intrusive load monitoring (Energy disaggregation)

“Process of estimating the energy consumed by individual appliances given just a whole-house power meter reading”

3

slide-4
SLIDE 4

Wait a minute! This sounds complicated Would it help?

4

slide-5
SLIDE 5

Jane goes to the market

5

slide-6
SLIDE 6

Jane spends 200 pounds on her purchases

6

slide-7
SLIDE 7

Jane’s husband John is worried with the expenses

7

slide-8
SLIDE 8

He spends some time and looks at the purchase list

8

slide-9
SLIDE 9

Do you think the itemized billing helped him?

NILM is the same, but for energy!

9

slide-10
SLIDE 10

Quiz time!

Identify this famous CS scientist

10

slide-11
SLIDE 11

Quiz time!

Identify this famous CS scientist

11

That ain’t any great

  • scientist. That’s me on my

first birthday in 1990… This is not too far from the time when NILM was first discussed

slide-12
SLIDE 12

Giving credit where it is due

12

slide-13
SLIDE 13

NILM interest explosion

  • 1. National smart meter rollouts
  • 2. Reduced hardware costs
  • 3. International meetings

– NILM workshop 2012, 2014; EPRI NILM 2013

  • 4. Public datasets
  • 5. Startups

13

slide-14
SLIDE 14

“Data is the new oil”

  • 9 NILM datasets and counting (few not

specific to NILM)

  • Across 6 countries (India, UK, US,

Canada, EU)

  • Measure aggregate and appliance level

data

  • Across 3 colors 

– REDD – BLUED – GREEND

14

slide-15
SLIDE 15

The industry is interested!

15

slide-16
SLIDE 16

So, is everything so rosy? Not quite! Else we won’t be here

16

slide-17
SLIDE 17

The scientific method

“The scientific method is a body of techniques for investigating phenomena, acquiring new knowledge, or

correcting and integrating previous knowledge” as per

wiki

17

slide-18
SLIDE 18

3 core obstacles

preventing comparison of state-of-the-art

18

slide-19
SLIDE 19
  • 1. Hard to assess generality
  • Subtle differences in aims of

different data sets

  • Previous contributions evaluated only
  • n single dataset.
  • Non-trivial to set up similar

experimental conditions for direct comparison.

19

slide-20
SLIDE 20
  • 2. Lack of comparison against

same benchmarks

  • Newly proposed algorithms rarely

compared against same benchmarks.

  • Lack of “open source” reference

algorithms  often lead to reimplementation.

20

slide-21
SLIDE 21
  • 3. “Inconsistent”

disaggregation performance metrics

  • Different performance metrics proposed

in the past.

  • Different formulae for same metric, eg.

4+ versions of “energy assigned”

21

slide-22
SLIDE 22

What is NILMTK?

Open source NILM toolkit

22

slide-23
SLIDE 23

What does it do?

Enable easy comparative analysis of NILM algorithms across data sets.

23

slide-24
SLIDE 24

How does it do that?

Provides a pipeline from data sets to metrics to lower the entry barrier for researchers.

24

slide-25
SLIDE 25

NILMTK pipeline

REDD BLUED UK- DALE Statistics NILMTK- DF Training Preprocessing Model Disaggregation Metrics

Data interface

25

slide-26
SLIDE 26

Data Format

REDD BLUED UK- DALE Statistics NILMTK- DF Training Preprocessing Model Disaggregation Metrics

Data interface

26

slide-27
SLIDE 27

Data Format

  • We propose NILMTK-DF: a common data

format.

  • Provide importers for 6 datasets: REDD,

SMART*, Pecan street, iAWE, AMPds, UK-DALE

  • Both flat file and efficient binary storage

format

27

slide-28
SLIDE 28

The fun of data!

28

slide-29
SLIDE 29

Standardizing nomenclature

Fridge Refrigerator FGE 29

slide-30
SLIDE 30

Metadata

  • Geographic coordinates
  • Type of appliance- hot, cold, dry?
  • Metering hierarchy
  • Parameters measured

30

slide-31
SLIDE 31

Standard nomenclature + Metadata + Datasets =

Comparing power draw of washing machines across US (REDD) and UK (UK-DALE)

31

slide-32
SLIDE 32

Standard nomenclature + Metadata + Datasets =

Top 5 appliance according to energy consumption across geographies

32 US UK INDIA

slide-33
SLIDE 33

NILMTK pipeline

REDD BLUED UK- DALE Statistics NILMTK- DF Training Preprocessing Model Disaggregation Metrics

Data interface

33

slide-34
SLIDE 34

Statistics

10 20 30 40 50 60 70 80 90 100 REDD Smart* Pecan AMPds iAWE UK_DALE

% energy submetered

  • Energy submetered: Sum of energy of all

appliance/Energy at mains level

  • More energy submetered  More ground truth

34

slide-35
SLIDE 35

Statistics

  • Appliance usage patterns
  • Correlations with weather
  • Appliance power demands

35

slide-36
SLIDE 36

Diagnostics

  • Every data set has problems  NILMTK provides

diagnostic functions for common problems.

  • %Lost samples (per interval and whole), uptime

% lost samples in house 1 of REDD dataset

36

slide-37
SLIDE 37

Preprocessing

REDD BLUED UK- DALE Statistics NILMTK- DF Training Preprocessing Model Disaggregation Metrics

Data interface

37

slide-38
SLIDE 38

Preprocessing

  • Correct common problems (as per

diagnosis).

  • Other standard NILM preprocessors:

– Interpolating, filtering implausible – Downsample to lower frequency – Select Top-k-appliances by energy consumption

38

slide-39
SLIDE 39

Heart of NILMTK

REDD BLUED UK- DALE Statistics NILMTK- DF Training Preprocessing Model Disaggregation Metrics

Data interface

39

slide-40
SLIDE 40

Training

  • NILMTK provides two benchmark

algorithms

–Combinatorial optimization (CO) [Proposed by Hart] –Factorial hidden Markov model (FHMM) [More recent, more complex]

40

slide-41
SLIDE 41

Model

  • Beyond the usual train and disaggregate,

NILMTK allows importing and exporting learnt models

  • Allows NILM to be deployed in “real world

settings”

  • Action speaks louder than words!! Demo

follows!

41

slide-42
SLIDE 42

Disaggregate!

  • Quite a bit of work before we

disaggregate

  • We performed

– CO and FHMM based disaggregation across first home of each dataset – Detailed disaggregation analysis across the home in iAWE (dataset from India)

42

slide-43
SLIDE 43

Disaggregation across multiple datasets

  • CO as good as FHMM across iAWE,

UKPD, Pecan datasets

–Space heating contributes 60% in Pecan and 35% in iAWE. Both approaches able to detect with fair ease

43

And I thought that CO was really

  • utdated…
slide-44
SLIDE 44

Disaggregation across multiple datasets

44

  • FHMM outperforms CO across REDD,

Smart*, AMPds

  • This is expected as FHMM models time

variations.

  • CO exponentially quicker than FHMM
slide-45
SLIDE 45

Detailed disaggregation in iAWE dataset (India)

  • CO and FHMM perform similar
  • Appliances such as air conditioners

way easier to disaggregate

  • Complex appliances (laptops and

washing machines) – not so good 

45

slide-46
SLIDE 46

NILMTK pipeline

REDD BLUED UK- DALE Statistics NILMTK- DF Training Preprocessing Model Disaggregation Metrics

Data interface

46

slide-47
SLIDE 47

Metrics

  • NILMTK provides:

–General machine learning metrics

  • Precision, Recall, F-score

–Specialized metrics for NILM

  • Error in total energy assigned, RMS error in

assigned power,..

–Both event based and total power based NILM metrics.

47

slide-48
SLIDE 48

Demo time!!

48

slide-49
SLIDE 49

Conclusions

Three core challenges in NILM research

  • 1. Hard to address generality
  • 2. Lack of comparison against same benchmarks
  • 3. Inconsistent disaggregation performance metrics

How NILMTK addresses these challenges

  • 1. Standard input and output formats (Addresses #1)
  • 2. Parsers for 6 NILM data sets (Addresses #1, #2)
  • 3. Two benchmark NILM algorithms (Addresses #1, #2)
  • 4. Statistics, diagnostics and preprocessing (Addresses #1,

#2)

  • 5. Metrics for different NILM use cases (Addresses #1)

49

slide-50
SLIDE 50

Backup

50

slide-51
SLIDE 51

Combinatorial optimization

  • Seeks to find the optimal combination of

appliances’ power draw to minimize residual energy.

  • Similar to subset-sum problem and thus

NP-complete 

  • Power draw is not related in time

51

slide-52
SLIDE 52

Combinatorial optimization

Appliance Off power On power Air conditioner (AC) 2000 Refrigerator 200

If total power observed = 210  AC is OFF and Refrigerator is ON

52

slide-53
SLIDE 53

Combinatorial optimization

Appliance Off power On power Air conditioner (AC) 2000 Refrigerator 200

If total power observed = 2000  AC is ON and Refrigerator is OFF

53

slide-54
SLIDE 54

Combinatorial optimization

Appliance Off power On power Air conditioner (AC) 2000 Refrigerator 200

If total power observed = 2230  AC is ON and Refrigerator is ON

54

slide-55
SLIDE 55

FHMM

  • Each appliance modeled as HMM

– Power draw related in time If TV is on right now, likely to be on next second.

  • Exact inference scales worse than CO

55

slide-56
SLIDE 56

A bit of history

Seminal work on NILM done at MIT dates back to early 1980s – A good 6-7 years before I was born!

56

slide-57
SLIDE 57

Field progress

10 20 30 40 50 60 70 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004

# Papers citing the seminal work per year

What happened here?

57