Web Performance Optimization: Analytics Wim Leers Promotor: Prof. - - PowerPoint PPT Presentation

web performance optimization analytics
SMART_READER_LITE
LIVE PREVIEW

Web Performance Optimization: Analytics Wim Leers Promotor: Prof. - - PowerPoint PPT Presentation

Web Performance Optimization: Analytics Wim Leers Promotor: Prof. dr. Jan Van den Bussche Web Performance Optimization Speed matters! Source: http://www.useit.com/alertbox/response-times.html, Jakob Nielsen, June 21, 2010 Web Performance


slide-1
SLIDE 1

Web Performance Optimization: Analytics

Wim Leers Promotor: Prof. dr. Jan Van den Bussche

slide-2
SLIDE 2

Web Performance Optimization

  • Speed matters!

Source: http://www.useit.com/alertbox/response-times.html, Jakob Nielsen, June 21, 2010

slide-3
SLIDE 3

Web Performance Optimization

  • Speed matters!
  • 0.1 s → direct manipulation

Source: http://www.useit.com/alertbox/response-times.html, Jakob Nielsen, June 21, 2010

slide-4
SLIDE 4

Web Performance Optimization

  • Speed matters!
  • 0.1 s → direct manipulation
  • 1 s → good navigation

Source: http://www.useit.com/alertbox/response-times.html, Jakob Nielsen, June 21, 2010

slide-5
SLIDE 5

Web Performance Optimization

  • Speed matters!
  • 0.1 s → direct manipulation
  • 1 s → good navigation
  • 10 s → attention kept

Source: http://www.useit.com/alertbox/response-times.html, Jakob Nielsen, June 21, 2010

slide-6
SLIDE 6

Web Performance Optimization

  • Speed matters!
  • 0.1 s → direct manipulation
  • 1 s → good navigation
  • 10 s → attention kept
  • >10 s → bye bye!

Source: http://www.useit.com/alertbox/response-times.html, Jakob Nielsen, June 21, 2010

slide-7
SLIDE 7

How to Measure? Episodes

slide-8
SLIDE 8

How to Measure? Episodes

  • Measures “episodes” during page loading
slide-9
SLIDE 9

How to Measure? Episodes

  • Measures “episodes” during page loading
  • Real measurements: JS in browser, for each visitor
slide-10
SLIDE 10

How to Measure? Episodes

  • Measures “episodes” during page loading
  • Real measurements: JS in browser, for each visitor
  • Result: Episodes log file
slide-11
SLIDE 11

Analytics

slide-12
SLIDE 12

Analytics

  • Automatically pinpoint causes of slow page loads
slide-13
SLIDE 13

Analytics

  • Automatically pinpoint causes of slow page loads
  • e.g.:
slide-14
SLIDE 14

Analytics

  • Automatically pinpoint causes of slow page loads
  • e.g.:
  • “http://uhasselt.be/ is slow in Belgium, for users of the ISP Telenet”
slide-15
SLIDE 15

Analytics

  • Automatically pinpoint causes of slow page loads
  • e.g.:
  • “http://uhasselt.be/ is slow in Belgium, for users of the ISP Telenet”
  • “http://uhasselt.be/studenten/dossier has slowly loading CSS”
slide-16
SLIDE 16

Analytics

  • Automatically pinpoint causes of slow page loads
  • e.g.:
  • “http://uhasselt.be/ is slow in Belgium, for users of the ISP Telenet”
  • “http://uhasselt.be/studenten/dossier has slowly loading CSS”
  • “http://uhasselt.be/bib has slowly loading JS in Firefox 3”
slide-17
SLIDE 17

Analytics

  • Automatically pinpoint causes of slow page loads
  • e.g.:
  • “http://uhasselt.be/ is slow in Belgium, for users of the ISP Telenet”
  • “http://uhasselt.be/studenten/dossier has slowly loading CSS”
  • “http://uhasselt.be/bib has slowly loading JS in Firefox 3”
slide-18
SLIDE 18

Literature Study Subjects

slide-19
SLIDE 19

Literature Study Subjects

  • Data Stream Mining
slide-20
SLIDE 20

Literature Study Subjects

  • Data Stream Mining
  • Anomaly Detection
slide-21
SLIDE 21

Literature Study Subjects

  • Data Stream Mining
  • Anomaly Detection}

Data Mining: finding patterns in data

slide-22
SLIDE 22

Literature Study Subjects

  • Data Stream Mining
  • Anomaly Detection
  • OLAP: Data Cube }

Data Mining: finding patterns in data

slide-23
SLIDE 23

Literature Study Subjects

  • Data Stream Mining
  • Anomaly Detection
  • OLAP: Data Cube }

Data Mining: finding patterns in data

}

OLAP: querying multidimensional data

slide-24
SLIDE 24

Data Stream Mining

slide-25
SLIDE 25

Data Stream Mining

  • Constraints
slide-26
SLIDE 26

Data Stream Mining

  • Constraints
  • Possibly infinite data stream ⇒ approximation
slide-27
SLIDE 27

Data Stream Mining

  • Constraints
  • Possibly infinite data stream ⇒ approximation
  • Window model
slide-28
SLIDE 28

Data Stream Mining

  • Constraints
  • Possibly infinite data stream ⇒ approximation
  • Window model
  • Landmark: from beginning until now
slide-29
SLIDE 29

Data Stream Mining

  • Constraints
  • Possibly infinite data stream ⇒ approximation
  • Window model
  • Landmark: from beginning until now
  • Tilted-time: per-hour window, 24 “hour windows” → “day window”, etc.
slide-30
SLIDE 30

Data Stream Mining

  • Constraints
  • Possibly infinite data stream ⇒ approximation
  • Window model
  • Landmark: from beginning until now
  • Tilted-time: per-hour window, 24 “hour windows” → “day window”, etc.
  • Algorithms studied
slide-31
SLIDE 31

Data Stream Mining

  • Constraints
  • Possibly infinite data stream ⇒ approximation
  • Window model
  • Landmark: from beginning until now
  • Tilted-time: per-hour window, 24 “hour windows” → “day window”, etc.
  • Algorithms studied
  • Frequent Item Mining: 7
slide-32
SLIDE 32

Data Stream Mining

  • Constraints
  • Possibly infinite data stream ⇒ approximation
  • Window model
  • Landmark: from beginning until now
  • Tilted-time: per-hour window, 24 “hour windows” → “day window”, etc.
  • Algorithms studied
  • Frequent Item Mining: 7
  • Frequent Pattern Mining: 2
slide-33
SLIDE 33

Data Stream Mining: FP-Stream

Source: Mining Frequent Patterns in Data Streams at Multiple Time Granularities, Giannella; Han et al., 2003

slide-34
SLIDE 34

Data Stream Mining: FP-Stream

Source: Mining Frequent Patterns in Data Streams at Multiple Time Granularities, Giannella; Han et al., 2003

slide-35
SLIDE 35

Data Stream Mining: FP-Stream

Source: Mining Frequent Patterns in Data Streams at Multiple Time Granularities, Giannella; Han et al., 2003

slide-36
SLIDE 36

Data Stream Mining: FP-Stream

Source: Mining Frequent Patterns in Data Streams at Multiple Time Granularities, Giannella; Han et al., 2003

slide-37
SLIDE 37

Anomaly Detection

slide-38
SLIDE 38

Anomaly Detection

  • Types
slide-39
SLIDE 39

Anomaly Detection

  • Types
  • Point: e.g. rainfall in mm
slide-40
SLIDE 40

Anomaly Detection

  • Types
  • Point: e.g. rainfall in mm
  • Contextual: point + contextual attributes, e.g. rainfall in mm + lat/lon
slide-41
SLIDE 41

Anomaly Detection

  • Types
  • Point: e.g. rainfall in mm
  • Contextual: point + contextual attributes, e.g. rainfall in mm + lat/lon
  • Contextual anomaly detection algorithms categories
slide-42
SLIDE 42

Anomaly Detection

  • Types
  • Point: e.g. rainfall in mm
  • Contextual: point + contextual attributes, e.g. rainfall in mm + lat/lon
  • Contextual anomaly detection algorithms categories
  • Reduction: 1) certain context, 2) point anomaly algorithm
slide-43
SLIDE 43

Anomaly Detection

  • Types
  • Point: e.g. rainfall in mm
  • Contextual: point + contextual attributes, e.g. rainfall in mm + lat/lon
  • Contextual anomaly detection algorithms categories
  • Reduction: 1) certain context, 2) point anomaly algorithm
  • Model: 1) learn through training, 2) compare: observed vs. expected
slide-44
SLIDE 44

Anomaly Detection

  • Types
  • Point: e.g. rainfall in mm
  • Contextual: point + contextual attributes, e.g. rainfall in mm + lat/lon
  • Contextual anomaly detection algorithms categories
  • Reduction: 1) certain context, 2) point anomaly algorithm
  • Model: 1) learn through training, 2) compare: observed vs. expected
  • Algorithms studied: 2
slide-45
SLIDE 45

Anomaly Detection: Vilalta/Ma

slide-46
SLIDE 46

Anomaly Detection: Vilalta/Ma

  • Based on frequent pattern mining
slide-47
SLIDE 47

Anomaly Detection: Vilalta/Ma

  • Based on frequent pattern mining
  • Find all frequent itemsets that precede anomalies
slide-48
SLIDE 48

OLAP: Data Cube

Source: Introduction to Data Mining, Tan; Steinbach; Kumar, 2006

slide-49
SLIDE 49

OLAP: Data Cube

Source: Introduction to Data Mining, Tan; Steinbach; Kumar, 2006

slide-50
SLIDE 50

OLAP: Data Cube: Range-Sum Performance

slide-51
SLIDE 51

OLAP: Data Cube: Range-Sum Performance

  • Very common type of query
slide-52
SLIDE 52

OLAP: Data Cube: Range-Sum Performance

  • Very common type of query
  • Algorithms studied: 3
slide-53
SLIDE 53

OLAP: Data Cube: Dynamic Data Cube

Source: Data Cubes in Dynamic Environments, Geffner; Riedewald; Agrawal, 1999

slide-54
SLIDE 54

OLAP: Data Cube: Dynamic Data Cube

Source: Data Cubes in Dynamic Environments, Geffner; Riedewald; Agrawal, 1999

slide-55
SLIDE 55

Outlook

slide-56
SLIDE 56

Outlook

  • Further literature study, especially: data cubes over data streams
slide-57
SLIDE 57

Outlook

  • Further literature study, especially: data cubes over data streams
  • Implementation
slide-58
SLIDE 58

Outlook

  • Further literature study, especially: data cubes over data streams
  • Implementation
slide-59
SLIDE 59

Questions?

Thanks for your time!