
Riding the Big IoT Data Wave: Complex Analytics for IoT Data Series

Themis Palpanas, Paris Descartes University. Telecom Paristech, Paris, April 2017 (27-Apr-17).


  1. 27-Apr-17
     Slide 1: Riding the Big IoT Data Wave: Complex Analytics for IoT Data Series
     Themis Palpanas, Paris Descartes University
     Telecom Paristech, Paris, April 2017

     Slide 2: References
     • papers
       ▫ ADS: The Adaptive Data Series Index. VLDBJ 2016
         http://www.mi.parisdescartes.fr/~themisp/publications/vldbj16-ads.pdf
       ▫ Big Sequence Management: A Glimpse on the Past, the Present, and the Future. LNCS, 2016
         http://www.mi.parisdescartes.fr/~themisp/publications/sofsem16-bisem.pdf
       ▫ Query Workloads for Data-Series Indexes. KDD 2015
         http://www.mi.parisdescartes.fr/~themisp/publications/kdd15-bends.pdf
       ▫ RINSE: Interactive Data Series Exploration. VLDB 2015
         http://www.mi.parisdescartes.fr/~themisp/publications/vldb15-rinse.pdf
       ▫ Indexing for Interactive Exploration of Big Data Series. SIGMOD 2014
         http://www.mi.parisdescartes.fr/~themisp/publications/sigmod14-ads.pdf
       ▫ Beyond One Billion Time Series: Indexing and Mining Very Large Time Series Collections with iSAX2+. KAIS 2014
         http://www.mi.parisdescartes.fr/~themisp/publications/kais14-isax2plus.pdf
       ▫ iSAX 2.0: Indexing and Mining One Billion Time Series. ICDM 2010
         http://www.mi.parisdescartes.fr/~themisp/publications/icdm10-billiontimeseries.pdf
     • code and datasets
       ▫ http://www.mi.parisdescartes.fr/~themisp/isax2plus/
     • data series toolbox
       ▫ https://github.com/zoumpatianos/DSStat
     • demo
       ▫ http://daslab.seas.harvard.edu/rinse/

     Themis Palpanas - Telecom Paristech, Apr 2017

  2. Slide 3: Acknowledgements
     • Michele Linardi, Anna Gogolou, Botao Peng, Karima Echihabi (Paris Descartes University)
     • Alessandro Camerra (University of Trento)
     • Stratos Idreos, Kostas Zoumpatianos (Harvard University)
     • Yin Lou, Johannes Gehrke (Cornell University)
     • Jin Shieh, Eamonn Keogh (University of California at Riverside)

     Slide 4: Executive Summary
     • data are collected at unprecedented rates
     • they enable data-driven scientific discovery
     • many of these data are sequences
       ▫ it takes days to weeks to analyze big sequence collections

  3. Slide 5: Executive Summary (continued)
     • our work: analyze big sequences in minutes/seconds

     Slide 6: Data series

  4. Slides 7-8: Data series
     • Sequence of points ordered along some dimension
     [Figure: points x_1, x_2, …, x_n with values v_1, v_2, …; vertical axis: value; horizontal axis: sequence dimension]
     • Example sequence dimension (slide 8): Time

  5. Slide 9: Data series (continued)
     • Example sequence dimensions: Time, Position

     Slide 21: [Figure: a long DNA nucleotide sequence, beginning GTCAATGGCCAGGATATTAGAACAGTACTCTGTGAACCC…; sequence dimension: Position]
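The definition on these slides (a sequence of points ordered along some dimension, such as time or position) can be illustrated with a minimal Python sketch. The names `make_series`, `dna_to_series`, and `STEP` are hypothetical, and the per-nucleotide step values are an assumption chosen only to turn letters into numbers; the slides do not say how the DNA sequence is encoded.

```python
from typing import Dict, List, Tuple

# A data series as (position along the sequence dimension, value) pairs.
DataSeries = List[Tuple[float, float]]


def make_series(points: List[Tuple[float, float]]) -> DataSeries:
    """Return the points ordered along the sequence dimension."""
    return sorted(points, key=lambda point: point[0])


# Hypothetical numeric step per nucleotide (an assumption for illustration).
STEP: Dict[str, float] = {"A": 2.0, "G": 1.0, "C": -1.0, "T": -2.0}


def dna_to_series(dna: str) -> DataSeries:
    """Turn a DNA string into a series ordered by position (a cumulative walk)."""
    series: DataSeries = []
    level = 0.0
    for position, base in enumerate(dna):
        level += STEP[base]
        series.append((float(position), level))
    return series


# Sequence dimension = time: unordered readings become an ordered series.
readings = [(3.0, 0.7), (1.0, 0.2), (2.0, 0.5)]
time_series = make_series(readings)

# Sequence dimension = position: the first eight bases shown on the slide.
dna_series = dna_to_series("GTCAATGG")
```

In both cases the result is the same kind of object, an ordered list of values, which is why time series and position-ordered data such as DNA can be handled by the same techniques.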

  6. Slides 22-23: [Figure: the DNA nucleotide sequence from slide 21, shown again; sequence dimension: Position]

  7. Slide 27: [image only; no text recovered]
     Slide 28: [image; credit: Schinnerer et al.]

  8. Slide 29: Telecommunications
     • analysis of call activity patterns
       ▫ Telecom Italia call activity for Easter Monday
     [Figure: clustermap of incoming calls time series]
     [Figure: average number of calls for 5 smallest clusters; horizontal axis: Time]

     Slide 30: Home Networks
     • temporal usage behavior analysis of home networks
       ▫ Portugal Telecom
     [Figure: clustering based on user activity patterns, highlighting a (previously unknown) frequent behavior pattern; horizontal axis: Time]
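The "average number of calls for 5 smallest clusters" plot can be read as a per-cluster mean series. The sketch below shows one way such averages might be computed once cluster labels are available. The function names, the toy call counts, and the labels are all hypothetical; the slides do not state which clustering method produced the clusters.

```python
from collections import defaultdict
from typing import Dict, Hashable, List, Sequence


def cluster_averages(
    series: Sequence[Sequence[float]], labels: Sequence[Hashable]
) -> Dict[Hashable, List[float]]:
    """Average equal-length series point by point within each cluster."""
    groups: Dict[Hashable, List[Sequence[float]]] = defaultdict(list)
    for one_series, label in zip(series, labels):
        groups[label].append(one_series)
    averages: Dict[Hashable, List[float]] = {}
    for label, members in groups.items():
        length = len(members[0])
        averages[label] = [
            sum(member[t] for member in members) / len(members)
            for t in range(length)
        ]
    return averages


def smallest_clusters(labels: Sequence[Hashable], k: int) -> List[Hashable]:
    """Return the labels of the k clusters with the fewest members."""
    sizes: Dict[Hashable, int] = defaultdict(int)
    for label in labels:
        sizes[label] += 1
    # Ties are broken by label so the result is deterministic.
    return sorted(sizes, key=lambda label: (sizes[label], label))[:k]


# Toy data (made up): four call-count series and their cluster labels.
calls = [[10, 20, 30], [30, 40, 50], [0, 5, 10], [100, 200, 300]]
labels = [0, 0, 1, 2]

averages = cluster_averages(calls, labels)
smallest = smallest_clusters(labels, 2)
```

Plotting `averages[label]` against time for each label in `smallest` would give a chart of the same kind as the one described on the slide.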

  9. Operation Health Monitoring
     [Figure, shown on two consecutive slides; no text recovered beyond the axis label: Time]
