Sense Making in an IOT World: Sensor Data Analysis with Deep Learning
Natalia Vassilieva, PhD Senior Research Manager
GTC 2016
Deep learning proof points as of today

– Vision: search & information extraction; security/video surveillance; self-driving cars; robotics
– Speech: interactive voice response (IVR) systems; voice interfaces (mobile, cars, gaming, home); security (speaker identification); health care; people with disabilities
– Text: search and ranking; sentiment analysis; machine translation; question answering
– Other: recommendation engines; advertising; fraud detection; AI challenges; drug discovery; sensor data analysis; diagnostic support
Deep Learning is about …
– Huge volumes of training data (labeled and unlabeled)
– Multidimensional and complex data with non-trivial patterns (spatial or temporal)
– Replacement of manual feature engineering with unsupervised feature learning
– Cross-modality feature learning
Sensor Data is about …
– Huge volumes of data (mostly unlabeled)
– Complex data with non-trivial patterns (mostly temporal)
– Variety of data representations; feature engineering is hard
– Multiple modalities
Works well for speech! Most sensor data is time series
– Scripted video and accelerometer data from one sensor and 52 subjects (~20 min per subject)
– Accelerometer data: 500 Hz × 4 dimensions = 120,000 measurements per minute per person
– 16 classes
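Frame counts like the ~3.35M below come from slicing the raw accelerometer stream into fixed-size windows. A minimal sketch of that framing step; the window and hop sizes here are illustrative assumptions, not values from the talk:

```python
import numpy as np

def frame_stream(signal, window=256, hop=128):
    """Slice (n_samples, n_channels) into overlapping flattened frames.

    Returns an array of shape (n_frames, window * n_channels).
    Window/hop are hypothetical choices for illustration.
    """
    n_samples, n_channels = signal.shape
    frames = []
    for start in range(0, n_samples - window + 1, hop):
        frames.append(signal[start:start + window].reshape(-1))
    return np.stack(frames)

# One minute of 4-axis data at 500 Hz: 30,000 samples x 4 = 120,000 values
minute = np.random.randn(500 * 60, 4)
frames = frame_stream(minute)
print(frames.shape)  # (233, 1024)
```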
Total number of frames: ~3.35M
Baselines
Deep Neural Networks
[Diagram: raw data → first level of representations → second level of representations]
Set     Size
train   388,518
cv      129,510
test    129,510
total   647,538
Model                   Accuracy (%)
ZeroR                   71.2
SVM (binary)            97.6
Shallow NN              98.6
C5.0                    99.6
SVM                     98.03
DNN (1533-200-200-16)   99.7
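The 1533-200-200-16 topology is small by deep-learning standards; a quick sanity check of its size (weights plus biases per fully connected layer):

```python
# Parameter count for the fully connected 1533-200-200-16 topology:
# each layer contributes (n_in * n_out) weights plus n_out biases.
layers = [1533, 200, 200, 16]
params = sum(n_in * n_out + n_out for n_in, n_out in zip(layers, layers[1:]))
print(params)  # 350216 -- roughly 1.4 MB of float32 weights
```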
Model       Accuracy (%)
ZeroR       69.7
C5.0        71.6
DNN         84.5
DNN + CRF   95.1

Set     Size
train   2,608,637
cv
test    738,180
total   3,346,817

[Figure: frame labels over time for DNN, DNN + CRF, and true labels]
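The jump from 84.5% (DNN) to 95.1% (DNN + CRF) comes from modeling label continuity across consecutive frames rather than classifying each frame in isolation. A minimal sketch of that idea, using Viterbi decoding over per-frame scores with a hand-set penalty for switching labels; the penalty value and the toy scores are illustrative assumptions, not the talk's actual CRF:

```python
import numpy as np

def viterbi_smooth(log_probs, switch_penalty=2.0):
    """log_probs: (n_frames, n_classes) per-frame log-scores -> label path.

    Transition score is 0 for keeping the same label and -switch_penalty
    for changing it, so isolated single-frame flips get smoothed away.
    """
    n_frames, n_classes = log_probs.shape
    trans = -switch_penalty * (1 - np.eye(n_classes))  # trans[prev, cur]
    score = log_probs[0].copy()
    back = np.zeros((n_frames, n_classes), dtype=int)
    for t in range(1, n_frames):
        cand = score[:, None] + trans                  # (prev, cur)
        back[t] = np.argmax(cand, axis=0)              # best prev per cur
        score = cand[back[t], np.arange(n_classes)] + log_probs[t]
    path = [int(np.argmax(score))]
    for t in range(n_frames - 1, 0, -1):
        path.append(int(back[t, path[-1]]))
    return path[::-1]

# Noisy frame scores: mostly class 0 with one spurious frame favoring class 1
lp = np.log(np.array([[0.9, 0.1], [0.4, 0.6], [0.9, 0.1], [0.9, 0.1]]))
print(viterbi_smooth(lp))  # [0, 0, 0, 0] -- the isolated flip is removed
```

Per-frame argmax alone would output [0, 1, 0, 0] here; the chain model keeps the temporally consistent path.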
Application, model size, training data, FLOP per epoch, training time:

Vision
– Model: 1.7×10⁹ parameters (~6.8 GB)
– Training data: 14×10⁶ images; ~2.5 TB (256×256) to ~10 TB (512×512)
– FLOP per epoch: 6 × 1.7×10⁹ × 14×10⁶ ≈ 1.4×10¹⁷
– Training time: 3 days × 16,000 cores; 2 days × 16 servers × 4 GPUs; 8 hours × 36 servers × 4 GPUs

Speech
– Model: 60×10⁶ parameters (~240 MB)
– Training data: 100K hours of audio (~34×10⁹ frames, ~50 TB)
– FLOP per epoch: 6 × 60×10⁶ × 34×10⁹ ≈ 1.2×10¹⁹
– Training time: days × 8 GPUs

Text
– Model: 6.5×10⁶ parameters (~260 MB)
– Training data: 856×10⁶ words
– FLOP per epoch: 6 × 6.5×10⁶ × 856×10⁶ ≈ 3.3×10¹⁶
– Training time: 4 weeks

Signals
– Model: 1.2×10⁶ parameters (~4.8 MB)
– Training data: 3×10⁶ frames
– FLOP per epoch: 6 × 1.2 × 3×10⁶ × 3×10⁶ ≈ 6.5×10¹³
– Training time: days
– Very large number of parameters (>10⁶), huge (unlabeled) data sets for training (10⁶–10⁹ samples)
– Computationally expensive: requires O(model size × data size) FLOPs per epoch
– Needs many iterations (and epochs) to converge
– Needs frequent synchronization to converge fast
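The O(model size × data size) estimate above is usually taken as ~6 FLOPs per parameter per training sample (forward plus backward pass), which is where the per-epoch numbers in the table come from. A quick check:

```python
# Back-of-the-envelope training cost: ~6 FLOPs per parameter per sample.
def flops_per_epoch(n_params, n_samples):
    return 6 * n_params * n_samples

vision = flops_per_epoch(1.7e9, 14e6)  # ~1.4e17, matches the table
speech = flops_per_epoch(60e6, 34e9)   # ~1.2e19
print(f"vision: {vision:.1e}, speech: {speech:.1e}")
```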
Today’s hardware:
– NVIDIA Titan X: 7 TFLOPS SP, 12 GB memory
– NVIDIA Tesla M40: 7 TFLOPS SP, 12 GB memory
– NVIDIA Tesla K40: 4.29 TFLOPS SP, 12 GB memory
– NVIDIA Tesla K80: 5.6 TFLOPS SP, 24 GB memory
– Intel Xeon Phi: 2.4 TFLOPS SP
Compute requirements today:
– 10¹³–10¹⁹ FLOPs per epoch
– To run 1 epoch per hour: ~10ˣ TFLOPS SP sustained, depending on model and data size
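Dividing an epoch's FLOPs by a device's throughput gives a lower bound on epoch time; real utilization is well below peak, so actual times are longer:

```python
# Optimistic epoch-time estimate at peak single-precision throughput.
def epoch_hours(flops_per_epoch, tflops_sp):
    return flops_per_epoch / (tflops_sp * 1e12) / 3600

# Vision model from the table (~1.4e17 FLOPs/epoch) on one 7 TFLOPS Titan X
print(f"{epoch_hours(1.4e17, 7.0):.1f} h")  # ~5.6 h per epoch at peak
```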
– Google Brain: 1,000 machines (16,000 CPUs) × 3 days
– COTS HPC systems: 16 machines × 4 GPUs × 2 days
– Deep Image by Baidu: 36 machines × 4 GPUs × ~8 hours
– Deep Speech by Baidu: 8 GPUs × ~weeks
– Deep Speech 2 by Baidu: 8 or 16 GPUs × 3 to 5 days

Limited scalability of training for speech/time-series data!
– Images: locally connected, convolutional
– Speech, time series, sequences: fully connected, recurrent
[Diagram: two network topologies, each Input → Hidden Layer 1 → Hidden Layer 2 → Hidden Layer 3 → Output]
[Diagram: four NUMA nodes, each with CPU and local memory, connected by QPI links]
CPU/GPU cluster vs. multi-socket large-memory machine

[Diagram: cluster nodes, each with a CPU and GPUs with GPU memory attached over PCIe; nodes connected via InfiniBand]

– InfiniBand: ~12 GB/s
– PCIe: ~16 GB/s
– QPI link: ~12.8 GB/s per direction
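At these link speeds, shipping full model replicas for the frequent synchronization that training needs is costly. A back-of-the-envelope check, assuming one full-weight transfer per sync for the ~6.8 GB vision model and ~240 MB speech model from the table above:

```python
# Seconds to move a full set of model weights over each interconnect.
links_gb_per_s = {"InfiniBand": 12.0, "PCIe": 16.0, "QPI (per direction)": 12.8}

for name, bw in links_gb_per_s.items():
    t_vision = 6.8 / bw    # 6.8 GB vision model
    t_speech = 0.24 / bw   # 240 MB speech model
    print(f"{name}: vision {t_vision:.2f} s, speech {t_speech:.3f} s")
```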
Processor-centric computing
Memory-Driven Computing

[Diagram: multiple SoCs attached to a shared pool of memory over a fabric]
Processor-centric computing
– GPU
– ASIC
– Quantum
– RISC-V
Memory-Driven Computing
[Diagram: SoCs sharing a fabric-attached pool of memory]
NVM = Non-volatile memory
– Things on a network: still works well for small, local, custom systems with strict performance needs
– The cloud-centric IoT: a good choice for low-cost "things" where data can easily be moved, with few ramifications
– Edge analytics: ideal for "things" producing large volumes of data that are difficult, costly or sensitive to move
– Distributed mesh computing: multi-party "things" autonomously collaborate with privacy intact
Center:
– Collects all data
– Trains model
– Sends model to edge nodes

Edge node:
– Gets trained model
– Uses the model in real time
– Collects data
– Sends some data to center
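The center/edge split above can be sketched as a toy protocol: the center fits a model on pooled data and ships only its parameters; the edge applies them in real time. The "model" here (a single-threshold classifier) and all field names are illustrative assumptions, not anything from the talk:

```python
import json

def center_train(values):
    """'Train' a toy model: a threshold at the mean of pooled readings."""
    return {"threshold": sum(values) / len(values)}

def edge_infer(model, reading):
    """Real-time inference at the edge using the received parameters."""
    return "high" if reading > model["threshold"] else "low"

pooled = [1.0, 2.0, 3.0, 6.0]            # data collected at the center
model_msg = json.dumps(center_train(pooled))  # model shipped to edge nodes
model = json.loads(model_msg)            # edge node receives the model
print(edge_infer(model, 5.0))  # high
```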
The mesh:
– Distributed training
– Sends model as needed

Edge node:
– Participates in training
– Uses the model in real time
– Collects data
– Sends some data in the mesh
To learn more about Hewlett Packard Labs, visit: http://www.labs.hpe.com To learn more, visit www.hpe.com/themachine