Applications of Deep Learning (Beyond Text & Images)
Brian Mac Namee
APPLICATIONS OF MACHINE LEARNING
https://trends.google.com/trends/
https://xkcd.com/1425/
https://xkcd.com/1831/
[Diagram, built up in stages: the relationship between data science, artificial intelligence, machine learning, and deep learning, with machine learning annotated in turn by:
– learning setting: supervised learning, unsupervised learning, reinforcement learning
– paradigm: decision tree learning, instance-based learning, reinforcement learning, Bayesian learning, analytical learning
– model family: probability-based, information-based, error-based, similarity-based
– task: recognising, generating, controlling, forecasting, organising]
Domains Ripe for Application of Machine Learning
– Involve repetitive tasks with defined outcomes
– Massive collections of historical examples of the task, with solutions, already exist
– Involve simple decisions rather than complex recommendations
– The domain does not change too rapidly
– The opportunity to augment human performance, rather than replace it, exists
Limitations of Machine Learning
– Still best for one-level questions
– Struggles to deal with subtle context
– Encodes biases that exist in datasets
– Making machine learning models that continuously learn is still difficult
– Explanation of models (in domains where trust is required) remains challenging
(BEYOND TEXT & IMAGES)
There’s All Kinds Of Data Out There!
What Data You Analyzed – KDnuggets Poll Results and Trends https://www.kdnuggets.com/2017/04/poll-results-data-analyzed.html
Activity Tracking
WISDM v1.1 Activity Recognition Data
Accelerometer data recorded in controlled conditions for activity recognition
– 1,098,207 instances
– 3 attributes
– 6 activity classes
Assume signals contain both spatial and temporal structure
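Before any model sees the data, the raw tri-axial stream is typically segmented into fixed-length, overlapping windows. A minimal NumPy sketch (the 200-sample window assumes the dataset's 20 Hz sampling rate, i.e. 10 seconds; the 50% overlap is an illustrative choice):

```python
import numpy as np

def sliding_windows(signal, window_size, step):
    """Segment a (n_samples, n_channels) signal into overlapping
    fixed-length windows of shape (n_windows, window_size, n_channels)."""
    windows = [signal[start:start + window_size]
               for start in range(0, len(signal) - window_size + 1, step)]
    return np.stack(windows)

# Synthetic tri-axial accelerometer stream: 1000 samples x 3 channels (x, y, z)
rng = np.random.default_rng(0)
stream = rng.normal(size=(1000, 3))

# 200-sample windows (10 s at 20 Hz) with 50% overlap
X = sliding_windows(stream, window_size=200, step=100)
print(X.shape)  # (9, 200, 3)
```

Each window becomes one training instance, labelled with the activity performed during it.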
[Figure: tri-axial accelerometer traces (X, Y, Z acceleration vs time) for (a) Walking and (b) Jogging]
Jennifer R. Kwapisz, Gary M. Weiss and Samuel A. Moore (2010). Activity Recognition using Cell Phone Accelerometers, Proceedings of the Fourth International Workshop on Knowledge Discovery from Sensor Data (at KDD-10). http://www.cis.fordham.edu/wisdm/dataset.php
WISDM v1.1 Activity Recognition Data
[Figure: tri-axial accelerometer traces (X, Y, Z acceleration vs time) for (c) Ascending Stairs and (d) Descending Stairs] (Kwapisz et al., 2010)
WISDM v1.1 Activity Recognition Data
[Figure 2: Acceleration plots for the six activities (a–f); panels (e) Sitting and (f) Standing] (Kwapisz et al., 2010)
WISDM v1.1 Activity Recognition Data
- 5
5 10 0.5 1 1.5 2 2.5
Time (s) Acceleration Y Axis Z Axis X Axis
(e) Sitting
- 5
5 10 0.5 1 1.5 2 2.5
Time (s) Acceleration Z Axis Y Axis X Axis
(f) Standing Figure 2: Acceleration Plots for the Six Activities (a-f) Jennifer R. Kwapisz, Gary M. Weiss and Samuel A. Moore (2010). Activity Recognition using Cell Phone Accelerometers, Proceedings of the Fourth International Workshop on Knowledge Discovery from Sensor Data (at KDD-10) http://www.cis.fordham.edu/wisdm/dataset.php
WISDM v1.1 Activity Recognition Data
Objective: apply deep learning approaches without any specialist domain knowledge or manual feature engineering
CNN Based Architecture
Per input channel (x, y, z):
– 1D conv, 1 × 64 filters, stride 1 [ReLU]
– 1D conv, 64 × 64 filters, stride 2 [ReLU]
– 1D conv, 64 × 64 filters, stride 2 [ReLU]
Concatenation: 3 × 64 feature maps
Fully connected layers: 128 hidden nodes [ReLU] → 128 hidden nodes [ReLU]
Classification: 6 output nodes [softmax]
CNN on 1-D Time Series Channel
Input channel → 1D convolutional layer → feature maps → pooling layer → fully connected layer → output layer
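What a 1D convolutional layer computes can be made concrete with a minimal NumPy sketch: a valid cross-correlation followed by ReLU (what deep learning frameworks call "convolution"). The two filters below are hand-picked for illustration, not the learned 64-filter banks in the architecture above:

```python
import numpy as np

def conv1d(x, kernels, stride=1):
    """Valid 1D cross-correlation: x is (length,), kernels is
    (n_filters, k). Returns (n_out, n_filters) ReLU feature maps."""
    n_filters, k = kernels.shape
    n_out = (len(x) - k) // stride + 1
    out = np.empty((n_out, n_filters))
    for i in range(n_out):
        window = x[i * stride : i * stride + k]
        out[i] = kernels @ window          # one response per filter
    return np.maximum(out, 0.0)            # ReLU activation

x = np.arange(10, dtype=float)             # a single-axis signal
kernels = np.array([[1.0, -1.0],           # difference (edge) filter
                    [0.5,  0.5]])          # smoothing filter
fm = conv1d(x, kernels, stride=2)
print(fm.shape)   # (5, 2): 5 time steps x 2 feature maps
```

A stride of 2, as in the second and third layers above, halves the temporal resolution of the feature maps at each layer.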
CNN-LSTM based architecture
Per input channel (x, y, z):
– 1D conv, 1 × 64 filters, stride 1 [ReLU]
– 1D conv, 64 × 64 filters, stride 2 [ReLU]
– 1D conv, 64 × 64 filters, stride 2 [ReLU]
Concatenation: 3 × 64 feature maps
Recurrent layers: LSTM [128 hidden] → LSTM [128 hidden] → LSTM [6 hidden]
Classification: softmax
CNN to LSTM
The CNN produces a feature vector at each timestamp (x0, x1, …, xn); these are fed step by step (t0 … tn) into stacked LSTM layers, whose final output y gives the classification.
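How per-timestep CNN features flow through recurrence can be sketched with a single minimal LSTM cell in NumPy. The weights are random and the dimensions illustrative (not the trained 128-hidden layers above); the point is the step-by-step gating over the feature sequence:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_forward(X, Wx, Wh, b, hidden):
    """Run one LSTM layer over X (T, d) and return the final
    hidden state (the sequence summary passed on for classification)."""
    h = np.zeros(hidden)
    c = np.zeros(hidden)
    for x_t in X:                          # one CNN feature vector per step
        z = Wx @ x_t + Wh @ h + b          # all four gates at once: (4*hidden,)
        i, f, o, g = np.split(z, 4)
        i, f, o = sigmoid(i), sigmoid(f), sigmoid(o)
        c = f * c + i * np.tanh(g)         # forget old memory, write new
        h = o * np.tanh(c)                 # gated hidden state
    return h

rng = np.random.default_rng(1)
T, d, hidden = 12, 64, 16                  # 12 timesteps of 64-dim CNN features
X = rng.normal(size=(T, d))
Wx = rng.normal(size=(4 * hidden, d)) * 0.1
Wh = rng.normal(size=(4 * hidden, hidden)) * 0.1
b = np.zeros(4 * hidden)

h_final = lstm_forward(X, Wx, Wh, b, hidden)
print(h_final.shape)  # (16,)
```

Because the cell state c carries information across steps, the LSTM can relate features that are far apart in time, which the bounded receptive field of the convolutions cannot.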
Results
User Centric Problem
Impersonal Data
– Model trained on data only from users outside the test set.
– Doesn't require user-specific data, but is less accurate.
Personal Data
– Model trained on data only from the test user.
– Requires user-specific data, but tends to be more accurate.
Hybrid Data
– Model trained on data from both the test user and users outside the test set.
Malware Detection
Kaggle Microsoft Malware Classification Challenge
Malware is malicious code, often encountered as compiled executable byte code. The Kaggle Microsoft malware classification challenge:
– Over 400 GB uncompressed data
– 9 labelled malware classes
– 10,868 malware files as raw byte code (plus disassembled machine code) in training set
Kaggle Microsoft Malware Classification Challenge https://www.kaggle.com/c/malware-classification
Malware Class    Instances
Ramnit               1,541
Lollipop             2,478
Kelihos_v3           2,942
Vundo                  475
Simda                   42
Tracur                 751
Kelihos_v1             398
Obfuscator.ACY       1,228
Gatak                1,013
Kaggle Microsoft Malware Classification Challenge
.text:00401000 56           push    esi
.text:00401001 8D 44 24 08  lea     eax, [esp+8]
.text:00401005 50           push    eax
.text:00401006 8B F1        mov     esi, ecx
.text:0040100D C7 06 08     mov     dword ptr [esi], offset off_42BB08
.text:00401013 8B C6        mov     eax, esi
.text:00401015 5E           pop     esi
.text:00401016 C2 04 00     retn    4
.text:00401019 CC CC CC     align   10h
.text:00401020 C7 01 08     mov     dword ptr [ecx], offset off_42BB08
.text:00401026 E9 26 1C     jmp     sub_402C51

00401000 56 8D 44 24 08 50 8B F1 E8 1C 1B 00 00 C7 06 08
00401010 BB 42 00 8B C6 5E C2 04 00 CC CC CC CC CC CC CC
00401020 C7 01 08 BB 42 00 E9 26 1C 00 00 CC CC CC CC CC
00401030 56 8B F1 C7 06 08 BB 42 00 E8 13 1C 00 00 F6 44
00401040 24 08 01 74 09 56 E8 6C 1E 00 00 83 C4 04 8B C6
00401050 5E C2 04 00 CC CC CC CC CC CC CC CC CC CC CC CC
00401060 8B 44 24 08 8A 08 8B 54 24 04 88 0A C3 CC CC CC
Kaggle Microsoft Malware Classification Challenge https://www.kaggle.com/c/malware-classification
Objective: apply deep learning approaches without any specialist domain knowledge or manual feature engineering
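With no feature engineering, a binary can enter the network simply as a sequence of raw byte values, truncated or padded to a fixed length. A minimal, hypothetical preprocessing sketch (the pad token value and maximum length are illustrative choices, not those used in the deck's experiments):

```python
def bytes_to_sequence(raw: bytes, max_len: int, pad_value: int = 256):
    """Map a raw binary to a fixed-length integer sequence.
    Each byte becomes a token in 0..255; a distinct pad token (256)
    fills short files so every sequence has the same length."""
    seq = list(raw[:max_len])                  # truncate long files
    seq += [pad_value] * (max_len - len(seq))  # pad short files
    return seq

# Toy "binary" standing in for a malware sample
blob = bytes([0x56, 0x8D, 0x44, 0x24, 0x08, 0x50])
seq = bytes_to_sequence(blob, max_len=8)
print(seq)  # [86, 141, 68, 36, 8, 80, 256, 256]
```

The resulting integer sequences would typically pass through an embedding layer before the convolutional stack.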
Three architectures compared:
– CNN Model: CNN → dense layer → output
– CNN–UniLSTM Model: CNN → LSTM → output
– CNN–BiLSTM Model: CNN → bidirectional LSTM → output
Results
Deep Learning Configuration        Accuracy (%)   F1-score (%)
CNN (Default Sample)                   95.10          92.14
CNN (Rebalanced Sample)                95.80          92.14
CNN UniLSTM (Default Sample)           97.64          94.15
CNN UniLSTM (Rebalanced Sample)        98.12          95.92
CNN BiLSTM (Default Sample)            97.91          95.52
CNN BiLSTM (Rebalanced Sample)         98.20          96.05

5-Fold Cross-Validation Experiment
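With classes as imbalanced as these (42 Simda instances against 2,942 for Kelihos_v3), cross-validation folds should preserve class proportions. A minimal sketch of a stratified k-fold splitter, a simplified stand-in for library implementations such as scikit-learn's StratifiedKFold:

```python
from collections import defaultdict

def stratified_kfold(labels, k):
    """Yield k (train_idx, test_idx) splits that preserve per-class
    proportions by dealing each class's indices round-robin into folds."""
    by_class = defaultdict(list)
    for idx, y in enumerate(labels):
        by_class[y].append(idx)
    folds = [[] for _ in range(k)]
    for idxs in by_class.values():
        for pos, idx in enumerate(idxs):
            folds[pos % k].append(idx)
    for i in range(k):
        test = sorted(folds[i])
        train = sorted(idx for j in range(k) if j != i for idx in folds[j])
        yield train, test

# Toy imbalanced label set: 10 of one class, 5 of another
labels = ["Ramnit"] * 10 + ["Simda"] * 5
splits = list(stratified_kfold(labels, k=5))
print(len(splits))        # 5 folds
print(len(splits[0][1]))  # each test fold: 2 Ramnit + 1 Simda = 3
```

Without stratification, a rare class like Simda could easily be absent from some training folds entirely.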
Predictive Maintenance
Seizure Detection
Generic Time Series Clustering
FLIRTING WITH AUTOML
Flirting With AutoML
Opaque data is raw data for which domain expertise is not available, for which feature engineering has not been studied, or which comes from newly released products and new domains.
Can we build a generic solution that will work X% of the time with minimal tuning?
What Features To Model?
– Short-term dependencies → CNN
– Long-term dependencies → RNN (LSTM)
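One way to see this division of labour: the receptive field of a stacked 1D convolution grows with depth but stays bounded, so dependencies beyond that bound need recurrence. A small sketch computing the bound for a hypothetical three-layer stack (kernel size 5, strides 1/2/2 are illustrative values):

```python
def receptive_field(layers):
    """Receptive field (in input samples) of a stack of 1D conv layers,
    each given as (kernel_size, stride)."""
    r, jump = 1, 1
    for k, s in layers:
        r += (k - 1) * jump   # each layer widens the view of the input
        jump *= s             # stride compounds spacing between outputs
    return r

# Three conv layers: kernel size 5, strides 1, 2, 2
stack = [(5, 1), (5, 2), (5, 2)]
print(receptive_field(stack))  # 17 input samples
```

Anything further apart than 17 samples in this stack can only be related by the layers after the convolutions, which is exactly what the LSTM provides.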
Collaborators
Ellen Rushe Oisin Boydell Quan Le Luis Pechaun Atif Qureshi Jing Su
Brian Mac Namee
@brianmacnamee brian.macnamee@ucd.ie
www.machinelearningbook.com www.ceadar.ie www.insight-centre.org www.theanalyticsstore.ie
University College Dublin School of Computer Science