UnTran: Recognizing Unseen Activities with Unlabeled Data using Transfer Learning
ACM/IEEE IoTDI'18, April 18th, 2018
Md Abdullah Al Hafiz Khan, Nirmalya Roy
Hafiz Khan
mdkhan1@umbc.edu
Challenges in Scaling Activity Recognition
- Cross User Diversity
- Device Type Diversity
- Device-instance Diversity
- Heterogeneous Environments
- Heterogeneous Sensor Diversity
- Unseen Activities
Motivation
- Cross-user Diversity
- Person A's walking pattern differs from Person B's
- One person's walking may resemble another person's running
- How to cope with this diversity?
Motivation
- Unseen Activities In the Target Domain
- Two Scenarios
- Balanced Unseen Activities
- Imbalanced Unseen Activities
- Balanced Unseen Activities
- Both domains contain an equivalent number of activities
- New activities appear in the target domain
- Imbalanced Unseen Activities
- The target domain contains more activities than the training environment
- New activities appear in the target domain
(Figure: train environment with labeled data vs. test environment with labeled and unlabeled data, each containing new and existing activities)
Transfer Learning
- Psychological point of view
- The study of the dependency of human conduct, learning, or performance
on prior experience
- Thorndike and Woodworth explored how individuals transfer learning from one
context to another that shares similar characteristics [Psychological Review, 1901].
Transfer Learning
Machine learning community
- Inspired by human’s transfer of learning ability
- The ability of a system to recognize and apply knowledge and skills learned in
previous domains/tasks to novel domains/tasks that share some commonality [Pan et al., TKDE 2010].
- Examples
- Goal: to train an AR model to infer task T1 in an indoor environment E1 using
machine learning techniques:
- Sufficient training data required: sensor readings to measure the environment
as well as human supervision, i.e., labels
- A predictive model can be learned, and used in the same environment
Our Approach
- We address the following scenarios
- Imbalanced Activities
- Unseen activities
- These may also inherently contain the challenges below
- Cross User Diversity
- Device-Instance Diversity
- Autoencoder
- Classifier decision fusion
Problem & Solution
- Collecting annotated samples is costly
- Deep models
- Data hungry
- Require long training times
- How to reuse source-trained deep models?
- Transfer one or more layers
- Few or no labels needed in the target domain
- Reduces training time, reuses the existing model
- Unseen activities (both balanced and imbalanced)
- Autoencoder + shallow classifier
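Reusing the lower layers of a source-trained model in the target domain can be sketched as below; the layer sizes, tanh activations, and variable names are illustrative assumptions, not the paper's exact architecture:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical source-trained weights for a 2-layer encoder
# (shapes are illustrative, not from the paper).
source_W1 = rng.normal(size=(9, 16))   # raw features -> hidden layer 1
source_W2 = rng.normal(size=(16, 8))   # hidden layer 1 -> hidden layer 2

def encode(x, W1, W2):
    """Forward pass through the two transferred encoder layers."""
    h1 = np.tanh(x @ W1)
    return np.tanh(h1 @ W2)

# The target domain reuses ("transfers") the two lower layers unchanged;
# only a small classifier on top would be trained with the few labels.
target_x = rng.normal(size=(4, 9))     # 4 unlabeled target windows
deep_features = encode(target_x, source_W1, source_W2)
print(deep_features.shape)             # (4, 8)
```

Freezing the transferred layers is what makes the approach cheap: only the shallow classifier on top needs target-domain training.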
Our AR Approach
Overview of our activity recognition approach. (a) Source domain labeled activity instances, (b) Target domain contains both unlabeled and few labeled activity instances, (c) Common feature space for classification, and (d) Resulting activities after classification. Note that different shapes correspond to different activities
Proposed Architecture
- Three Modules
- Data Processing
- Feature Encoding
- Activity Recognition
- Data Processing
- Pre-processing
- Feature Extraction
- Feature Encoding
- Autoencoder
- Activity Recognition Model
- Fusion of Target Raw AR, Source AR, and Target Deep AR
Overall Architecture
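The Data Processing module's windowing-plus-feature-extraction step can be sketched as follows; the window length, step size, and feature set here are illustrative assumptions, not the paper's exact settings:

```python
import numpy as np

def window_features(signal, win=64, step=32):
    """Split a 1-D accelerometer stream into overlapping windows and
    compute simple statistical features (mean, std, min, max) per window."""
    feats = []
    for start in range(0, len(signal) - win + 1, step):
        w = signal[start:start + win]
        feats.append([w.mean(), w.std(), w.min(), w.max()])
    return np.array(feats)

acc = np.sin(np.linspace(0, 20, 256))   # toy accelerometer trace
X = window_features(acc)
print(X.shape)                           # (7, 4) -> 7 windows, 4 features each
```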
Feature Encoding
- Four-layer autoencoder, named Deep Sparse Autoencoder (DSAE)
- Cost function:
- Additional classifier layer (softmax layer)
- Lower-layer features are more generic [6]
- Transfer two layers to implicitly minimize the domain distribution gap
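The slide's cost-function equation did not survive extraction. A standard deep sparse autoencoder objective of this kind combines reconstruction error, an L2 weight-decay term, and a KL-divergence sparsity penalty; the paper's exact weighting may differ:

```latex
J(W, b) = \frac{1}{N}\sum_{i=1}^{N} \left\lVert x_i - \hat{x}_i \right\rVert^2
        + \lambda \lVert W \rVert_2^2
        + \beta \sum_{j} \mathrm{KL}\!\left(\rho \,\middle\|\, \hat{\rho}_j\right),
\qquad
\mathrm{KL}(\rho \,\|\, \hat{\rho}_j) = \rho \log\frac{\rho}{\hat{\rho}_j}
        + (1-\rho)\log\frac{1-\rho}{1-\hat{\rho}_j}
```

Here $\hat{\rho}_j$ is the average activation of hidden unit $j$ and $\rho$ the target sparsity level.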
Activity Recognition Model
- Fuse three classifiers
- Based on empirical evaluation
- Source-trained classifier
- Deep-feature based
- Target classifiers
- Deep-feature based
- Raw-feature based
- Class probability:
- Novel class detector
- One-class SVM
- Distinguishes between seen and unseen activities
- Activity class determination
(Figure: fusion probabilities for existing vs. unseen activities)
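The fusion-and-novelty-detection logic can be sketched as follows; the averaged fusion rule, feature dimensions, and toy data are illustrative assumptions, not the paper's exact formulation:

```python
import numpy as np
from sklearn.svm import OneClassSVM

rng = np.random.default_rng(0)

# Hypothetical softmax outputs of the three classifiers for one window
# (source deep-feature, target deep-feature, target raw-feature).
p_src  = np.array([0.7, 0.2, 0.1])
p_deep = np.array([0.6, 0.3, 0.1])
p_raw  = np.array([0.5, 0.4, 0.1])

# Simple averaged decision fusion (one plausible fusion rule).
p_fused = (p_src + p_deep + p_raw) / 3.0

# One-class SVM trained only on seen-activity features flags novel windows.
seen_features = rng.normal(0, 1, size=(200, 4))
ocsvm = OneClassSVM(nu=0.1, gamma="scale").fit(seen_features)

window = rng.normal(0, 1, size=(1, 4))
if ocsvm.predict(window)[0] == 1:      # +1 = resembles a seen activity
    label = int(np.argmax(p_fused))
else:                                  # -1 = novel / unseen activity
    label = -1
print(p_fused.round(2), label)
```

The one-class SVM acts as the gatekeeper: only windows it accepts as "seen" are assigned a class from the fused probabilities.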
Evaluation
- Three datasets -
- Opportunistic (Opp),
- Wisdm
- Daily and sports (Das)
- Opportunistic
- 17 activities (ADL), 4 participants, 64 Hz sampling frequency, accelerometer sensor
- Wisdm
- 6 distinct activities, sampling frequency 20 Hz, 29 users, smartphone kept in
pants pocket
- Daily and Sports (Das)
- 19 activities, 8 users, sampling frequency 25 Hz, right arm data was considered
UnTran performance on different layers
- Fixed number of unseen activities in the target domain
- Standard leave-two-samples-out cross-validation
- Generic features in lower layers, domain-specific features in upper layers
- 30% labeled data used to train the target-domain classifier
Performance on different layers
Balanced Activities: Varying Labeled Data
- Equivalent number of activities in both domains
- Standard leave-two-out cross-validation
- Varying the amount of labeled data drawn randomly from (n-2) samples
- 20-30% labeled data required to reach reasonable performance
- Larger data-distribution differences reduce performance
(Figure panels: Wisdm, Daily and Sports, Opportunistic)
Balanced Activities: Varying Unseen Activities
- Varying the number of activities
- Similar leave-two-samples-out cross-validation
- Model trained with 30% labeled data
- Performance drops 5-12% as the number of unseen activities increases
- Performance gain of 10-13% compared to TCA and JDA
(Figure panels: Daily and Sports, Wisdm)
Imbalanced Activities: Varying Labeled Data
- Leave-two-class-out cross-validation
- (A-2) activity classes participate in training
- The remaining two classes' samples are used in the testing phase
- Trained with 30% annotated data
- Performance gain of 10-12% compared to TCA and JDA
Opportunistic
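The leave-two-class-out protocol described above can be sketched as an index split; the class labels and held-out pair here are toy values:

```python
import numpy as np

def leave_two_classes_out(y, held_out):
    """Split sample indices so that two activity classes are entirely
    unseen during training, mimicking the imbalanced-activities protocol.
    `held_out` names the two classes reserved for testing."""
    y = np.asarray(y)
    test_mask = np.isin(y, held_out)
    return np.where(~test_mask)[0], np.where(test_mask)[0]

y = [0, 1, 2, 3, 0, 1, 2, 3, 2, 3]          # toy activity labels
train_idx, test_idx = leave_two_classes_out(y, held_out=[2, 3])
print(train_idx, test_idx)                   # training keeps classes 0 and 1 only
```

Rotating the held-out pair over all class combinations yields the (A-2)-vs-2 evaluation used on the slides.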
Imbalanced Activities: Varying Unseen Activities
- Leave-two-class-out cross-validation
- (A-2) activity classes participate in training
- The remaining two classes' samples are used in the testing phase
- Performance gain of 15-20% compared to TCA and JDA
- Achieves an F1 score of about 70% on average
Opportunistic
Discussion & Conclusion
- Further investigation of cross-user diversity is warranted
- Explicit structural pattern mapping among activities and instances is
needed
- Intra- and inter-activity similarities can be exploited
- Annotation cost
- The assumption is that the user provides a few labeled samples
- One possible direction is to reduce the annotation cost
- UnTran achieves
- Approx. 75% accuracy for cross-user differences with unlabeled data
- Approx. 87% accuracy with 10% annotated samples in the target domain
Thank you!
Md Abdullah Al Hafiz Khan
PhD Candidate, Information Systems
University of Maryland, Baltimore County
Email: mdkhan1@umbc.edu
https://userpages.umbc.edu/~mdkhan1