Active Multitask Learning Using Both Supervised and Latent Shared - PowerPoint PPT Presentation

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References Active Multitask Learning Using Both Supervised and Latent Shared Topics Ayan Acharya , Raymond J. Mooney, Joydeep Ghosh UT Austin, Dept. of ECE & CS April 24, 2014

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References Outline Background Act-DSLDA and Act-NPDSLDA Datasets & Empirical Results References

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References Motivation Multitask Learning: data from multiple tasks are collected and models are learnt simultaneously Active Learning: only the most informative examples are queried from the unlabeled pool Unify both of these approaches

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References Problem Setting In training corpus each document/image belongs to a known class and has a set of attributes (supervised topics). Classes from aYahoo data: carriage, centaur, bag, building, donkey, goat, jetski, monkey, mug, statue, wolf, and zebra Attributes: “has head”, “has wheel”, “has torso” and 61 others Train models using words, supervised topics and class labels An active MTL framework that can use and query over both attributes and class labels

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References Transfer with Shared Supervised Attributes Train to infer attributes from visual features Train to infer categories from attributes [Lampert et al., 2009]

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References Multitask Learning with Shared Latent Features Reference: [Caruana, 1997]

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References Transfer with Shared Supervised and Latent Attributes

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References Topic Models: LDA α θ z w β M n K N Figure : LDA Figure : Visual Representation

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References Topic Models: LLDA α θ Λ z w β M n K N Figure : LLDA Figure : Visual Representation

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References Topic Models: MedLDA α θ z Y w β M n K N r Figure : MedLDA Figure : Visual Representation

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References Topic Models: DSLDA Doubly Supervised LDA [Acharya et al., 2013] α ( 1 ) , α ( 2 ) : priors over supervised and latent topics α ( 1 ) α ( 2 ) Λ θ z ǫ w β Y M n K N r Figure : DSLDA Figure : Visual Representation

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References Active DSLDA (Act-DSLDA) r 1 : weights for multiclass SVM r 2 : weights for binary SVMs α ( 1 ) α ( 2 ) r 2 θ Λ z X ǫ w β Y M n K N r 1 Figure : Act-DSLDA Figure : Visual Representation

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References Active NPDSLDA (Act-NPDSLDA) Non-parametric Doubly Supervised LDA [Acharya et al., 2013] α ( 2 ) δ 0 ǫ β ′ π ( 2 ) π ′ c Λ γ 0 ∞ z η 1 φ ∞ w Y η 2 φ M n N K 2 r Figure : NPDSLDA

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References Active NPDSLDA (Act-NPDSLDA) Non-parametric Doubly Supervised LDA [Acharya et al., 2013] α ( 2 ) δ 0 ǫ α ( 2 ) δ 0 ǫ β ′ π ( 2 ) π ′ c Λ γ 0 r 2 π ( 2 ) c β ′ Λ π ′ γ 0 ∞ ∞ z z X η 1 φ η 1 φ ∞ ∞ w w Y Y η 2 η 2 φ φ M n M n N K 2 K 2 N r 1 r Figure : NPDSLDA Figure : Act-NPDSLDA

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References Visual Representation of Act-NPDSLDA Figure : Visual Representation of Act-NPDSLDA

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References Inference and Learning Active learning measure: expected error reduction [Nigam et al., 1998] Batch mode: variational EM with completely factorized approximation to posterior, online SVM [Bordes et al., 2007] Active selection mode: incremental EM [Neal and Hinton, 1999], online SVM

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References Description of Dataset: ACM Conference Conference names: WWW, SIGIR, KDD, ICML, Classes: ISPD, DAC; abstracts of papers are treated as documents keywords provided by the authors Supervised topics:

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References Experimental Methodology Multitask training that evaluates benefits of sharing information among classes on the predictive accuracy of all classes Start with a completely labeled dataset L consisting of 300 documents In every active iteration, 50 labels (class labels or supervised topics) are queried for.

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References Compared Models Model Supervised Topics Latent Topics Class Labels Act-DSLDA present & queried shared queried Act-NPDSLDA present & queried shared queried R-MedLDA-MTL absent shared random selection R-DSLDA present & random selection shared & random selection random selection Act-MedLDA-OVA absent not shared queried Act-MedLDA-MTL absent shared queried Act-DSLDA-OSST present & queried absent queried Act-DSLDA-NSLT present & queried not shared queried R andom MedLDA-MTL (R-MedLDA-MTL) 1 2 R andom DSLDA (R-DSLDA) Active Learning in MedLDA with o ne- v s- a ll classification (Act-MedLDA-OVA) 3 4 Active Learning in MedLDA with m ulti t ask l earning (Act-MedLDA-MTL) Act-DSLDA with o nly s hared s upervised t opics (Act-DSLDA-OSST) 5 Act-DSLDA with n o s hared l atent t opics (Act-DSLDA-NSLT) 6

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References Random MedLDA-MTL (R-MedLDA-MTL)

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References Random DSLDA (R-DSLDA)

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References Active Learning in MedLDA with one-vs-all classification (Act-MedLDA-OVA)

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References Active Learning in MedLDA with Multitask Learning (Act-MedLDA-MTL)

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References Act-DSLDA with Only Shared Supervised Topics (Act-DSLDA-OSST)

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References Act-DSLDA with No Shared Latent Topics (Act-DSLDA-NSLT)

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References aYahoo Learning Curves

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References aYahoo Query Distribution

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References ACM Conference Learning Curves

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References ACM Conference Query Distribution

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References Conclusion and Future Work Experimental results demonstrate the utility of integrating active and multitask learning in one framework that also unifies latent and supervised shared topics. Better approximation techniques for active selection with large scale learning Active query with annotators’ rationales

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References References Acharya, A., Rawal, A., Mooney, R. J., and Hruschka, E. R. (2013). Using both supervised and latent shared topics for multitask learning. In ECML PKDD, Part II, LNAI 8189 , pages 369–384. Bordes, A., Bottou, L., Gallinari, P., and Weston, J. (2007). Solving multiclass support vector machines with larank. In Proc. of ICML , pages 89–96. Caruana, R. (1997). Multitask learning. Machine Learning , 28:41–75. Lampert, C. H., Nickisch, H., and Harmeling, S. (2009). Learning to detect unseen object classes by betweenclass attribute transfer. In Proc. of CVPR , pages 951–958. Neal, R. M. and Hinton, G. E. (1999). A view of the EM algorithm that justifies incremental, sparse, and other variants. Nigam, K., McCallum, A., Thrun, S., and Mitchell, T. (1998). Learning to classify text from labeled and unlabeled documents. In Proceedings of the Fifteenth National Conference on Artificial Intelligence , pages 792–799. AAAI Press.

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References Questions?

Active Multitask Learning Using Both Supervised and Latent Shared - PowerPoint PPT Presentation

Background Act-DSLDA & Act-NPDSLDA Datasets & Empirical Results References Active Multitask Learning Using Both Supervised and Latent Shared Topics Ayan Acharya , Raymond J. Mooney, Joydeep Ghosh UT Austin, Dept. of ECE & CS April

Multitask Learning Lei Tang Arizona State University Nov. 6th, 2006 Lei Tang Multitask

Consistent Multitask Learning with Nonlinear Output Constraints Carlo Ciliberto Department of

PCA CS 446 Supervised learning So far, weve done supervised learning: Given (( x i , y i )) ,

Agenda Intro to Active Learning Activity Design Resources for Active Learning Lunch with Active

Not to Cry Wolf: Distantly Supervised Multitask Learning in Critical Care Patrick Schwab 1

Extending Unsupervised Neural Image Compression With Supervised Multitask Learning David Tellez,

Distant-supervised Heterogeneous multitask learning for social event forecasting with

COMPETITIVE MULTITASK MARINE TECHNOLOGY Ocean Cleaner Technology S.L. is a competitive marine

The Active Card An Active Mind in an Active Body More people, More Active, More often! The

Active Adversary Lecture 7 CCA Security MAC Active Adversary Active Adversary An active

Margin-based Semi-supervised Learning Using Apollonius circle MONA EMADI AND JAFAR TANHA T TC S

Generative Adversarial Networks (GANs) By: Ismail Elezi ismail.elezi@gmail.com Supervised

Multi-Task Active Learning Yi Zhang Outline Active Learning Multi-Task Active Learning

Machine Learning for NLP Supervised Learning Aurlie Herbelot 2019 Centre for Mind/Brain

Introduction to Scikit-Learn: Machine Learning with Introduction to Scikit-Learn: Machine Learning

Supervised Learning Prof. Kuan-Ting Lai 2020/4/9 Machine Learning Supervised Unsupervised

Batch Mode Active Learning and Its Application to Medical Image Classification ICML 2006 S. Hoi,

CLASS Customer Load Active System Services Victoria Turnham LCNI conference, Brighton Session

Exploring Complex Energy Networks Florian D orfler @ETH for Complex Systems Control

System Services (CLASS) Webinar Thursday 27 June 2013 Steve Cox Head of Future Networks What

DEWDROP: AN ENERGY- AWARE RUNTIME FOR COMPUTATIONAL RFID Michael Buettner (UW), Benjamin

ECE 697J Advanced Topics Advanced Topics ECE 697J in Computer Networks in Computer

Distribution Networks Laura Ramrez Elizondo Learning Objectives What is an active

Signatures of Time-dependent Heating in Active Regions and the Slow Solar Wind Organizers: Steve

Sambuz

Useful Links

Newsletter

Mail Us