Practical Unsupervised Learning

INFO/CS 4300, Spring 2016, Jack Hessel


SLIDE 1

Practical Unsupervised Learning

INFO/CS 4300, Spring 2016 Jack Hessel

SLIDE 2

Unsupervised Learning is Cool!

SLIDE 3

But how can we use this in our projects?

SLIDE 4

But first, let’s look at our dataset...

SLIDES 5-12

Data Dimensionality!

[Scatter plot, built up over several slides: Cigarettes Consumed Per Day on one axis, Probability of Lung Cancer Developing on the other. The data points fall along a single line. Annotation: "Difference from mean height?"]

1 Dimension!
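The "lives in 1 dimension" point is easy to check numerically: if 2-D points really fall on a line, the centered data matrix has rank 1. A minimal sketch with made-up smoking numbers (all values hypothetical):

```python
import numpy as np

# Toy 2-D points that lie exactly on a line (hypothetical data).
cigarettes = np.array([0.0, 5.0, 10.0, 15.0, 20.0, 25.0])
prob_cancer = 0.01 + 0.002 * cigarettes          # perfectly linear relationship
X = np.column_stack([cigarettes, prob_cancer])   # shape (6, 2)

# After centering, one direction explains all the variation: rank 1.
centered = X - X.mean(axis=0)
rank = np.linalg.matrix_rank(centered)
```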

SLIDES 13-20

Words and documents are the same way...

[Diagram, built up over several slides: a term-document matrix X_tfidf with |V| rows (one per vocabulary word, e.g. "Pineapple") and |D| columns (one per document, e.g. "Pineapples were recalled…"); an X marks a single entry.]

(but, really -- a low dimensional subspace…)
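The term-document matrix these slides describe can be built in a few lines. A minimal sketch with a hypothetical three-document corpus and one common tf-idf variant (raw count times log |D|/df); real projects usually use a library vectorizer instead:

```python
import math
from collections import Counter

# Toy corpus: |D| = 3 documents (hypothetical example data).
docs = [
    "pineapples were recalled today",
    "pineapple prices rose",
    "the market fell today",
]
tokenized = [d.split() for d in docs]

# |V| vocabulary words, sorted for a stable row order.
vocab = sorted({w for doc in tokenized for w in doc})

# Document frequency and idf(w) = log(|D| / df(w)), one common variant.
df = Counter(w for doc in tokenized for w in set(doc))
idf = {w: math.log(len(docs) / df[w]) for w in vocab}

# X_tfidf is |V| x |D|: rows are words, columns are documents.
X = [[doc.count(w) * idf[w] for doc in tokenized] for w in vocab]
```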

SLIDES 21-24

Key questions in unsupervised NLP:

  • 1. How many dimensions does our dataset actually live in?
  • 2. How do we project our data down to those dimensions?
  • 3. Does any of this stuff actually do anything for our projects?

SLIDES 25-28

Key tool in Linear Algebra, NLP, Machine Learning, Data Science, Computer Vision, Algorithms, Matrix Computations, Optimization, Statistics...

SLIDES 29-34

[Diagram, built up over several slides: X_tfidf (a |V| x |D| matrix) written as a sum of rank-1 terms, each a length-|V| column vector times a length-|D| row vector, weighted by a scalar: X_tfidf = k1 u1 v1^T + k2 u2 v2^T + …, i.e. column vectors times diag(k1, k2, …) times row vectors.]
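The rank-1 sum in this diagram is exactly what the singular value decomposition produces. A sketch with synthetic data (toy sizes; `u1`, `v1`, and the weights 3.0 and 2.0 are made up for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)
# A toy |V| x |D| matrix built to be exactly rank 2, as in the diagram.
u1, u2 = rng.standard_normal(6), rng.standard_normal(6)  # length-|V| columns
v1, v2 = rng.standard_normal(4), rng.standard_normal(4)  # length-|D| rows
X = 3.0 * np.outer(u1, v1) + 2.0 * np.outer(u2, v2)      # k1*u1*v1^T + k2*u2*v2^T

# SVD recovers such a decomposition: X = U @ diag(s) @ Vt.
U, s, Vt = np.linalg.svd(X)

# Only two singular values are (numerically) nonzero:
# the data "lives in" 2 dimensions.
rank = int(np.sum(s > 1e-10))

# Summing the top-rank rank-1 terms reconstructs X.
X_k = sum(s[i] * np.outer(U[:, i], Vt[i]) for i in range(rank))
```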

SLIDE 35

SLIDE 36

Key questions in unsupervised NLP:

  • 1. How many dimensions does our dataset actually live in?
  • 2. How do we project our data down to those dimensions?
SLIDE 37

Enough talk, time for some magic.

SLIDES 38-40

[Diagram: the tf-idf matrix X_tfidf (|V| x |D|) plus the low-rank decomposition from the previous slides gives:]

Latent Semantic Indexing (LSI) = Latent Semantic Analysis (LSA)
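In code, LSI amounts to taking the truncated SVD of the tf-idf matrix and keeping the top k dimensions as document representations. A sketch with a hand-made toy matrix (the numbers and k = 2 are illustrative only):

```python
import numpy as np

# Pretend tf-idf matrix: |V| = 5 words, |D| = 4 documents (toy numbers).
# Columns 0/1 and 2/3 use mostly disjoint words.
X = np.array([
    [1.0, 0.9, 0.0, 0.0],
    [0.8, 1.0, 0.1, 0.0],
    [0.0, 0.1, 1.0, 0.9],
    [0.0, 0.0, 0.8, 1.0],
    [0.2, 0.1, 0.1, 0.2],
])

U, s, Vt = np.linalg.svd(X, full_matrices=False)

k = 2  # keep the top-k latent "semantic" dimensions
docs_k = (np.diag(s[:k]) @ Vt[:k]).T  # each row: a document in k-dim LSI space

def cos(a, b):
    """Cosine similarity between two vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))
```

Documents 0 and 1 end up close together in the latent space, and far from documents 2 and 3, mirroring their word overlap.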

SLIDE 41

As a side note...

Great first NLP paper to read! Highly accessible :-)

Scott Deerwester

“Indexing by latent semantic analysis” - Deerwester et al. 1990

SLIDES 42-45

What are topic models?

[Diagram: X_tfidf (|V| x |D|) factored into a |V| x k matrix times a k x |D| matrix.]

  • Latent semantic indexing (Deerwester et al. 1990)
  • Non-negative matrix factorization (Lee and Seung 1999)
  • Latent Dirichlet allocation (Blei et al. 2003)
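Of the three, non-negative matrix factorization is the easiest to sketch from scratch, using Lee and Seung's multiplicative updates (toy random data; the sizes, iteration count, and `eps` are arbitrary choices, not tuned values):

```python
import numpy as np

rng = np.random.default_rng(0)
n_words, n_docs, k = 6, 8, 2      # |V| words, |D| docs, k topics (toy sizes)
X = rng.random((n_words, n_docs)) # stand-in for a nonnegative count/tf-idf matrix

W = rng.random((n_words, k))      # word-topic factor
H = rng.random((k, n_docs))       # topic-document factor

eps = 1e-9  # avoids division by zero
for _ in range(200):
    # Lee & Seung (1999) multiplicative updates for Frobenius-norm NMF;
    # both factors stay elementwise nonnegative by construction.
    H *= (W.T @ X) / (W.T @ W @ H + eps)
    W *= (X @ H.T) / (W @ H @ H.T + eps)

err = np.linalg.norm(X - W @ H)   # reconstruction error after fitting
```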

SLIDES 46-49

Why do we care??

[Diagram: the k-dimensional factors of the |V| x |D| matrix.]

Interpretable, small number of features for text classification!

SLIDES 50-54

Document length *matters a lot*

Different regimes of supervised NLP (Jack’s opinions only! Lots of caveats!)

[Spectrum from fewer to more words, with a boundary around 50-100 words:]

  • Fewer words: topic models fail.
  • More words: topic models work.
  • Naive Bayes and n-gram features + a linear classifier are almost always pretty good in practice :-)

SLIDE 55

So can we see if our Kickstarter will be successful?