Text Classification Contd + Document Representations
- Prof. Sameer Singh
CS 295: STATISTICAL NLP WINTER 2017
January 17, 2017
Based on slides from Nathan Schneider, Noah Smith, Dan Klein and everyone else they copied from.
Text Classification Contd + Document Representations Prof. Sameer - - PowerPoint PPT Presentation
Text Classification Contd + Document Representations Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 17, 2017 Based on slides from Nathan Schneider, Noah Smith, Dan Klein and everyone else they copied from. Outline Logistic
January 17, 2017
Based on slides from Nathan Schneider, Noah Smith, Dan Klein and everyone else they copied from.
CS 295: STATISTICAL NLP (WINTER 2017) 2
CS 295: STATISTICAL NLP (WINTER 2017) 3
CS 295: STATISTICAL NLP (WINTER 2017) 4
Human machine interface for ABC computer applications
Paper Title CS Area
CS 295: STATISTICAL NLP (WINTER 2017) 5
Human machine interface for ABC computer applications
CS 295: STATISTICAL NLP (WINTER 2017) 6
CS 295: STATISTICAL NLP (WINTER 2017) 7
CS 295: STATISTICAL NLP (WINTER 2017) 8
CS 295: STATISTICAL NLP (WINTER 2017) 9
CS 295: STATISTICAL NLP (WINTER 2017) 10
CS 295: STATISTICAL NLP (WINTER 2017) 11
CS 295: STATISTICAL NLP (WINTER 2017) 12
CS 295: STATISTICAL NLP (WINTER 2017) 13
Sparsity of Words
CS 295: STATISTICAL NLP (WINTER 2017) 14
Why use log(proportion)
CS 295: STATISTICAL NLP (WINTER 2017) 15
Overfitting
For a word that occurs 10 times… There are many that occur ~10 times!
CS 295: STATISTICAL NLP (WINTER 2017) 16
Fixing Overfitting
Regularization Strength Accuracy
CS 295: STATISTICAL NLP (WINTER 2017) 17
CS 295: STATISTICAL NLP (WINTER 2017) 18
CS 295: STATISTICAL NLP (WINTER 2017) 19
CS 295: STATISTICAL NLP (WINTER 2017) 20
CS 295: STATISTICAL NLP (WINTER 2017) 21
CS 295: STATISTICAL NLP (WINTER 2017) 22
Many hidden layers In NLP, utilize unlabeled data to learn representations… (next lecture)
CS 295: STATISTICAL NLP (WINTER 2017) 23
CS 295: STATISTICAL NLP (WINTER 2017) 24
Relation of user perceived response time to error measurement A survey of user opinion of computer system response time The generation of random, binary, ordered trees
CS 295: STATISTICAL NLP (WINTER 2017) 25
Advantages
CS 295: STATISTICAL NLP (WINTER 2017) 26
CS 295: STATISTICAL NLP (WINTER 2017) 27
Local Weighting
Global Weighting
CS 295: STATISTICAL NLP (WINTER 2017) 28
c1: Human machine interface for ABC computer applications c2: A survey of user opinion of computer system response time c3: The EPS user interface management system c4: System and human system engineering testing of EPS c5: Relation of user perceived response time to error measurement m1: The generation of random, binary, ordered trees m2: The intersection graph of paths in trees m3: Graph minors IV: Widths of trees and well-quasi-ordering m4: Graph minors: A survey
From http://lsa.colorado.edu/papers/dp1.LSAintro.pdf
CS 295: STATISTICAL NLP (WINTER 2017) 29 c1 c2 c3 c4 c5 m1 m2 m3 m4
human interface computer user system response time EPS survey trees graph minors
CS 295: STATISTICAL NLP (WINTER 2017) 30
c1 c2 c3 c4 c5 m1 m2 m3 m4
c1 c2 c3 c4 c5 m1 m2 m3 m4
CS 295: STATISTICAL NLP (WINTER 2017) 31
c1: Human machine interface for ABC computer applications c2: A survey of user opinion of computer system response time m4: Graph minors: A survey
CS 295: STATISTICAL NLP (WINTER 2017) 32
c1 c2 c3 c4 c5 m1 m2 m3 m4
c1 c2 c3 c4 c5 m1 m2 m3 m4
CS 295: STATISTICAL NLP (WINTER 2017) 33
CS 295: STATISTICAL NLP (WINTER 2017) 34
c1 c2 c3 c4 c5 m1 m2 m3 m4 c1 c2 c3 c4 c5 m1 m2 m3 m4
CS 295: STATISTICAL NLP (WINTER 2017) 35
Homework
Project