Teaching the Basics of NLP and ML in an Introductory Course to - PowerPoint PPT Presentation

Teaching the Basics of NLP and ML in an Introductory Course to Information Science Apoorv Agarwal Columbia University Sunday, September 8, 13

COMS1001 Sunday, September 8, 13

COMS1001 • Introductory course on information science to undergraduates at Columbia University Sunday, September 8, 13

COMS1001 • Introductory course on information science to undergraduates at Columbia University • Mostly taken by freshmen and sophomores Sunday, September 8, 13

COMS1001 • Introductory course on information science to undergraduates at Columbia University • Mostly taken by freshmen and sophomores • Assumes no prior programming or math background Sunday, September 8, 13

COMS1001 • Introductory course on information science to undergraduates at Columbia University • Mostly taken by freshmen and sophomores • Assumes no prior programming or math background • 10% : what’s a programming language? Sunday, September 8, 13

Student demographics Sunday, September 8, 13

Student demographics Math and Engineering majors Sunday, September 8, 13

Student demographics Challenge 1: Cannot use Math terminology: vector space, dot product, high- dimensional space etc. Math and Engineering majors Sunday, September 8, 13

Traditionally taught topics Sunday, September 8, 13

Traditionally taught topics • About thirty 75 min lectures Sunday, September 8, 13

Traditionally taught topics • About thirty 75 min lectures • First half: Operating systems, WWW and the Internet, Binary and Machine Language, Spreadsheets, Database systems Sunday, September 8, 13

Traditionally taught topics • About thirty 75 min lectures • First half: Operating systems, WWW and the Internet, Binary and Machine Language, Spreadsheets, Database systems • Second half: Algorithms, Programming in Python Sunday, September 8, 13

Traditionally taught topics • About thirty 75 min lectures • First half: Operating systems, WWW and the Internet, Binary and Machine Language, Spreadsheets, Database systems • Second half: Algorithms, Programming in Python Challenge 2: Introduce NLP/ML in one lecture Sunday, September 8, 13

Overall Strategy Sunday, September 8, 13

Overall Strategy • Keep definitions simple Sunday, September 8, 13

Overall Strategy • Keep definitions simple • Use analogies and concrete examples (also observed by Reva Freedman 2005) Sunday, September 8, 13

Overall Strategy • Keep definitions simple • Use analogies and concrete examples (also observed by Reva Freedman 2005) • Take baby steps -- incremental learning Sunday, September 8, 13

Overall Strategy • Keep definitions simple • Use analogies and concrete examples (also observed by Reva Freedman 2005) • Take baby steps -- incremental learning • Introduce the core concepts in one lecture and build on them using homework and exam problems Sunday, September 8, 13

Strategy Sunday, September 8, 13

Strategy Sentiment analysis of tweets Sunday, September 8, 13

Strategy Sentiment analysis of movie reviews Sentiment analysis of tweets Sunday, September 8, 13

Strategy Sentiment analysis of movie reviews Sentiment analysis of tweets Email classification into Imp/Not-Imp Sunday, September 8, 13

Strategy Gear towards text processing Sentiment analysis of movie reviews Sentiment analysis of tweets Email classification into Imp/Not-Imp Sunday, September 8, 13

Strategy Gear towards text processing Sentiment analysis of movie reviews Sentiment analysis of tweets Email classification into Imp/Not-Imp Implement end-to-end SA pipeline Sunday, September 8, 13

Overview • Lecture organization • Questions asked in class • Performance on the mid-term examination • Final projects • Conclusion Sunday, September 8, 13

Lecture Organization Sunday, September 8, 13

Lecture Organization • General discussion on how to define intelligence Sunday, September 8, 13

Lecture Organization • General discussion on how to define intelligence • Introduce a concrete application: sentiment analysis of Twitter data Sunday, September 8, 13

Lecture Organization • General discussion on how to define intelligence • Introduce a concrete application: sentiment analysis of Twitter data • Demonstrate annotation process Sunday, September 8, 13

Lecture Organization • General discussion on how to define intelligence • Introduce a concrete application: sentiment analysis of Twitter data • Demonstrate annotation process • Demonstrate feature extraction Sunday, September 8, 13

Lecture Organization • General discussion on how to define intelligence • Introduce a concrete application: sentiment analysis of Twitter data • Demonstrate annotation process • Demonstrate feature extraction • Demonstrate a basic classification process Sunday, September 8, 13

Points we drive home Sunday, September 8, 13

Points we drive home 1. The machine automatically learns the connotation of words by looking at how often certain words appear in positive and negative tweets. Sunday, September 8, 13

Points we drive home 1. The machine automatically learns the connotation of words by looking at how often certain words appear in positive and negative tweets. 2. The machine also learns more complex patterns that have to do with the conjunction and disjunction of features. Sunday, September 8, 13

Points we drive home 1. The machine automatically learns the connotation of words by looking at how often certain words appear in positive and negative tweets. 2. The machine also learns more complex patterns that have to do with the conjunction and disjunction of features. 3. The quality and amount of training data is important – for if the training data fails to encode a substantial number of patterns important for classification, the machine is not going to learn well. Sunday, September 8, 13

Questions asked in class by students Sunday, September 8, 13

Questions asked in class by students 1. Could we create and use a dictionary that lists the prior polarity of commonly used words? Sunday, September 8, 13

Questions asked in class by students 1. Could we create and use a dictionary that lists the prior polarity of commonly used words? 2. If the prediction score for the tweet is high, does that mean the machine is more confident about the prediction? Sunday, September 8, 13

Questions asked in class by students 1. Could we create and use a dictionary that lists the prior polarity of commonly used words? 2. If the prediction score for the tweet is high, does that mean the machine is more confident about the prediction? 3. In the unigram approach, the sequence of words does not matter. But clearly, if “not” does not negate the words containing opinion, then won’t the machine learn a wrong pattern? Sunday, September 8, 13

Questions asked in class by students 1. Could we create and use a dictionary that lists the prior polarity of commonly used words? 2. If the prediction score for the tweet is high, does that mean the machine is more confident about the prediction? 3. In the unigram approach, the sequence of words does not matter. But clearly, if “not” does not negate the words containing opinion, then won’t the machine learn a wrong pattern? 4. If we have too many negative tweets in our training data (as compared to the positive tweets), then would the machine not be predisposed to predict the polarity of an unseen tweet as negative? Sunday, September 8, 13

Mid-term: Email classification • 53 students • Required to do only 2 out of the following 4 problems Sunday, September 8, 13

Mid-term: Email classification • 53 students • Required to do only 2 out of the following 4 problems Problem (25 points) NLP/ML Logic Gates Database design Machine Instructions Sunday, September 8, 13

Mid-term: Email classification • 53 students • Required to do only 2 out of the following 4 problems Problem Average (25 points) NLP/ML 20.54 Logic Gates 16.94 Database design 13.63 Machine 12.8 Instructions Sunday, September 8, 13

Mid-term: Email classification • 53 students • Required to do only 2 out of the following 4 problems Problem Average Std-dev (25 points) NLP/ML 20.54 4.46 Logic Gates 16.94 6.48 Database design 13.63 6.48 Machine 12.8 6.81 Instructions Sunday, September 8, 13

Mid-term: Email classification • 53 students • Required to do only 2 out of the following 4 problems Problem Average Std-dev Median (25 points) NLP/ML 20.54 4.46 22 Logic Gates 16.94 6.48 20 Database design 13.63 6.48 14 Machine 12.8 6.81 14.5 Instructions Sunday, September 8, 13

Mid-term: Email classification • 53 students • Required to do only 2 out of the following 4 problems Problem # students Average Std-dev Median (25 points) attempted NLP/ML 20.54 4.46 22 51 Logic Gates 16.94 6.48 20 36 Database design 13.63 6.48 14 42 Machine 12.8 6.81 14.5 30 Instructions Sunday, September 8, 13

Student projects Sunday, September 8, 13

Student projects • Formulate your own task Sunday, September 8, 13

Teaching the Basics of NLP and ML in an Introductory Course to - PowerPoint PPT Presentation

Teaching the Basics of NLP and ML in an Introductory Course to Information Science Apoorv Agarwal Columbia University Sunday, September 8, 13 COMS1001 Sunday, September 8, 13 COMS1001 Introductory course on information science to

BIBLICAL SURVEY Introductory Class Introductory Class BIBLICAL SURVEY Introductory Class

SI485i : NLP Missing Topics and the Future Who cares about NLP? NLP has expanded quickly

SI425 : NLP Missing Topics and the Future Who cares about NLP? NLP has expanded quickly

Recurrent Neural Networks Graham Neubig Site https://phontron.com/class/nn4nlp2017/ NLP and

NLP: Two pictures Wordnet and Word Sense Problem NLP Disambiguation Semantics NLP Trinity

Ontologies for NLP NLP for Ontologies FOIS 2014 - LogOnto Workshop on Logics and Ontologies for

NLP Programming Tutorial 0 - Programming Basics Graham Neubig Nara Institute of Science and

Hybrid NLP Hybrid NLP O UTLINE O UTLINE Problems of Deep and Shallow Processing

Facing NLP German Rigau i Claramunt http://adimen.si.ehu.es/~rigau IXA group Departamento de

IXA pipes: Efficient and Ready to Use Multilingual NLP tools Rodrigo Agerri IXA NLP Group,

Natural Language Processing (NLP) In 11-711 Algorithms for NLP we take an

NLP Programming Tutorial 4 - Word Segmentation Graham Neubig Nara Institute of Science and

SI485i : NLP Set 12 Features and Prediction What is NLP, really? Many of our tasks boil down

Prominent Research Directions in NLP Alexander Panchenko Assistant Professor for NLP About

Deep Learning for NLP Kiran Vodrahalli Feb 11, 2015 Overview What is NLP? Natural

Capsule Networks for NLP Will Merrill Advanced NLP 10/25/18 Capsule Networks: A Better ConvNet

Ignition SCADA for Water/Wastewater An Introduction to Inductive Automations Ignition software

From Student to Software From Student to Software Engineer in the Indian IT Engineer in the

CSE 373: Data Structure & Algorithms Lecture 25: Programming Languages Nicki Dell Spring

ICANN Language Services Community Translation Christina Rodriguez ICANN | ICANN55 | 8

Linda Grove-Paul, LCSW, MPA VP for Recovery and Innovation, Centerstone of Indiana Agenda

Nevada Housing Recovery ry Christine Hess, Executive Director June 11, 2020 BOARD OF DIRECTORS

The Arctic ScienceandTechnology InformationSystem Arctic Science and Technology Information

TSDM Transportation System Data Management Presentation By BACKGROUND DelDOT owns

Teaching the Basics of NLP and ML in an Introductory Course to - PowerPoint PPT Presentation

Teaching the Basics of NLP and ML in an Introductory Course to Information Science Apoorv Agarwal Columbia University Sunday, September 8, 13 COMS1001 Sunday, September 8, 13 COMS1001 Introductory course on information science to

BIBLICAL SURVEY Introductory Class Introductory Class BIBLICAL SURVEY Introductory Class

SI485i : NLP Missing Topics and the Future Who cares about NLP? NLP has expanded quickly

SI425 : NLP Missing Topics and the Future Who cares about NLP? NLP has expanded quickly

Recurrent Neural Networks Graham Neubig Site https://phontron.com/class/nn4nlp2017/ NLP and

NLP: Two pictures Wordnet and Word Sense Problem NLP Disambiguation Semantics NLP Trinity

Ontologies for NLP NLP for Ontologies FOIS 2014 - LogOnto Workshop on Logics and Ontologies for

NLP Programming Tutorial 0 - Programming Basics Graham Neubig Nara Institute of Science and

Hybrid NLP Hybrid NLP O UTLINE O UTLINE Problems of Deep and Shallow Processing

Facing NLP German Rigau i Claramunt http://adimen.si.ehu.es/~rigau IXA group Departamento de

IXA pipes: Efficient and Ready to Use Multilingual NLP tools Rodrigo Agerri IXA NLP Group,

Natural Language Processing (NLP) In 11-711 Algorithms for NLP we take an

NLP Programming Tutorial 4 - Word Segmentation Graham Neubig Nara Institute of Science and

SI485i : NLP Set 12 Features and Prediction What is NLP, really? Many of our tasks boil down

Prominent Research Directions in NLP Alexander Panchenko Assistant Professor for NLP About

Deep Learning for NLP Kiran Vodrahalli Feb 11, 2015 Overview What is NLP? Natural

Capsule Networks for NLP Will Merrill Advanced NLP 10/25/18 Capsule Networks: A Better ConvNet

Ignition SCADA for Water/Wastewater An Introduction to Inductive Automations Ignition software

From Student to Software From Student to Software Engineer in the Indian IT Engineer in the

CSE 373: Data Structure &amp; Algorithms Lecture 25: Programming Languages Nicki Dell Spring

ICANN Language Services Community Translation Christina Rodriguez ICANN | ICANN55 | 8

Linda Grove-Paul, LCSW, MPA VP for Recovery and Innovation, Centerstone of Indiana Agenda

Nevada Housing Recovery ry Christine Hess, Executive Director June 11, 2020 BOARD OF DIRECTORS

The Arctic ScienceandTechnology InformationSystem Arctic Science and Technology Information

TSDM Transportation System Data Management Presentation By BACKGROUND DelDOT owns

CSE 373: Data Structure & Algorithms Lecture 25: Programming Languages Nicki Dell Spring