SLIDE 1

Logistic Regression and POS Tagging

CSE392 - Spring 2019 Special Topic in CS

SLIDE 2

Task

  • Parts-of-Speech Tagging
  • Machine learning: how?

○ Logistic regression

SLIDE 3

Parts-of-Speech

Open Class: Nouns, Verbs, Adjectives, Adverbs
Function words: Determiners, conjunctions, pronouns, prepositions

SLIDE 4

Parts-of-Speech: The Penn Treebank Tagset

SLIDE 5

Parts-of-Speech: Social Media Tagset

(Gimpel et al., 2010)

SLIDE 6

POS Tagging: Applications

  • Resolving ambiguity (speech: “lead”)
  • Shallow searching: find noun phrases
  • Speed up parsing
  • Use as feature (or in place of word)

For this course:

  • An introduction to language-based classification (logistic regression)
  • Understand what modern deep learning methods are dealing with implicitly.
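As a taste of what an off-the-shelf tagger produces (a minimal sketch, not from the slides; it assumes NLTK is installed along with its tokenizer and tagger resources):

    import nltk

    # Assumes the required resources have been downloaded once, e.g.:
    #   nltk.download('punkt'); nltk.download('averaged_perceptron_tagger')
    for sent in ["They lead the league.", "The pipe is made of lead."]:
        tokens = nltk.word_tokenize(sent)
        print(nltk.pos_tag(tokens))  # list of (word, Penn Treebank tag) pairs
    # "lead" should come out as a verb (VBP) in the first sentence and a
    # noun (NN) in the second -- the ambiguity noted above.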
SLIDE 8

Logistic Regression

Binary classification goal: Build a “model” that can estimate P(A=1|B=?), i.e., given B, yield (or “predict”) the probability that A=1.

SLIDE 10

Logistic Regression

Binary classification goal: Build a “model” that can estimate P(Y=1|X=?), i.e., given X, yield (or “predict”) the probability that Y=1.

In machine learning, the tradition is to use Y for the variable being predicted and X for the features used to make the prediction.

SLIDE 11

Logistic Regression

Binary classification goal: Build a “model” that can estimate P(Y=1|X=?), i.e., given X, yield (or “predict”) the probability that Y=1. In machine learning, the tradition is to use Y for the variable being predicted and X for the features used to make the prediction.

Example: Y: 1 if target is a verb, 0 otherwise; X: 1 if “was” occurs before target, 0 otherwise.

I was reading for NLP. We were fine. I am good. The cat was very happy. We enjoyed the reading material. I was good.
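A minimal sketch of extracting this X (the tokenization and indexing scheme here are illustrative assumptions, not from the slides):

    def was_before(tokens, i):
        """X = 1 if "was" occurs immediately before the target token, else 0."""
        return 1 if i > 0 and tokens[i - 1].lower() == "was" else 0

    # Target "reading" in "I was reading for NLP."  -> X = 1 (and Y = 1: verb)
    print(was_before(["I", "was", "reading", "for", "NLP", "."], 2))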

SLIDE 13

Logistic Regression

Example: Y: 1 if target is a part of a proper noun, 0 otherwise; X: number of capital letters in target and surrounding words.

They attend Stony Brook University. Next to the brook Gandalf lay thinking. The trail was very stony. Her degree is from SUNY Stony Brook. The Taylor Series was first described by Brook Taylor, the mathematician.
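The slides don’t pin down the window of “surrounding words”; a sketch assuming the immediate neighbors:

    def count_capitals(tokens, i, window=1):
        """X = number of capital letters in the target token and the tokens
        within `window` positions of it (window size is an assumption)."""
        lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
        return sum(ch.isupper() for tok in tokens[lo:hi] for ch in tok)

    # Target "Brook" in "They attend Stony Brook University."
    tokens = ["They", "attend", "Stony", "Brook", "University", "."]
    print(count_capitals(tokens, 3))  # 3: the capitals in "Stony Brook University"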

SLIDE 15

Logistic Regression

Example: Y: 1 if target is a part of a proper noun, 0 otherwise; X: number of capital letters in target and surrounding words.

They attend Stony Brook University. Next to the brook Gandalf lay thinking. The trail was very stony. Her degree is from SUNY Stony Brook. The Taylor Series was first described by Brook Taylor, the mathematician.

[Table: x (capital-letter count) and y (proper-noun label) for each target]

SLIDE 22

Logistic Regression

Example: Y: 1 if target is a part of a proper noun, 0 otherwise; X: number of capital letters in target and surrounding words.

They attend Stony Brook University. Next to the brook Gandalf lay thinking. The trail was very stony. Her degree is from SUNY Stony Brook. The Taylor Series was first described by Brook Taylor, the mathematician. They attend Binghamton.

[Table: x and y for each target, now with a row for “Binghamton”]

  • Optimal b_0, b_1 changed!
SLIDE 23

Logistic Regression on a single feature (x)

Yi ∊ {0, 1}; X is a single value and can be anything numeric.

P(Yi = 1 | Xi) = 1 / (1 + e^−(B0 + B1·Xi))

SLIDE 29

Logistic Regression on a single feature (x)

Yi ∊ {0, 1}; X can be anything numeric. The goal of this function is to take in the variable x and return a probability that Y is 1:

    P(Yi = 1 | Xi) = 1 / (1 + e^−(B0 + B1·Xi))

Note that there are only three variables on the right: Xi, B0, B1. X is given; B0 and B1 must be learned.

HOW? Essentially, try different B0 and B1 values until “best fit” to the training data (example X and Y). “Best fit”: whatever maximizes the likelihood function:

    L(B0, B1) = ∏i pi^yi · (1 − pi)^(1 − yi),  where pi = P(Yi = 1 | Xi)

To estimate B0, B1, one can use reweighted least squares, iterating

    B ← (Xᵀ W X)⁻¹ Xᵀ W z,  with W diagonal (Wii = pi(1 − pi)) and z = XB + W⁻¹(y − p)

(Wasserman, 2005; Li, 2010)
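A NumPy sketch of these two pieces (the data and the B0, B1 values are made up for illustration):

    import numpy as np

    def p_hat(x, b0, b1):
        """P(Y=1 | X=x) under the logistic model."""
        return 1.0 / (1.0 + np.exp(-(b0 + b1 * x)))

    def log_likelihood(x, y, b0, b1):
        """log L = sum_i [ y_i*log(p_i) + (1 - y_i)*log(1 - p_i) ]"""
        p = p_hat(x, b0, b1)
        return np.sum(y * np.log(p) + (1 - y) * np.log(1 - p))

    x = np.array([2.0, 1.0, 6.0, 2.0, 1.0])  # hypothetical capital counts
    y = np.array([1.0, 0.0, 1.0, 1.0, 0.0])  # hypothetical labels
    # "Best fit" = whichever (b0, b1) gives the highest log-likelihood:
    print(log_likelihood(x, y, b0=-2.0, b1=1.5))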

SLIDE 31

X can be multiple features

Often we want to make a classification based on multiple features:

  • Number of capital letters surrounding: integer
  • Begins with capital letter: {0, 1}
  • Preceded by “the”? {0, 1}

We’re learning a linear (i.e. flat) separating hyperplane, but fitting it to a logit outcome.

(https://www.linkedin.com/pulse/predicting-outcomes-probabilities-logistic-regression-konstantinidis/)
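With k features the model becomes P(Y=1|X) = 1 / (1 + e^−(B0 + B1·x1 + … + Bk·xk)); a sketch with made-up weights:

    import numpy as np

    def p_hat(x, b0, b):
        """P(Y=1 | X=x) = sigmoid(b0 + b . x) for a feature vector x."""
        return 1.0 / (1.0 + np.exp(-(b0 + np.dot(b, x))))

    # Features: [number of capitals, begins-with-capital, preceded-by-"the"]
    x = np.array([3.0, 1.0, 0.0])
    b0, b = -4.0, np.array([1.2, 2.0, -1.5])  # illustrative values, not fitted
    print(p_hat(x, b0, b))
    # The separating hyperplane is b0 + b . x = 0, i.e. where P(Y=1|X) = 0.5.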
SLIDE 33

Logistic Regression on a single feature (x)

This is just one way of finding the betas that maximize the likelihood function. In practice, we will use existing libraries that are fast and support additional useful steps like regularization.
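For example, with scikit-learn (the feature values and labels below are made up for illustration):

    import numpy as np
    from sklearn.linear_model import LogisticRegression

    X = np.array([[2], [1], [6], [2], [1]])  # hypothetical capital counts
    y = np.array([1, 0, 1, 1, 0])            # hypothetical labels

    model = LogisticRegression()             # L2-regularized by default
    model.fit(X, y)
    print(model.intercept_, model.coef_)     # the learned b0 and b1
    print(model.predict_proba([[3]]))        # [P(Y=0), P(Y=1)] for x = 3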

SLIDE 35

Logistic Regression

Yi ∊ {0, 1}; X can be anything numeric. We’re still learning a linear separating hyperplane, but fitting it to a logit outcome.

(https://www.linkedin.com/pulse/predicting-outcomes-probabilities-logistic-regression-konstantinidis/)

Decision boundary (where P(Y=1|X) = 0.5): B0 + B1·x1 + … + Bk·xk = 0

SLIDE 37

Logistic Regression

Let’s add a feature!

Example: Y: 1 if target is a part of a proper noun, 0 otherwise; X1: number of capital letters in target and surrounding words; X2: does the target word start with a capital letter?

They attend Stony Brook University. Next to the brook Gandalf lay thinking. The trail was very stony. Her degree is from SUNY Stony Brook. The Taylor Series was first described by Brook Taylor, the mathematician. They attend Binghamton.

[Table: x2, x1, and y for each target]
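A sketch of refitting with the second feature (the table’s values did not survive, so these rows are illustrative):

    import numpy as np
    from sklearn.linear_model import LogisticRegression

    # Columns: x1 = capital count, x2 = target starts with a capital letter
    X = np.array([[3, 1], [1, 1], [2, 0], [6, 1], [2, 1], [1, 1]])  # illustrative
    y = np.array([1, 1, 0, 1, 1, 1])

    model = LogisticRegression().fit(X, y)
    print(model.coef_)  # now one learned weight per feature: [[b1, b2]]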

SLIDE 38

Machine Learning Goal: Generalize to new data

Training Data → Model → Testing Data

Does the model hold up?

SLIDE 39

Last concept for logistic regression!

Logistic Regression - Regularization

[Figure: a small feature matrix X (six predictors per example) and its outcome vector Y]

SLIDE 43

Last concept for logistic regression!

Logistic Regression - Regularization

[Figure: the same feature matrix X (six predictors per example) and outcome vector Y, now with a fitted model]

logit(Y) = 1.2 − 63·x1 + 179·x2 + 71·x3 + 18·x4 − 59·x5 + 19·x6

“overfitting”
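A sketch of this failure mode (random features and arbitrary labels stand in for the slide’s data; a huge C approximates “no regularization”):

    import numpy as np
    from sklearn.linear_model import LogisticRegression

    rng = np.random.default_rng(0)
    X = rng.random((8, 6))                   # only 8 examples, but 6 predictors
    y = np.array([0, 1, 0, 1, 1, 0, 1, 0])   # arbitrary labels: no real signal

    model = LogisticRegression(C=1e6, max_iter=10_000)  # ~unregularized
    model.fit(X, y)
    print(model.score(X, y))  # often a perfect 1.0 on the *training* data...
    # ...which cannot generalize, since the labels carry no signal: overfitting.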

SLIDE 46

Overfitting (1-d non-linear example)

[Figures: an “Underfit” model vs. an “Overfit” model]

(image credit: Scikit-learn; in practice data are rarely this clear)

SLIDE 49

Last concept for logistic regression!

Logistic Regression - Regularization

What if only 2 predictors?

[Figure: X reduced to two predictor columns (x1, x2), same Y]

logit(Y) = 0 + 2·x1 + 2·x2 → better fit

SLIDE 54

Last concept for logistic regression!

Logistic Regression - Regularization

L1 Regularization - “The Lasso”

Zeros out features by adding a penalty that keeps the betas from perfectly fitting the training data: set betas that maximize the penalized likelihood

    log L(B) − λ · Σj |Bj|

Sometimes written as: maximizing log L(B) − λ‖B‖1

SLIDE 55

Last concept for logistic regression!

Logistic Regression - Regularization

L2 Regularization - “Ridge”

Shrinks features by adding a penalty that keeps the betas from perfectly fitting the training data: set betas that maximize the penalized likelihood

    log L(B) − λ · Σj Bj²

Sometimes written as: maximizing log L(B) − λ‖B‖2²
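In scikit-learn terms (illustrative data), the two penalties differ by one argument; the Lasso tends to zero out some weights, while ridge only shrinks them:

    import numpy as np
    from sklearn.linear_model import LogisticRegression

    rng = np.random.default_rng(1)
    X = rng.random((50, 6))
    y = (X[:, 0] > 0.5).astype(int)  # only the first feature matters

    lasso = LogisticRegression(penalty="l1", solver="liblinear", C=0.5).fit(X, y)
    ridge = LogisticRegression(penalty="l2", C=0.5).fit(X, y)
    print(lasso.coef_)  # several coefficients should be exactly 0
    print(ridge.coef_)  # coefficients shrunk toward 0, but typically all nonzero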

SLIDE 57

Machine Learning Goal: Generalize to new data

Training Data → Model
Development Data: set regularization parameters
Testing Data: does the model hold up?
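A sketch of that loop (data, split sizes, and candidate C values are illustrative):

    import numpy as np
    from sklearn.linear_model import LogisticRegression
    from sklearn.model_selection import train_test_split

    rng = np.random.default_rng(2)
    X = rng.random((200, 6))
    y = (X[:, 0] > 0.5).astype(int)

    # Train on one part, tune regularization on dev, report on held-out test.
    X_tmp, X_test, y_tmp, y_test = train_test_split(X, y, test_size=0.2, random_state=0)
    X_train, X_dev, y_train, y_dev = train_test_split(X_tmp, y_tmp, test_size=0.25, random_state=0)

    best_C, best_acc = None, -1.0
    for C in [0.01, 0.1, 1.0, 10.0]:  # smaller C = stronger regularization
        acc = LogisticRegression(C=C).fit(X_train, y_train).score(X_dev, y_dev)
        if acc > best_acc:
            best_C, best_acc = C, acc

    final = LogisticRegression(C=best_C).fit(X_train, y_train)
    print(best_C, final.score(X_test, y_test))  # does the model hold up?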

SLIDE 58

Logistic Regression - Review

  • Classification: P(Y | X)
  • Learn logistic curve based on example data

○ training + development + testing data

  • Set betas based on maximizing the likelihood

○ “shifts” and “twists” the logistic curve

  • Multivariate features
  • Separation represented by hyperplane
  • Overfitting
  • Regularization
SLIDE 59

Example

See notebook on website.