CMPT 413: Computational Linguistics CMPT 825: Natural Language Processing
Angel Xuan Chang
2020-09-09
1
SFU NatLangLab
Adapted from slides from Anoop Sarkar, Danqi Chen and Karthik Narasimhan
CMPT 413: Computational Linguistics CMPT 825: Natural Language - - PowerPoint PPT Presentation
SFU NatLangLab CMPT 413: Computational Linguistics CMPT 825: Natural Language Processing Angel Xuan Chang 2020-09-09 Adapted from slides from Anoop Sarkar, Danqi Chen and Karthik Narasimhan 1 NLP is everywhere Google translate Virtual
Angel Xuan Chang
2020-09-09
1
SFU NatLangLab
Adapted from slides from Anoop Sarkar, Danqi Chen and Karthik Narasimhan
2
Google translate Virtual assistants
3
Question Answering
IBM Watson defeated two of Jeopardy's greatest champions in 2011
speaker listener
I got a new puppy!
http://pyersqr.org/classes/Ling731/Space2.htm
(image credit: https://www.enterrasolutions.com/blog/computational-linguistics-and-natural-language-processing/)
Ted Underwood, David Bamman, and Sabrina Lee (2018), "The Transformation of Gender in English-Language Fiction," Cultural Analytics
Georgetown- IBM experiment, 1954
“Within three or five years, machine translation will be a solved problem”
Video of actual system: https://www.youtube.com/watch?v=bo4RvYJYOzI
Person: Pick up a big red block. Computer: OK. Person: Grasp the pyramid. Computer: I don't understand which pyramid you mean. Person (changing their mind): Find a block which is taller than the one you are holding and put it into the box. Computer: By "it", I assume you mean the block which is taller than the one I am holding. Computer: OK. Person: What does the box contain? Computer: The blue pyramid and the blue block. Person: What is the pyramid supported by? Computer: The box.
Lots of rules!
15
Rule-based system Input text (+other information) Generate output based on rules Set of rules
Men are all alike. IN WHAT WAY They're always bugging us about something or other. CAN YOU THINK OF A SPECIFIC EXAMPLE Well, my boyfriend made me come here. YOUR BOYFRIEND MADE YOU COME HERE He says I'm depressed much of the time. I AM SORRY TO HEAR YOU ARE DEPRESSED
Rogerian psychologist: reflect back what the patient said
rules based on keywords
words in sentence
Demo: http://psych.fullerton.edu/mbirnbaum/psych101/Eliza.htm
(Adapted from slides: Stanford CS124N, Dan Jurafsky)
Backoff
Please go on That’s very interesting I see
Rosie from the Jetsons
The Far Side - Gary Larson
Real newspaper headlines! Kids make nutritious snacks Stolen painting found by tree Miners refuse to work after death Squad helps dog bite victim Killer sentenced to die for second time in 10 years Lack of brains hinders research
Interpretation of language assumes a common basis of world knowledge and context
Herb Clark
“bank” “bat”
Table Counter
“I put the bowl on the table” “The numbers in the table don’t add up”
https://www.katrinascards.com/product/elephant-my-pajamas-large-card
24
25
https://christophm.github.io/interpretable-ml-book/terminology.html
IBM Models for translation Speech recognition
Anytime a linguist leaves the group the (speech) recognition rate goes up
English: Machine translation is cool!
36M sentence pairs
Russian: Машинный перевод - это крутo!
CleverBot (2010) How it works:
and copy the response
What do you get?
https://www.cleverbot.com/
Meena (Google, 2020)
https://ai.googleblog.com/2020/01/towards-conversational-agent-that-can.html
How it works:
turns (over 40B words)
(2.6 billion parameters) for 30 days on 2048 TPUs cores
sentence
Can you guess: Computer or human?
Imagine an "Imitation Game," in which a man and a woman go into separate rooms and guests try to tell them apart by writing a series
typewritten answers sent back. In this game both the man and the woman aim to convince the guests that they are the other. We now ask the question, "What will happen when a machine takes the part of A in this game?" Will the interrogator decide wrongly as often when the game is played like this as he does when the game is played between a man and a woman? These questions replace
Alan Turing
https://www.youtube.com/watch?v=D5VN56jQMWM&feature=youtu.be&t=70
City: Cambridge, MA Founded: 1861 Mascot: Tim the Beaver …
The Massachusetts Institute of Technology (MIT) is a private research university in Cambridge, Massachusetts,
prestigious universities. Founded in 1861 in response to the increasing industrialization of the United States, …
Article Database
Information Extraction: State of the Art
Dependence on large training sets
ACE: 300K words Freebase: 24M relations Not available for many domains (ex. medicine, crime)
Challenging task: even large corpora do not guarantee high performance ~ 75% F1 on relation extraction (ACE) ~ 58% F1 on event extraction (ACE)
(Wu et al., 2016)
https://talktotransformer.com/
Instructor TAs Angel Chang Ali Gholami Yue Ruan Sonia Raychaudhuri
and pytorch will be used.
be comfortable with taking multivariable derivatives
There will be optional tutorials that will help review these topics.