Natural Language Processing Lecture 27: Conclusion Levels of - - PowerPoint PPT Presentation

natural language processing
SMART_READER_LITE
LIVE PREVIEW

Natural Language Processing Lecture 27: Conclusion Levels of - - PowerPoint PPT Presentation

Natural Language Processing Lecture 27: Conclusion Levels of Linguistc nowledge spoken phonetcs writen orthography phonology shallower morphology syntax semantcs deeper pragmatcs discourse


slide-1
SLIDE 1

Natural Language Processing

Lecture 27: Conclusion

slide-2
SLIDE 2

Levels of Linguistc nowledge

phonetcs phonology

  • rthography

morphology syntax semantcs pragmatcs discourse “shallower” “deeper” spoken writen

slide-3
SLIDE 3

uygarlastramadıklarımızdanmıssınızcasına

“(behaving) as if you are among those whom we could not civilize”

slide-4
SLIDE 4

uygarlastramadıklarımızdanmıssınızcasına

“(behaving) as if you are among those whom we could not civilize” uygar “civilized” +las “become” +tr “cause to” +ama “not able” +dık past partciple +lar plural +ımız frst person plural possessive (“our”) +dan second person plural (“y’all”) +mıs past +sınız ablatve case (“from/among”) +casına fnite verb → adverb (“as if”)

slide-5
SLIDE 5

Finite-State Automaton

  • Q: a fnite set of states
  • q0

Q: a special start state ∈

  • F

Q: a set of fnal states ⊆

  • Σ: a fnite alphabet
  • Transitons:
  • Encodes a set of strings that can be recognized by

following paths from q0 to some state in F.

qi qj

s

Σ* ∈

... ...

slide-6
SLIDE 6

Levels of Linguistc nowledge

phonetcs phonology

  • rthography

morphology syntax semantcs pragmatcs discourse “shallower” “deeper” spoken writen

ambiguity

slide-7
SLIDE 7

Noisy Channel

source source channel

y x

decode

What you want What you see

slide-8
SLIDE 8

Noisy Channel

source source channel

y x

decode

Cats meow

  • fen

NN VB RB

slide-9
SLIDE 9

Noisy Channel

source source channel

y x

decode

你好吗? How are you?

slide-10
SLIDE 10

Noisy Channel

source source channel

y x

decode

Okay, Google

slide-11
SLIDE 11

Startng and Stopping

Unigram model:

...

Bigram model:

...

Trigram model:

...

slide-12
SLIDE 12

Language Modeling Questons

  • Why do we use context?
  • What does smoothing do, and why is it

necessary?

  • What do we use to evaluate language

models?

slide-13
SLIDE 13

Tagging

slide-14
SLIDE 14

Broad POS categories

closed classes

  • pen classes

nouns verbs adjectves adverbs prepositons determiners pronouns conjunctons auxiliary verbs partcles numerals

slide-15
SLIDE 15

Syntax

slide-16
SLIDE 16

Parsing

  • C Y vs. Earley’s Algorithm

– Both dynamic programming – CNF vs. general forms

slide-17
SLIDE 17

C Y Algorithm: Chart

Noun, Verb

  • VP,S
  • S

book

Det NP

  • NP

this

Noun

  • fmight

Prep PP

through

PNoun, NP

Houston

slide-18
SLIDE 18

C Y Equatons C Y Equatons

slide-19
SLIDE 19

Semantcs

slide-20
SLIDE 20

Where’s the beef?

Sentences from the brown corpus. Extracted from the concordancer in The Compleat Lexical Tutor, htp://www.lextutor.ca/

slide-21
SLIDE 21

chicken

slide-22
SLIDE 22

Synsets for dog (n)

  • S: (n) dog, domestc dog, Canis familiaris (a member of the genus Canis

(probably descended from the common wolf) that has been domestcated by man since prehistoric tmes; occurs in many breeds) "the dog barked all night"

  • S: (n) frump, dog (a dull unatractve unpleasant girl or woman) "she got a

reputaton as a frump"; "she's a real dog"

  • S: (n) dog (informal term for a man) "you lucky dog"
  • S: (n) cad, bounder, blackguard, dog, hound, heel (someone who is

morally reprehensible) "you dirty dog"

  • S: (n) frank, frankfurter, hotdog, hot dog, dog, wiener, wienerwurst,

weenie (a smooth-textured sausage of minced beef or pork usually smoked; ofen served on a bread roll)

  • S: (n) pawl, detent, click, dog (a hinged catch that fts into a notch of a

ratchet to move a wheel forward or prevent it from moving backward)

  • S: (n) andiron, fredog, dog, dog-iron (metal supports for logs in a

freplace) "the andirons were too hot to touch"

22

slide-23
SLIDE 23
slide-24
SLIDE 24

Entty Linking

Mary picked up the ball. She threw it to me.

slide-25
SLIDE 25

Semantc oles

PropBank is a set of verb-sense-specifc “frames” with informal descriptons for their arguments. Consider the word “Agree”

  • ARG0: agreer
  • ARG1: propositon
  • ARG2: other entty agreeing

[The group] ARG0 agreed [it wouldn’t make an ofer]ARG1. Usually [John] ARG0 agrees [with Mary on everything] ARG2.

slide-26
SLIDE 26

“Fall (move downward)” in PropBank

  • arg1: logical subject, patent, thing falling
  • arg2: extent, amount fallen
  • arg3: startng point
  • arg4: ending point
  • argM-loc: medium

Sales fell to $251.2 million from $278.8 million. The average junk bond fell by 4.2%. The meteor fell through the atmosphere, crashing into Cambridge.

slide-27
SLIDE 27

M L #1: First-Order Logic

DressCode(ThePorch) Serves(UnionGrill, AmericanFood) estaurant(UnionGrill) Have(Speaker, FiveDollars) ^ ¬ Have(Speaker, LotOfTime) ∀x Person(x) Have(x, FiveDollars) ⇒ ∃x,y Person(x) ^ estaurant(y) ^ ¬HasVisited(x,y) Functon Predicates

slide-28
SLIDE 28

First Order Logic: Advantages

  • Flexible
  • Well-understood
  • Widely used
slide-29
SLIDE 29

EM

  • We ofen have unlabeled or incomplete data
  • EM is an for learning without labels, e.g.,

“classifcaton” without classes

  • Pick ra

ndom centroids!

  • Itera

te the following :!

  • Use centroids to la

bel the da ta !

  • Com

pute centroids using the la beled da ta !

  • Keep doing

this until la bels don’t cha ng e

E-step M-step

slide-30
SLIDE 30

NLP Uses NLP Uses

Answer questions using the Web Answer questions using the Web Translate documents from one language to another Translate documents from one language to another Do library research; summarize Do library research; summarize Manage messages intelligently Manage messages intelligently Help make informed decisions Help make informed decisions Follow directions given by any user Follow directions given by any user Fix your spelling or grammar Fix your spelling or grammar Grade exams Grade exams Write poems or novels Write poems or novels Listen and give advice Listen and give advice Estimate public opinion Estimate public opinion Read everything and make predictions Read everything and make predictions Interactively help people learn Interactively help people learn Help disabled people Help disabled people Help refugees/disaster victims Help refugees/disaster victims Document or reinvigorate indigenous languages Document or reinvigorate indigenous languages

slide-31
SLIDE 31

More NLP ...

  • Language Technologies Minor

– 4 LT courses plus LT project

  • 5th year Masters in Language Technologies
slide-32
SLIDE 32

More NLP Courses

  • 11-492/692 Speech Processing

– Fall: Alan W Black – Practcal Systems for Speech

  • 11-711 Algorithms and NLP

– Fall: Yulia Tsvetkov, obert Frederking – esearch oriented

  • 11-727 Computatonal Semantcs

– Spring: Ed Hovy, Teruko Mitamura

slide-33
SLIDE 33

More NLP Courses

  • 11-747 Neural Networks for NLP

– Spring: Graham Neubig

  • 11-830 Computatonal Ethics for NLP

– Spring: Yulia Tsvetkov, Alan W Black

  • 11-777 Advanced Multmodal ML

– Fall: Louis-Philippe Morency – Visual, Gesture, Speech

  • Most Neural Net Classing

– Always involve NLP