
Basics in Language and Probability

Philipp Koehn 3 September 2020


Quotes

  • "It must be recognized that the notion 'probability of a sentence' is an entirely useless one, under any known interpretation of this term." (Noam Chomsky, 1969)
  • "Whenever I fire a linguist our system performance improves." (Frederick Jelinek, 1988)


Conflicts?

  • rationalist vs. empiricist
  • scientist vs. engineer
  • insight vs. data analysis
  • explaining language vs. building applications


language


A Naive View of Language

  • Language needs to name

– nouns: objects in the world (dog)
– verbs: actions (jump)
– adjectives and adverbs: properties of objects and actions (brown, quickly)

  • Relationships between these have to be specified

– word order
– morphology
– function words


A Bag of Words

dog fox lazy jump brown quick


Relationships

dog fox lazy jump brown quick


Marking of Relationships: Word Order

dog fox lazy jump brown quick

quick brown fox jump lazy dog


Marking of Relationships: Function Words

dog fox lazy jump brown quick

quick brown fox jump over lazy dog


Marking of Relationships: Morphology

dog fox lazy jump brown quick

quick brown fox jumps over lazy dog


Some Nuance

dog fox lazy jump brown quick

the quick brown fox jumps over the lazy dog


Marking of Relationships: Agreement

  • From Catullus, First Book, first verse (Latin):

Cui dono lepidum novum libellum arida modo pumice expolitum?
Whom I-present lovely new little-book dry manner pumice polished?

(To whom do I present this lovely new little book now polished with a dry pumice?)

  • Gender (and case) agreement links adjectives to nouns


Marking of Relationships to Verb: Case

  • German:

Die Frau gibt dem Mann den Apfel
The woman gives the man the apple
(Die Frau: subject, dem Mann: indirect object)

Der Frau gibt der Mann den Apfel
The woman gives the man the apple
(Der Frau: indirect object, der Mann: subject)

  • Case inflection indicates role of noun phrases


Case Morphology vs. Prepositions

  • Two different word orderings for English:

– The woman gives the man the apple
– The woman gives the apple to the man

  • Japanese:

woman SUBJ man OBJ apple OBJ2 gives

  • Is there a real difference between prepositions and noun phrase case inflection?


Words

This is a simple sentence
(annotation layer: WORDS)


Morphology

This is a simple sentence
is → be + 3sg + present
(annotation layers: WORDS, MORPHOLOGY)


Parts of Speech

This/DT is/VBZ a/DT simple/JJ sentence/NN
is → be + 3sg + present
(annotation layers: WORDS, MORPHOLOGY, PART OF SPEECH)


Syntax

This/DT is/VBZ a/DT simple/JJ sentence/NN
is → be + 3sg + present
Parse: (S (NP This) (VP is (NP a simple sentence)))
(annotation layers: WORDS, MORPHOLOGY, PART OF SPEECH, SYNTAX)


Semantics

This/DT is/VBZ a/DT simple/JJ sentence/NN
is → be + 3sg + present
Parse: (S (NP This) (VP is (NP a simple sentence)))
Word senses: SENTENCE1 = string of words satisfying the grammatical rules of a language; SIMPLE1 = having few parts
(annotation layers: WORDS, MORPHOLOGY, PART OF SPEECH, SYNTAX, SEMANTICS)


Discourse

This/DT is/VBZ a/DT simple/JJ sentence/NN
is → be + 3sg + present
Parse: (S (NP This) (VP is (NP a simple sentence)))
Word senses: SENTENCE1 = string of words satisfying the grammatical rules of a language; SIMPLE1 = having few parts

But it is an instructive one.

Discourse relation between the two sentences: CONTRAST
(annotation layers: WORDS, MORPHOLOGY, PART OF SPEECH, SYNTAX, SEMANTICS, DISCOURSE)


Why is Language Hard?

  • Ambiguities on many levels
  • Rules, but many exceptions
  • No clear understanding of how humans process language
  • Can we learn everything about language by automatic data analysis?


data


Data: Words

  • Definition: strings of letters separated by spaces
  • But how about:

– punctuation: commas, periods, etc. typically separated (tokenization)
– hyphens: high-risk
– clitics: Joe’s
– compounds: website, Computerlinguistikvorlesung

  • And what if there are no spaces:

伦敦每日快报指出,两台记载黛安娜王妃一九九七年巴黎 死亡车祸调查资料的手提电脑,被从前大都会警察总长的 办公室里偷走.
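As a hedged illustration of the tokenization issues above (not the actual tool used for the lecture's corpora), a few regular expressions go a long way for English-like text; whitespace-free scripts such as the Chinese example would instead need a dedicated word segmenter, which is not attempted here. The function name tokenize is just a placeholder.

    import re

    def tokenize(text):
        # separate common punctuation marks from adjacent words
        text = re.sub(r'([.,!?;:()"])', r' \1 ', text)
        # split off the English clitic 's (a crude heuristic)
        text = re.sub(r"'s\b", " 's", text)
        # hyphenated words like "high-risk" are kept as single tokens here
        return text.split()

    print(tokenize("Joe's dog is a high-risk case, isn't it?"))
    # ['Joe', "'s", 'dog', 'is', 'a', 'high-risk', 'case', ',', "isn't", 'it', '?']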


Word Counts

Most frequent words in the English Europarl corpus:

Any word:
  Frequency in text   Token
  1,929,379           the
  1,297,736           ,
  956,902             .
  901,174             of
  841,661             to
  684,869             and
  582,592             in
  452,491             that
  424,895             is
  424,552             a

Nouns only:
  Frequency in text   Content word
  129,851             European
  110,072             Mr
  98,073              commission
  71,111              president
  67,518              parliament
  64,620              union
  58,506              report
  57,490              council
  54,079              states
  49,965              member
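Counts like these can be reproduced in a few lines; a minimal sketch, assuming the corpus has already been tokenized into a list of tokens (Europarl itself is not bundled here, so a placeholder sentence stands in):

    from collections import Counter

    # placeholder for a tokenized corpus such as Europarl
    tokens = "the european parliament is in session , said the president .".split()

    counts = Counter(tokens)
    for word, freq in counts.most_common(5):
        print(f"{freq:10,d}  {word}")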


Word Counts

But also: There is a large tail of words that occur only once.

33,447 words occur once, for instance

  • cornflakes
  • mathematicians
  • Tazhikhistan


Zipf’s law

f × r = k

f = frequency of a word
r = rank of a word (if sorted by frequency)
k = a constant
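A quick empirical check, sketched under the assumption that a tokenized corpus is available: rank words by frequency and inspect f × r, which stays roughly constant if Zipf's law holds.

    from collections import Counter

    def zipf_check(tokens, top=20):
        # rank 1 = most frequent word
        ranked = Counter(tokens).most_common(top)
        for rank, (word, freq) in enumerate(ranked, start=1):
            print(f"rank {rank:3d}  f = {freq:8d}  f*r = {freq * rank:8d}  {word}")

    # usage (path is hypothetical): zipf_check(open("corpus.tok").read().split())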


Zipf’s law as a graph

[Figure: log-log plot of word frequency against rank, which forms a straight line]

Why a line in log-scale?

f × r = k  ⇒  f = k / r  ⇒  log f = log k − log r


statistics


Probabilities

  • Given word counts we can estimate a probability distribution:

p(w) = count(w) / Σw′ count(w′)
  • This type of estimation is called maximum likelihood estimation. Why? We will get to that later.

  • Estimating probabilities based on frequencies is called the frequentist approach to probability.

  • This probability distribution answers the question: if we randomly pick a word out of a text, how likely is it to be the word w?
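A minimal sketch of this maximum likelihood estimate in code, using a toy corpus in place of real data:

    from collections import Counter

    tokens = "the quick brown fox jumps over the lazy dog".split()
    counts = Counter(tokens)
    total = sum(counts.values())

    # p(w) = count(w) / sum over w' of count(w')
    p = {w: c / total for w, c in counts.items()}

    print(p["the"])         # 2/9 ≈ 0.222
    print(sum(p.values()))  # 1.0: a proper probability distribution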


A Bit More Formal

  • We introduce a random variable W.
  • We define a probability distribution p that tells us how likely it is that the variable W is the word w:

prob(W = w) = p(w)


Joint Probabilities

  • Sometimes, we want to deal with two random variables at the same time.
  • Example: Words w1 and w2 that occur in sequence (a bigram)

We model this with the distribution: p(w1, w2)

  • If the occurrence of words in bigrams is independent, we can reduce this to p(w1, w2) = p(w1) p(w2). Intuitively, this is not the case for word bigrams.

  • We can estimate joint probabilities over two variables the same way we estimated the probability distribution over a single variable:

p(w1, w2) = count(w1, w2) / Σw1′,w2′ count(w1′, w2′)
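The bigram case as a sketch: count adjacent word pairs and normalize by the total number of pairs. On real text, comparing p(w1, w2) with p(w1) p(w2) shows that the independence assumption does not hold; the toy sentence below is only there to make the code runnable.

    from collections import Counter

    tokens = "the quick brown fox jumps over the lazy dog".split()

    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))

    n_uni = sum(unigrams.values())
    n_bi = sum(bigrams.values())

    def p(w):
        return unigrams[w] / n_uni

    def p_joint(w1, w2):
        # p(w1, w2) = count(w1, w2) / sum of all bigram counts
        return bigrams[(w1, w2)] / n_bi

    # joint probability vs. the product of marginals assumed under independence
    print(p_joint("the", "quick"), p("the") * p("quick"))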


Conditional Probabilities

  • Another useful concept is conditional probability

p(w2|w1)

It answers the question: if the random variable W1 = w1, how likely is it that the second random variable W2 = w2?

  • Mathematically, we can define conditional probability as

p(w2|w1) = p(w1, w2) / p(w1)

  • If W1 and W2 are independent: p(w2|w1) = p(w2)
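Continuing the same sketch, the conditional probability can be estimated as p(w2|w1) = count(w1, w2) / count(w1, ·), the bigram counterpart of the definition above:

    from collections import Counter

    tokens = "the quick brown fox jumps over the lazy dog".split()
    bigrams = Counter(zip(tokens, tokens[1:]))

    # count(w1, ·): how often w1 occurs as the first word of a bigram
    first = Counter(w1 for (w1, _w2) in bigrams.elements())

    def p_cond(w2, w1):
        # p(w2 | w1) = count(w1, w2) / count(w1, ·)
        return bigrams[(w1, w2)] / first[w1]

    print(p_cond("quick", "the"))  # 0.5: "the" is followed by "quick" and by "lazy"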


Chain Rule

  • A bit of math gives us the chain rule:

p(w2|w1) = p(w1, w2) / p(w1)

⇒ p(w1) p(w2|w1) = p(w1, w2)

  • What if we want to break down large joint probabilities like p(w1, w2, w3)?

We can repeatedly apply the chain rule: p(w1, w2, w3) = p(w1) p(w2|w1) p(w3|w1, w2)
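A small numeric check of the chain rule with made-up probabilities (not estimated from any corpus):

    # assumed toy values, chosen only to keep the arithmetic transparent
    p_w1 = 0.2              # p(w1)
    p_w2_given_w1 = 0.5     # p(w2 | w1)
    p_w3_given_w1w2 = 0.1   # p(w3 | w1, w2)

    # chain rule: p(w1, w2, w3) = p(w1) p(w2|w1) p(w3|w1, w2)
    p_joint = p_w1 * p_w2_given_w1 * p_w3_given_w1w2
    print(p_joint)  # 0.01

    # and back again: p(w2|w1) = p(w1, w2) / p(w1)
    p_w1w2 = p_w1 * p_w2_given_w1
    print(p_w1w2 / p_w1)  # 0.5, recovering p(w2|w1)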


Bayes Rule

  • Finally, another important rule: Bayes rule

p(x|y) = p(y|x) p(x) / p(y)

  • It can easily be derived from the chain rule:

p(x, y) = p(x, y)
p(x|y) p(y) = p(y|x) p(x)
p(x|y) = p(y|x) p(x) / p(y)
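A numeric sanity check of Bayes rule with assumed toy values: the marginal p(y) is obtained by summing p(y|x) p(x) over x, and the posterior follows from the formula above.

    # assumed prior over x and conditional probability of the event y=1 given x
    p_x = {0: 0.7, 1: 0.3}
    p_y1_given_x = {0: 0.1, 1: 0.8}

    # marginal: p(y=1) = sum over x of p(y=1|x) p(x)
    p_y1 = sum(p_y1_given_x[x] * p_x[x] for x in p_x)

    # Bayes rule: p(x=1 | y=1) = p(y=1 | x=1) p(x=1) / p(y=1)
    posterior = p_y1_given_x[1] * p_x[1] / p_y1
    print(posterior)  # 0.24 / 0.31 ≈ 0.774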


Expectation

  • We introduced the concept of a random variable X

prob(X = x) = p(x)

  • Example: Roll of a die. There is a 1/6 chance that it will be 1, 2, 3, 4, 5, or 6.

  • We define the expectation E(X) of a random variable as:

E(X) = Σx p(x) · x

  • Roll of a die:

E(X) = 1/6 × 1 + 1/6 × 2 + 1/6 × 3 + 1/6 × 4 + 1/6 × 5 + 1/6 × 6 = 3.5
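The same computation as a minimal code sketch of E(X) = Σx p(x) · x:

    # fair six-sided die: each outcome 1..6 has probability 1/6
    p = {x: 1 / 6 for x in range(1, 7)}

    expectation = sum(prob * x for x, prob in p.items())
    print(expectation)  # 3.5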


Variance

  • Variance is defined as

Var(X) = E((X − E(X))²) = E(X²) − E(X)²

Var(X) = Σx p(x) (x − E(X))²

  • Intuitively, this is a measure of how far events diverge from the mean (expectation)
  • Related to this is the standard deviation, denoted σ:

Var(X) = σ²    E(X) = µ


Variance

  • Roll of a die:

Var(X) = 1/6 (1 − 3.5)² + 1/6 (2 − 3.5)² + 1/6 (3 − 3.5)² + 1/6 (4 − 3.5)² + 1/6 (5 − 3.5)² + 1/6 (6 − 3.5)²
       = 1/6 ((−2.5)² + (−1.5)² + (−0.5)² + 0.5² + 1.5² + 2.5²)
       = 1/6 (6.25 + 2.25 + 0.25 + 0.25 + 2.25 + 6.25)
       = 2.917
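And the matching code sketch for Var(X) = Σx p(x) (x − E(X))², which also checks the E(X²) − E(X)² form:

    p = {x: 1 / 6 for x in range(1, 7)}
    mu = sum(prob * x for x, prob in p.items())  # E(X) = 3.5

    variance = sum(prob * (x - mu) ** 2 for x, prob in p.items())
    print(variance)  # ≈ 2.917

    # equivalently: E(X^2) - E(X)^2
    print(sum(prob * x ** 2 for x, prob in p.items()) - mu ** 2)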


Standard Distributions

  • Uniform: all events equally likely

– ∀x, y: p(x) = p(y)
– example: roll of a die

  • Binomial: a series of trials with only two outcomes

– probability p for each trial, occurrence r out of n times:

  b(r; n, p) = (n choose r) p^r (1 − p)^(n−r)

– example: a number of coin tosses
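A sketch of the binomial probability, e.g. the chance of r heads in n tosses of a coin with head probability p:

    from math import comb

    def binomial(r, n, p):
        # b(r; n, p) = (n choose r) * p^r * (1 - p)^(n - r)
        return comb(n, r) * p ** r * (1 - p) ** (n - r)

    # probability of exactly 3 heads in 10 tosses of a fair coin
    print(binomial(3, 10, 0.5))  # ≈ 0.117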


Standard Distributions

  • Normal: common distribution for continuous values

– value in the range (−∞, ∞), given expectation µ and standard deviation σ:

  n(x; µ, σ) = 1 / (√(2π) σ) · e^(−(x−µ)² / (2σ²))

– also called Bell curve, or Gaussian
– examples: heights of people, IQ of people, tree heights, ...
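The density as a code sketch of n(x; µ, σ) = 1/(√(2π) σ) · e^(−(x−µ)²/(2σ²)):

    from math import exp, pi, sqrt

    def normal_density(x, mu, sigma):
        # 1 / (sqrt(2*pi) * sigma) * exp(-(x - mu)^2 / (2 * sigma^2))
        return exp(-((x - mu) ** 2) / (2 * sigma ** 2)) / (sqrt(2 * pi) * sigma)

    print(normal_density(0.0, 0.0, 1.0))  # ≈ 0.3989, the peak of the standard normal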


Estimation Revisited

  • We introduced previously an estimation of probabilities based on frequencies:

p(w) = count(w) / Σw′ count(w′)
  • Alternative view: Bayesian: what is the most likely model given the data

p(M|D)

  • Model and data are viewed as random variables

– model M as random variable – data D as random variable


Bayesian Estimation

  • Reformulation of p(M|D) using Bayes rule:

p(M|D) = p(D|M) p(M) / p(D)

argmaxM p(M|D) = argmaxM p(D|M) p(M)

  • p(M|D) answers the question: What is the most likely model given the data
  • p(M) is a prior that prefers certain models (e.g. simple models)
  • The frequentist estimation of word probabilities p(w) is the same as Bayesian estimation with a uniform prior (no bias towards a specific model), hence it is also called the maximum likelihood estimation
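A hedged sketch of that last point for the simplest possible model, a single coin with head probability q: with a uniform prior over a grid of candidate values, the model maximizing p(D|M) p(M) is the relative frequency, i.e. the maximum likelihood estimate. The grid search and the data are assumptions made only for illustration.

    # assumed observations: 7 heads out of 10 trials
    heads, n = 7, 10

    def likelihood(q):
        # p(D | M): probability of the observed outcomes given parameter q
        return q ** heads * (1 - q) ** (n - heads)

    # uniform prior over a grid of candidate models q
    grid = [i / 1000 for i in range(1001)]
    prior = 1 / len(grid)

    # with a uniform prior, argmax of p(D|M) p(M) = argmax of the likelihood alone
    best_q = max(grid, key=lambda q: likelihood(q) * prior)
    print(best_q)  # 0.7 = heads / n, the relative-frequency (maximum likelihood) estimate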


Entropy

  • An important concept is entropy:

H(X) = Σx −p(x) log2 p(x)

  • A measure for the degree of disorder
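A small code sketch of the definition, which also reproduces the entropy examples on the following slides:

    from math import log2

    def entropy(dist):
        # H(X) = sum over x of -p(x) * log2 p(x); zero-probability events contribute 0
        return sum(-p * log2(p) for p in dist if p > 0)

    print(entropy([1.0]))                     # 0.0
    print(entropy([0.5, 0.5]))                # 1.0
    print(entropy([0.25] * 4))                # 2.0
    print(entropy([0.7, 0.1, 0.1, 0.1]))      # ≈ 1.35678
    print(entropy([0.97, 0.01, 0.01, 0.01]))  # ≈ 0.24194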


Entropy Example

One event: p(a) = 1

H(X) = −1 log2 1 = 0


Entropy Example

2 equally likely events: p(a) = 0.5, p(b) = 0.5

H(X) = −0.5 log2 0.5 − 0.5 log2 0.5 = −log2 0.5 = 1


Entropy Example

4 equally likely events: p(a) = 0.25, p(b) = 0.25, p(c) = 0.25, p(d) = 0.25

H(X) = −0.25 log2 0.25 − 0.25 log2 0.25 − 0.25 log2 0.25 − 0.25 log2 0.25 = −log2 0.25 = 2


Entropy Example

4 events, one more likely than the others: p(a) = 0.7, p(b) = 0.1, p(c) = 0.1, p(d) = 0.1

H(X) = −0.7 log2 0.7 − 0.1 log2 0.1 − 0.1 log2 0.1 − 0.1 log2 0.1
     = −0.7 log2 0.7 − 0.3 log2 0.1
     = −0.7 × −0.5146 − 0.3 × −3.3219
     = 0.36020 + 0.99658
     = 1.35678


Entropy Example

4 events, one much more likely than the others: p(a) = 0.97, p(b) = 0.01, p(c) = 0.01, p(d) = 0.01

H(X) = −0.97 log2 0.97 − 0.01 log2 0.01 − 0.01 log2 0.01 − 0.01 log2 0.01
     = −0.97 log2 0.97 − 0.03 log2 0.01
     = −0.97 × −0.04394 − 0.03 × −6.6439
     = 0.04262 + 0.19932
     = 0.24194


Examples

[Figure: example distributions and their entropies: H(X) = 0, 1, 2, 3, 1.35678, 0.24194]


Intuition Behind Entropy

  • A good model has low entropy

→ it is more certain about outcomes

  • For instance a translation table

e     f     p(e|f)
the   der   0.8
that  der   0.2

is better than

e     f     p(e|f)
the   der   0.02
that  der   0.01
...   ...   ...

  • A lot of statistical estimation is about reducing entropy


Information Theory and Entropy

  • Assume that we want to encode a sequence of events X
  • Each event is encoded by a sequence of bits
  • For example

– Coin flip: heads = 0, tails = 1
– 4 equally likely events: a = 00, b = 01, c = 10, d = 11
– 3 events, one more likely than others: a = 0, b = 10, c = 11
– Morse code: e has shorter code than q

  • Average number of bits needed to encode X ≥ entropy of X
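A sketch of this bound for an assumed three-event distribution and a matching prefix code: the average code length in bits is never below the entropy, and equals it when every code length is −log2 p(x).

    from math import log2

    # assumed distribution and prefix code (a = 0, b = 10, c = 11)
    p = {"a": 0.5, "b": 0.25, "c": 0.25}
    code = {"a": "0", "b": "10", "c": "11"}

    entropy = sum(-q * log2(q) for q in p.values())
    avg_bits = sum(p[e] * len(code[e]) for e in p)

    print(entropy, avg_bits)  # 1.5 1.5: average code length >= entropy, with equality here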


The Entropy of English

  • We already talked about the probability of a word p(w)
  • But words come in sequence. Given a number of words in a text, can we guess the next word p(wn|w1, ..., wn−1)?

  • Assuming a model with a limited window size

Model             Entropy
0th order         4.76
1st order         4.03
2nd order         2.8
human, unlimited  1.3


Next Lecture: Language Models

  • Next time, we will expand on the idea of a model of English in the form

p(wn|w1, ..., wn−1)

  • Despite its simplicity, a tremendously useful tool for NLP
  • Nice machine learning challenge

– sparse data
– smoothing
– back-off and interpolation
