introduction to nlp
play

Introduction to NLP Diyi Yang Some slides borrowed from Yulia - PowerPoint PPT Presentation

CS 4650/7650: Natural Language Processing Introduction to NLP Diyi Yang Some slides borrowed from Yulia Tsvetkov at CMU and Noah Smith at UW 1 Welcome! Website: https://www.cc.gatech.edu/classes/AY2021/cs7650_fall Piazza:


  1. CS 4650/7650: Natural Language Processing Introduction to NLP Diyi Yang Some slides borrowed from Yulia Tsvetkov at CMU and Noah Smith at UW 1

  2. Welcome! Website: https://www.cc.gatech.edu/classes/AY2021/cs7650_fall Piazza: piazza.com/gatech/fall2020/cs7650cs4650 Staff Email List: cs4650-7650-f20-staff@googlegroups.com 2

  3. Welcome! 3

  4. TA Office Hours 4

  5. Hybrid Mode ¡ Lectures Online ¡ Course Materials Online ¡ TA Office Hours Online ¡ Q&A with instructor in person (optional, TBD) 5

  6. Grading ¡ Homework Assignments (55%) ¡ Take-home Midterm (15%) ¡ Project Survey (20%) ¡ Quiz (10%) 6

  7. Late Polices ¡ Late Policy ¡ 5 late days to use over the duration of the semester for homework assignments only. There are no restrictions on how the late days can be used (e.g., all 5 can be used on one homework). Using late days will not affect your grade. But homework submitted late after all 5 late days have been used will receive no credit. ¡ No make-up exam ¡ Unless under emergency situation 7

  8. Survey Paper (Project) ¡ Survey on a NLP topic ¡ 2-3 students per team ¡ 2-page survey proposal (2%) ¡ 8-page final survey report (12%) ¡ Incorporating feedback (6%) 8

  9. Other Information ¡ Course Contacts: ¡ Webpage: materials and announcements ¡ Piazza: discussion forum ¡ Homework questions: Piazza, TAs’ office hours ¡ Computing Resources: ¡ Experiments can take up to hours, even with efficient computation ¡ Recommendation: start assignments early 9

  10. Introduction to NLP 10

  11. Communication With Machines ~50-70s ~80s today 11

  12. 12

  13. Conversational Agents Conversational agents contain: ● Speech recognition ● Language analysis ● Dialogue processing ● Information retrieval ● Text to speech 13

  14. 14

  15. 15

  16. Question Answering ¡ What does “divergent” mean? ¡ What year was Abraham Lincoln born? ¡ How many states were in the United States that year? ¡ How much Chinese silk was exported to England in the end of the 18th century? ¡ What do scientists think about the ethics of human cloning? 16

  17. Machine Translation 17

  18. Natural Language Processing Core Technologies Applications ¡ Language modeling ¡ Machine Translation ¡ Part-of-speech tagging ¡ Information Retrieval ¡ Syntactic parsing ¡ Question Answering ¡ Named-entity recognition ¡ Dialogue Systems ¡ Word sense disambiguation ¡ Information Extraction ¡ Semantic role labeling ¡ Summarization ¡ ... ¡ Sentiment Analysis ¡ ... NLP lies at the intersection of computational linguistics and machine learning. 18

  19. Level Of Linguistic Knowledge 19

  20. Phonetics, Phonology ¡ Pronunciation Modeling 20

  21. Words ¡ Language Modeling ¡ Tokenization ¡ Spelling correction 21

  22. Morphology ¡ Morphology analysis ¡ Tokenization ¡ Lemmatization 22

  23. Part of Speech ¡ Part of speech tagging 23

  24. Syntax ¡ Syntactic parsing 24

  25. Semantics ¡ Named entity recognition ¡ Word sense disambiguation ¡ Semantic role labeling 25

  26. Discourse 26

  27. “ Why Do We Care About This ” 27

  28. Where Are We Now? 28

  29. Where Are We Now? VS 29

  30. Where Are We Now? 30

  31. 31

  32. Why NLP is Hard? Ambiguity 1. Scale 2. Sparsity 3. 4. Variation Expressivity 5. 6. Unmodeled Variables Unknown representations 7. 32

  33. Why NLP is Hard? Ambiguity 1. Scale 2. Sparsity 3. 4. Variation Expressivity 5. 6. Unmodeled Variables Unknown representations 7. 33

  34. Ambiguity ¡ Ambiguity at multiple levels ¡ Word senses: bank (finance or river ?) ¡ Part of speech: chair (noun or verb ?) ¡ Syntactic structure: I can see a man with a telescope ¡ Multiple: I made her duck 34

  35. 35

  36. Ambiguity and Scale 36

  37. The Challenges of “Words” ¡ Segmenting text into words ¡ Morphological variation ¡ Words with multiple meanings: bank, mean ¡ Domain-specific meanings: latex ¡ Multiword expressions: make a decision, take out, make up 37

  38. Part of Speech Tagging 38

  39. Part of Speech Tagging 39

  40. Part of Speech Tagging 40

  41. Syntax 41

  42. Morphology + Syntax A ship-shipping ship, shipping shipping-ships 42

  43. Semantics ¡ Every fifteen minutes a woman in this country gives birth.. Our job is to find this woman, and stop her! – Groucho Marx 43

  44. Semantics ¡ Every fifteen minutes a woman in this country gives birth. Our job is to find this woman, and stop her! – Groucho Marx 44

  45. Syntax + Semantics ¡ We saw the woman with the telescope wrapped in paper. ¡ Who has the telescope? ¡ Who or what is wrapped in paper? ¡ An even of perception, or an assault? 45

  46. Syntax + Semantics ¡ We saw the woman with the telescope wrapped in paper. ¡ Who has the telescope? ¡ Who or what is wrapped in paper? ¡ An even of perception, or an assault? 46

  47. Corpora ¡ A corpus is a collection of text ¡ Often annotated in some way ¡ Sometimes just lots of text ¡ Examples ¡ Penn Treebank: 1M words of parsed WSJ ¡ Canadian Hansards: 10M+ words of French/English sentences ¡ Yelp reviews ¡ The Web! Rosetta Stone 48

  48. Statistical NLP ¡ Like most other parts of AI, NLP is dominated by statistical methods ¡ Typically more robust than rule-based methods ¡ Relevant statistics/probabilities are learned from data ¡ Normally requires lots of data about any particular phenomenon 49

  49. Why NLP is Hard? Ambiguity 1. Scale 2. Sparsity 3. 4. Variation Expressivity 5. 6. Unmodeled Variables Unknown representations 7. 50

  50. Sparsity ¡ Sparse data due to Zipf’s Law ¡ Example: the frequency of different words in a large text corpus 51

  51. Sparsity ¡ Order words by frequency. What is the frequency of nth ranked word? 52

  52. Sparsity ¡ Regardless of how large our corpus is, there will be a lot of infrequent words ¡ This means we need to find clever ways to estimate probabilities for things we have rarely or never seen 53

  53. Why NLP is Hard? Ambiguity 1. Scale 2. Sparsity 3. 4. Variation Expressivity 5. 6. Unmodeled Variables Unknown representations 7. 54

  54. Variation ¡ Suppose we train a part of speech tagger or a parser on the Wall Street Journal ¡ What will happen if we try to use this tagger/parser for social media ? ¡ “ikr smh he asked fir yo last name so he can add u on fb lololol” 55

  55. Variation 56

  56. Why NLP is Hard? Ambiguity 1. Scale 2. Sparsity 3. 4. Variation Expressivity 5. 6. Unmodeled Variables Unknown representations 7. 57

  57. Expressivity ¡ Not only can one form have different meanings (ambiguity) but the same meaning can be expressed with different forms: ¡ She gave the book to Tom vs. She gave Tom the book ¡ Some kids popped by vs. A few children visited ¡ Is that window still open? vs. Please close the window 58

  58. Unmodeled Variables World knowledge I dropped the glass on the floor and it broke I dropped the hammer on the glass and it broke 59

  59. Unmodeled Representation Very difficult to capture what is ! , since we don’t even know how to represent the knowledge a human has/needs: ¡ What is the “meaning” of a word or sentence? ¡ How to model context? ¡ Other general knowledge? 60

  60. Desiderate for NLP Models ¡ Sensitivity to a wide range of phenomena and constraints in human language ¡ Generality across languages, modalities, genres, styles ¡ Strong formal guarantees (e.g., convergence, statistical efficiency, consistency) ¡ High accuracy when judged against expert annotations or test data ¡ Ethical 61

  61. Symbolic and Probabilistic NLP 62

  62. Probabilistic and Connectionist NLP 63

  63. NLP vs. Machine Learning ¡ To be successful, a machine learner needs bias/assumptions; for NLP, that might be linguistic theory/representations. ¡ ! is not directly observable. ¡ Symbolic, probabilistic, and connectionist ML have all seen NLP as a source of inspiring applications. 64

  64. NLP vs. Linguistics ¡ NLP must contend with NL data as found in the world ¡ NLP ≈ computational linguistics ¡ Linguistics has begun to use tools originating in NLP! 65

  65. Fields with Connections to NLP ¡ Machine learning ¡ Deep Learning ¡ Linguistics (including psycho-, socio-, descriptive, and theoretical) ¡ Cognitive science ¡ Information theory ¡ Data science ¡ Political science ¡ Psychology ¡ Economics ¡ Education 66

  66. Today’s Applications ¡ Conversational agents ¡ Information extraction and question answering ¡ Machine translation ¡ Opinion and sentiment analysis ¡ Social media analysis ¡ Visual understanding ¡ Essay evaluation ¡ Mining legal, medical, or scholarly literature 67

  67. Factors Changing NLP Landscape Increases in computing power 1. 2. The rise of the web, then the social web 3. Advances in machine learning 4. Advances in understanding of language in social context 68

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend