natural language processing csci 4152 6509 lecture 2
play

Natural Language Processing CSCI 4152/6509 Lecture 2 Introduction - PowerPoint PPT Presentation

Natural Language Processing CSCI 4152/6509 Lecture 2 Introduction to Natural Language Processing Instructor: Vlado Keselj Time and date: 09:3510:25, 9-Jan-2020 Location: Dunn 135 CSCI 4152/6509, Vlado Keselj Lecture 2 1 / 8 Previous


  1. Natural Language Processing CSCI 4152/6509 — Lecture 2 Introduction to Natural Language Processing Instructor: Vlado Keselj Time and date: 09:35–10:25, 9-Jan-2020 Location: Dunn 135 CSCI 4152/6509, Vlado Keselj Lecture 2 1 / 8

  2. Previous Lecture Course Introduction ◮ logistics and administrivia ◮ textbook and main references ◮ evaluation scheme ◮ academic integrity policy ◮ tentative course schedule Handouts: Syllabus, A0 CSCI 4152/6509, Vlado Keselj Lecture 2 2 / 8

  3. Introduction to Natural Language Processing Reading: Chapter 1 of Jurafsky and Martin [JM] How to define NLP? 1. Direct definition ◮ What is a natural language? ◮ What are other kinds of languages? 2. NLP applications 3. NLP as a research area CSCI 4152/6509, Vlado Keselj Lecture 2 3 / 8

  4. Some NLP Applications machine translation speech analysis and generation systems spell checking and grammatical correction conversational agents document generation (or computer support in document writing) text classification, summarization, mining information retrieval and information extraction question answering support applications, such as: stemming, POS tagging, semantic tagging, and partial parsing natural language programming code generators, query generators CSCI 4152/6509, Vlado Keselj Lecture 2 4 / 8

  5. NLP as a Research Area relatively old (as old as CS), but still very active can be seen as a part of AI related to several other areas, such as: ◮ Programming and Formal Languages ◮ Information Retrieval ◮ Machine Learning ◮ Text Mining Some important conferences and journals: ◮ ACL — Association of Computational Linguistics, NAACL, EACL, HLT, AAAI, . . . ◮ Computational linguistics, Natural Language Engineering, . . . Check “NLP Research Links” on the course web site Useful research site: http://aclweb.org/anthology-new/ CSCI 4152/6509, Vlado Keselj Lecture 2 5 / 8

  6. Short History of NLP before computers 1947–54 pioneers and foundational insights 1954–66 decade of optimism (“look ma no hands”), two camps: symbolic and stochastic 1966 ALPAC report in US (negative report on MT research) 1980 emergence of various systems and approaches: stochastic paradigm, logic-based, NLU, discourse modeling 1990–2000 stochastic NLP, Web, unification-based grammars 2000– “The rise of Machine Learning” CSCI 4152/6509, Vlado Keselj Lecture 2 6 / 8

  7. NLP Methodology Overview Knowledge-driven and symbolic approaches using 1 crafted rules ◮ older methodology, scalability issues, appropriate for more controlled language formats ◮ example applications: information extraction ◮ methodology: regular expressions, unification-based methods, etc. Data-driven and stochastic approaches using 2 machine learning ◮ newer, scalable, for open-ended applications ◮ example applications: classification, clustering ◮ methodology: probabilistic models, Bayesian classifiers, etc. CSCI 4152/6509, Vlado Keselj Lecture 2 7 / 8

  8. Levels of NLP (started) phonetics: physical sounds 1 phonology: sound system (phonemes) of a spoken 2 language for next class: morphology: word structure 3 syntax: inter-word structure up to sentence 4 structure semantics: meaning up to the sentence level 5 pragmatics: “speaker’s meaning” — extended 6 from the literal sentence meaning discourse: units larger than an utterance (e.g., 7 inter-sentence meaning, references) CSCI 4152/6509, Vlado Keselj Lecture 2 8 / 8

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend