

SLIDE 1

Deep Learning for Natural Language Processing
Subword Representations for Sequence Models

Richard Johansson
richard.johansson@gu.se

SLIDE 2

how can we do part-of-speech tagging with texts like this?

’Twas brillig, and the slithy toves Did gyre and gimble in the wabe; All mimsy were the borogoves, And the mome raths outgrabe.


SLIDE 4

can you find the named entities in this text?

In 1932 , Torkelsson went to Stenköping .

SLIDE 5

can you find the named entities in this text?

In 1932 , Torkelsson went to Stenköping .
(Time: 1932; Person: Torkelsson; Location: Stenköping)

SLIDE 6

using characters to represent words: old-school approach

(Huang et al., 2015)
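
To make “old-school” concrete: before end-to-end character encoders, taggers typically fed hand-crafted spelling features into the model. A minimal Python sketch; the feature set is illustrative, not the exact one from Huang et al. (2015):

def spelling_features(word):
    # hand-crafted character-level features of the kind used in
    # classical feature-based taggers (illustrative, not exhaustive)
    return {
        'is_capitalized': word[:1].isupper(),
        'all_caps': word.isupper(),
        'has_digit': any(c.isdigit() for c in word),
        'has_hyphen': '-' in word,
        'prefix_3': word[:3].lower(),
        'suffix_3': word[-3:].lower(),
    }

spelling_features('Stenköping')
# {'is_capitalized': True, 'all_caps': False, 'has_digit': False,
#  'has_hyphen': False, 'prefix_3': 'ste', 'suffix_3': 'ing'}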

SLIDE 7

using characters to represent words: modern approaches

(Ma and Hovy, 2016) (Lample et al., 2016)
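
The common idea in these papers is to build a word vector from the word’s characters with a small neural encoder: Lample et al. (2016) run a BiLSTM over character embeddings, while Ma and Hovy (2016) use a CNN. A minimal sketch of the BiLSTM variant, assuming PyTorch (all names and sizes are illustrative):

import torch
import torch.nn as nn

class CharBiLSTM(nn.Module):
    """Builds a word representation from its characters (Lample et al.-style)."""
    def __init__(self, n_chars, char_dim=25, hidden_dim=25):
        super().__init__()
        self.emb = nn.Embedding(n_chars, char_dim)
        self.lstm = nn.LSTM(char_dim, hidden_dim,
                            bidirectional=True, batch_first=True)

    def forward(self, char_ids):  # char_ids: (1, word_length)
        _, (h, _) = self.lstm(self.emb(char_ids))
        # concatenate the final forward and backward states -> (1, 2 * hidden_dim)
        return torch.cat([h[0], h[1]], dim=-1)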

SLIDE 8

combining representations ...

◮ we may use a combination of different word representations

from Reimers and Gurevych (2017)
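
In practice, “combining” usually just means concatenating the vectors before they enter the sequence model. A two-line continuation of the illustrative PyTorch sketch above:

# assume: word_embedding = nn.Embedding(vocab_size, 100)
#         char_encoder  = CharBiLSTM(n_chars)   (from the sketch above)
word_vec = torch.cat([word_embedding(word_id), char_encoder(char_ids)], dim=-1)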

SLIDE 9

reducing overfitting and improving generalization

◮ character-based representations allow us to deal with words that we didn’t see in the training set
◮ we can use word dropout to force the model to rely on the character-based representation
◮ for each word in the text, we replace the word with a dummy “unknown” token with a dropout probability p (see the sketch below)
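
A minimal word-dropout sketch, again assuming PyTorch; unk_id and p are placeholders, and the replacement is applied only during training:

def word_dropout(word_ids, unk_id, p=0.1):
    # replace each word id with the "unknown" id with probability p
    mask = torch.rand(word_ids.shape) < p
    return torch.where(mask, torch.full_like(word_ids, unk_id), word_ids)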

SLIDE 10

recap: BERT for different types of tasks

SLIDE 11

recap: sub-word representation in ELMo, BERT, and friends

◮ ELMo uses a CNN over character embeddings
◮ BERT uses WordPiece tokenization

from transformers import BertTokenizer  # assumption: HuggingFace transformers
tokenizer = BertTokenizer.from_pretrained('bert-base-uncased')  # the uncased vocabulary matches the output below
tokenizer.tokenize('In 1932, Torkelsson went to Stenköping.')
# ['in', '1932', ',', 'tor', '##kel', '##sson', 'went', 'to', 'ste', '##nko', '##ping', '.']
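
Note that the out-of-vocabulary name Torkelsson does not break the tokenizer: it is split into pieces (tor, ##kel, ##sson) that are in the vocabulary, so the model never has to handle a completely unknown token.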

SLIDE 12

reading

◮ Eisenstein, chapter 7:

◮ 7.1: sequence labeling as classification
◮ 7.6: neural sequence models

◮ Eisenstein, chapter 8: applications

SLIDE 13

references

  • Z. Huang, W. Xu, and K. Yu. 2015. Bidirectional LSTM-CRF models for sequence tagging. arXiv:1508.01991.

  • G. Lample, M. Ballesteros, S. Subramanian, K. Kawakami, and C. Dyer. 2016. Neural architectures for named entity recognition. In NAACL.

  • X. Ma and E. Hovy. 2016. End-to-end sequence labeling via bi-directional LSTM-CNNs-CRF. In ACL.

  • N. Reimers and I. Gurevych. 2017. Optimal hyperparameters for deep LSTM-networks for sequence labeling tasks. arXiv:1707.06799.