

SLIDE 1

Natural Language Generation

Andrea Zugarini

SAILab

December 5th, 2019

SLIDE 2

Natural Language Generation

Natural Language Generation is the problem of generating text automatically. Machine Translation, Text Summarization and Paraphrasing are all instances of NLG. Language generation is a very challenging problem: it not only requires text understanding, but it also involves typical human skills, such as creativity. Word representations and Recurrent Neural Networks (RNNs) are the basic tools for NLG models, usually called end-to-end since they learn directly from data.

SLIDE 3

From Language Modelling to NLG

Recap: given a sequence of words y1, . . . , ym, a language model is characterized by a probability distribution:

P(y1, . . . , ym) = P(ym | y1, . . . , ym−1) · · · P(y2 | y1) P(y1)

that can be equivalently expressed as:

P(y1, . . . , ym) = ∏_{i=1}^{m} P(yi | y<i)

Language Modelling is strictly related to NLG.
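To make the chain-rule factorization concrete (this is not from the slides), here is a minimal sketch that scores a sequence as a sum of per-step log-probabilities; the `next_token_probs` callable is a hypothetical stand-in for any autoregressive language model.

```python
import math
from typing import Callable, Dict, List

def sequence_log_prob(tokens: List[str],
                      next_token_probs: Callable[[List[str]], Dict[str, float]]) -> float:
    """log P(y_1..y_m) = sum_i log P(y_i | y_<i), via the chain rule."""
    log_p = 0.0
    for i, token in enumerate(tokens):
        probs = next_token_probs(tokens[:i])        # distribution over the next token given the prefix
        log_p += math.log(probs.get(token, 1e-12))  # tiny floor avoids log(0) for unseen tokens
    return log_p

# Toy usage with a hypothetical uniform model over a 4-word vocabulary.
vocab = ["the", "cat", "sat", "<eos>"]
uniform = lambda prefix: {w: 1.0 / len(vocab) for w in vocab}
print(sequence_log_prob(["the", "cat", "sat", "<eos>"], uniform))  # 4 * log(0.25)
```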

SLIDE 4

From Language Modelling to NLG

Many NLG problems are conditioned on some given context. In Machine Translation, for instance, the generated text strictly depends on the input text to translate. Hence, we can add another sequence x of length n to condition the probability distribution:

P(y1, . . . , ym | x1, . . . , xn) = ∏_{i=1}^{m} P(yi | y<i, x1, . . . , xn)

obtaining a general formulation for any Language Generation problem.

SLIDE 5

Natural Language Generation

A Machine-Learning model can then be used to learn P(·):

P(y | x, θ) = ∏_{i=1}^{m} P(yi | y<i, x1, . . . , xn, θ)

max_θ P(y | x, θ)

where P(·) is the model parametrized by θ, trained to maximize the likelihood of y on a dataset of (x, y) sequence pairs. Note: when x is empty we fall back to Language Modelling.
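As a rough illustration of this maximum-likelihood objective (not the authors' training code), the PyTorch sketch below computes the per-token negative log-likelihood under teacher forcing; the `model(x, y_in)` interface and the `pad_id` convention are assumptions.

```python
import torch
import torch.nn.functional as F

def nll_loss(logits: torch.Tensor, targets: torch.Tensor, pad_id: int = 0) -> torch.Tensor:
    """Negative log-likelihood of the target tokens under the model's
    per-step distributions P(y_i | y_<i, x, theta) (teacher forcing).

    logits:  (batch, steps, vocab) unnormalized scores
    targets: (batch, steps) gold next-token ids
    """
    log_probs = F.log_softmax(logits, dim=-1)                      # log P over the vocabulary
    token_ll = log_probs.gather(-1, targets.unsqueeze(-1)).squeeze(-1)
    mask = (targets != pad_id).float()                             # ignore padding positions
    return -(token_ll * mask).sum() / mask.sum()                   # minimizing NLL == maximizing likelihood

# Hypothetical usage: `model` maps (x, y_prefix) to logits; the optimizer maximizes the likelihood.
# loss = nll_loss(model(x, y_in), y_out); loss.backward(); optimizer.step()
```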

SLIDE 6

Natural Language Generation

Open-ended vs non open-ended generations

Depending on how much x conditions P, we distinguish between two kinds of text generation:

Open-ended
  ◮ Story Generation
  ◮ Text Continuation
  ◮ Poem Generation
  ◮ Lyrics Generation

Non open-ended
  ◮ Machine Translation
  ◮ Text Summarization
  ◮ Text Paraphrasing
  ◮ Data-to-text generation

There is no neat separation between these kinds of problems.

SLIDE 7

Decoding

Likelihood maximization

Once these models are trained, how do we exploit them at inference time to generate new tokens? Straightforward approach: pick the sequence with maximum probability.

y = arg max_{y1,...,ym} ∏_{i=1}^{m} P(yi | y<i, x1, . . . , xn, θ)

Finding the optimal y is not tractable. Two popular approximate methods are greedy and beam search, both successful in non open-ended domains.
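A minimal sketch of the greedy variant (assuming a hypothetical `step_probs(prefix)` function that returns the next-token distribution; this is an illustration, not the slides' implementation):

```python
def greedy_decode(step_probs, bos="<bos>", eos="<eos>", max_len=50):
    """Greedy decoding: at each step keep only the single most likely token.

    step_probs(prefix) is assumed to return a dict {token: P(token | prefix, x)}.
    """
    seq = [bos]
    for _ in range(max_len):
        probs = step_probs(seq)
        next_tok = max(probs, key=probs.get)   # argmax over the vocabulary
        seq.append(next_tok)
        if next_tok == eos:
            break
    return seq[1:]
```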

SLIDE 9

Decoding

Likelihood maximization

Beam search is a search algorithm that explores k² nodes at each time step and keeps the best k paths. Greedy search is a special case of beam search, where the beam width k is set to 1.
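A small beam-search sketch matching this description: each of the k hypotheses is expanded with its k best continuations (k² candidates per step) and only the k highest-scoring paths survive. The `step_probs` interface is the same hypothetical one used in the greedy sketch.

```python
import math

def beam_search(step_probs, k=3, bos="<bos>", eos="<eos>", max_len=50):
    """Beam search sketch: expand each of the k best partial sequences,
    score candidates by cumulative log-probability, keep the top k.
    Greedy search is recovered with k=1."""
    beams = [([bos], 0.0)]                        # (sequence, cumulative log-prob)
    for _ in range(max_len):
        candidates = []
        for seq, score in beams:
            if seq[-1] == eos:                    # finished hypotheses are carried over unchanged
                candidates.append((seq, score))
                continue
            probs = step_probs(seq)
            # expand only the k most likely continuations of this hypothesis
            for tok, p in sorted(probs.items(), key=lambda kv: kv[1], reverse=True)[:k]:
                candidates.append((seq + [tok], score + math.log(p)))
        beams = sorted(candidates, key=lambda c: c[1], reverse=True)[:k]
        if all(seq[-1] == eos for seq, _ in beams):
            break
    return beams[0][0][1:]                        # best sequence, without <bos>
```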

SLIDE 10

Decoding

Likelihood maximization issues

Unfortunately, likelihood maximization is only effective in non open-ended problems, where there is a strong correlation between input x and output y. In open-ended domains, instead, it leads to repetitive, meaningless generations. To overcome this issue, sampling approaches explore the learnt distribution P more thoroughly.

SLIDE 11

Decoding

Sampling strategies

The most common sampling strategy is multinomial sampling. At each step i, a token yi is sampled from P:

yi ∼ P(yi | y<i, x1, . . . , xn)

The higher P(yi | y<i, x1, . . . , xn), the more likely yi is to be sampled.
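A corresponding sketch of multinomial sampling (again over the same hypothetical `step_probs(prefix)` distribution, not the slides' code): each token is drawn from the full predicted distribution, so even low-probability words are occasionally chosen.

```python
import random

def sample_decode(step_probs, bos="<bos>", eos="<eos>", max_len=50, seed=None):
    """Multinomial sampling: draw y_i ~ P(y_i | y_<i, x) at every step.

    step_probs(prefix) is assumed to return a dict {token: P(token | prefix, x)}.
    """
    rng = random.Random(seed)
    seq = [bos]
    for _ in range(max_len):
        probs = step_probs(seq)
        tokens, weights = zip(*probs.items())
        next_tok = rng.choices(tokens, weights=weights, k=1)[0]   # weighted draw over the vocabulary
        seq.append(next_tok)
        if next_tok == eos:
            break
    return seq[1:]
```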

SLIDE 12

Poem Generation

Project reference sailab.diism.unisi.it/poem-gen/

SLIDE 13

Poem Generation

Poem Generation is an instance of Natural Language Generation (NLG). Goal: design an end-to-end, poet-based poem generator. Issue: a single poet's production is rarely enough to train a neural model. We will describe a general model to learn poet-based poem generators, which we experimented with on Italian poetry.

SLIDE 14

Poem Generation

The text sequence is processed by a recurrent neural network (LSTM), which has to predict the next word at each time step. Note: <EOV> and <EOT> are special tokens indicating the end of a verse and of a tercet, respectively.
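A minimal PyTorch sketch of such a word-level LSTM next-word predictor; the class name, layer sizes and single-layer setup are illustrative assumptions rather than the actual architecture used in the project.

```python
import torch
import torch.nn as nn

class VerseLM(nn.Module):
    """Word-level LSTM language model: embed the tokens (including the special
    <EOV>/<EOT> markers) and predict the next word at every time step."""

    def __init__(self, vocab_size: int, emb_dim: int = 128, hidden_dim: int = 256):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.lstm = nn.LSTM(emb_dim, hidden_dim, batch_first=True)
        self.out = nn.Linear(hidden_dim, vocab_size)

    def forward(self, token_ids: torch.Tensor) -> torch.Tensor:
        # token_ids: (batch, steps) -> logits over the next word: (batch, steps, vocab)
        hidden, _ = self.lstm(self.embed(token_ids))
        return self.out(hidden)
```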

SLIDE 15

Corpora

We considered poetry from Dante and Petrarca.

Divine Comedy
  ◮ 4811 tercets
  ◮ 108k words
  ◮ ABA rhyme scheme (enforced through rule-based post-processing)

Canzoniere
  ◮ 7780 tercets
  ◮ 63k words

Note: roughly 100k words is four orders of magnitude less data than traditional corpora!

SLIDE 16

Let’s look at the Demo: www.dantepetrarca.it
