Shallow Text Generation Stephan Busemann DFKI GmbH - PDF document

Shallow Text Generation Stephan Busemann DFKI GmbH Stuhlsatzenhausweg 3 D-66123 Saarbrücken Stephan.Busemann@dfki.de http://www.dfki.de/~busemann Application Systems for NLG Must be Developed Quickly and in a User-Oriented Way • Requirements placed by the application – on the user: recognize and articulate needs – on the developer: make herself acquainted with the domain – on both: create and adapt a corpus of sample target texts • Requirements wrt the software – Adaptability to new tasks and domains – Scalability (low costs of the next rule) – Modularisation (interpreter, daten, knowledge, interfaces) High efficiency of development is difficult to achieve with traditional approaches to language generation Language Technology I, WS 2011/2012, 2 Source: Stephan Busemann 1

Non-Trivial Generation Systems are Expensive to Adapt to New Domains and Tasks • Examples – KPML (Bateman 1997), systemic grammars, development environment – FUF/Surge (Elhadad/Robin 1992), functional unification grammar, interpreter • Features – large multi-lingual systems – detailed, monolingual semantic representations as input – broad coverage of linguistic phenomena (goal: the more, the better) • Effort for adaptation – Rich interface to the input language of the system (logical form, SPL) – Generation of sentences reflecting the distinctions covered The excellent scope of services of generic resources can often not be utilised in practice Source: Stephan Busemann Language Technology I, WS 2011/2012, 3 Deep* vs Shallow NLG * This differs from the Chomskyan distinction between deep and surface structrure, which is sometimes used to characterize deep and surface generation • Deep generation – knowledge-based (models of the domain, of the author and the addressees, of the language(s) involved) – theoretically motivated, aiming at generic, re-usable technology – unresolved issue of general system architecture • Shallow generation – opportunistic modelling of relevant aspects of the application – diverse depth of modelling, as required by the application – some methods viewed as „short cuts“ for unsolved questions of deep generation Shallow generation can be defined in analogy to shallow analysis Language Technology I, WS 2011/2012, 4 Source: Stephan Busemann 2

There is a Smooth Transition Between Shallow and Deep Methods • Prefabricated texts shallow • „Fill in the slots“ • with flexible templates • with aggregation • with sentence planning • with document planning deep Source: Stephan Busemann Language Technology I, WS 2011/2012, 5 Shallow Architectures Have a Simple Task Structure “Deep” model with interaction „Shallow“ Model (cf. Reiter/Dale 2000) (Busemann/Horacek 1998) Content Determination Content Determination Discourse Planning Sentence Aggregation Text Organisation (Aggregation) Lexicalisation Generation of Referring Expressions Mapping Onto Linguistic Structures Surface Realisation Language Technology I, WS 2011/2012, 6 Source: Stephan Busemann 3

Overview • Motivation • The TG/2 Shallow NLG framework • Some major applications for shallow NLG • Assessment and conclusions Source: Stephan Busemann Language Technology I, WS 2011/2012, 7 Input for Air Quality Report Generation [(COOP threshold-passing) (TIME [(PRED season) (NAME [(SEASON summer) (YEAR 1999)])]) (POLLUTANT o3) (SITE "Völklingen-City") (DURATION [(MINUTE 60)]) (SOURCE [(LAW-NAME bimsch) (THRESHOLD-TYPE info-value)]) (EXCEEDS [(STATUS yes) (TIMES 1)])] In summer 1999 at the measuring station of Völklingen-City, the information value for ozone – 180 µg/m³ according to the German decree Bundesimmissions- schutzverordnung – was exceeded once during a period of 60 minutes. Language Technology I, WS 2011/2012, 8 Source: Stephan Busemann 4

Input for Air Quality Report Generation [(COOP threshold-passing) (TIME [(PRED season) (NAME [(SEASON summer) (YEAR 1999)])]) (POLLUTANT o3) (SITE "Völklingen-City") (DURATION [(MINUTE 60)]) (SOURCE [(LAW-NAME bimsch) (THRESHOLD-TYPE info-value)]) (EXCEEDS [(STATUS yes) (TIMES 1)])] Im Sommer 1999 wurde der Informationswert für Ozon an der Messstation Völklingen-City während einer 60-minütigen Einwirkungsdauer (180 µg/m³ nach Bundesimmissionsschutzverordnung) einmal überschritten. Source: Stephan Busemann Language Technology I, WS 2011/2012, 9 Input for Air Quality Report Generation [(COOP threshold-passing) (TIME [(PRED season) (NAME [(SEASON summer) (YEAR 1999)])]) (POLLUTANT o3) (SITE "Völklingen-City") (DURATION [(MINUTE 60)]) (SOURCE [(LAW-NAME bimsch) (THRESHOLD-TYPE info-value)]) (EXCEEDS [(STATUS yes) (TIMES 1)])] En été 1999, à la station de mesure de Völklingen-City, la valeur d'information pour l'ozone pour une exposition de 60 minutes (180 µg/m³ selon le decret allemand (Bundesimmissionsschutzverordnung)) a été dépassée une fois. Language Technology I, WS 2011/2012, 10 Source: Stephan Busemann 5

TG/2 Offers a Flexible Framework for NLG • TG/2 is a transparent production system • TG/2 interprets a separately defined set of condition-action rules • TG/2 maps pieces of input onto surface strings TG/2 keeps grammars largely independent from input representations DECL -> PPTIME THTYPE EXCEEDS (COOP threshold-passing) Test Predicates on properties of the input Input Grammar Rules Access Pointers yielding a part of the Input Source: Stephan Busemann Language Technology I, WS 2011/2012, 11 TG/2 Grammars Integrate Canned Texts, Templates and Context-free Rules My category is DECL. (Busemann 1996) IF the slot COOP is 'threshold-passing En été 1999 AND the slot LAW-NAME is specified la valeur limite autorisée THEN apply PPtime from slot TIME ( apply THTYPE from CURRENT-INPUT utter "(" selon le decret ... apply LAW from slot LAW-NAME ) utter ") " a été dépassée une fois apply EXCEEDS from slot EXCEEDS . utter "." WHERE THTYPE AND EXCEEDS agree in GENDER My category is THTYPE. IF there is no slot THRESHOLD-TYPE specified THEN utter "la valeur limite autoris&e2e " WHERE THTYPE has value 'fem for GENDER Language Technology I, WS 2011/2012, 12 Source: Stephan Busemann 6

TG/2 Grammars Integrate Canned Texts, Templates and Context-free Rules My category is DECL. (Busemann 1996) IF the slot COOP is 'threshold-passing En été 1999 AND the slot LAW-NAME is specified la valeur limite autorisée THEN apply PPtime from slot TIME apply THTYPE from CURRENT-INPUT ( utter "(" selon le decret ... apply LAW from slot LAW-NAME ) utter ") " a été dépassée une fois apply EXCEEDS from slot EXCEEDS . utter "." WHERE THTYPE AND EXCEEDS agree in GENDER My category is THTYPE. IF there is no slot THRESHOLD-TYPE specified THEN utter "la valeur limite autoris&e2e " WHERE THTYPE has value 'fem for GENDER Source: Stephan Busemann Language Technology I, WS 2011/2012, 13 TG/2 Grammars Integrate Canned Texts, Templates and Context-free Rules My category is DECL. (Busemann 1996) IF the slot COOP is 'threshold-passing En été 1999 AND the slot LAW-NAME is specified la valeur limite autorisée THEN apply PPtime from slot TIME ( apply THTYPE from CURRENT-INPUT utter "(" selon le decret ... apply LAW from slot LAW-NAME ) utter ") " a été dépassée une fois apply EXCEEDS from slot EXCEEDS . utter "." WHERE THTYPE AND EXCEEDS agree in GENDER My category is THTYPE. IF there is no slot THRESHOLD-TYPE specified THEN utter "la valeur limite autoris&e2e " WHERE THTYPE has value 'fem for GENDER Language Technology I, WS 2011/2012, 14 Source: Stephan Busemann 7

TG/2 Grammars Integrate Canned Texts, Templates and Context-free Rules My category is DECL. (Busemann 1996) IF the slot COOP is 'threshold-passing En été 1999 AND the slot LAW-NAME is specified la valeur limite autorisée THEN apply PPtime from slot TIME apply THTYPE from CURRENT-INPUT ( utter "(" selon le decret ... apply LAW from slot LAW-NAME ) utter ") " a été dépassée une fois apply EXCEEDS from slot EXCEEDS . utter "." WHERE THTYPE AND EXCEEDS agree in GENDER My category is THTYPE. IF there is no slot THRESHOLD-TYPE specified THEN utter "la valeur limite autoris&e2e " WHERE THTYPE has value 'fem for GENDER Source: Stephan Busemann Language Technology I, WS 2011/2012, 15 Constraints are Percolated Across the Derivation Tree • Feature unification (  ) at tree nodes • Every tree of depth 1 is licensed by a grammar rule • A feature can be assigned a value ( := ) • Two features can be constrained to have identical values ( = ) (X1.GENDER = X2.GENDER)     X1 X2     (X0.GENDER (X0.GENDER X0 X0 = X2.Gender) := fem) X2 X1 inflect(dépassé) “la valeur limite “ Language Technology I, WS 2011/2012, 16 Source: Stephan Busemann 8

Shallow Text Generation Stephan Busemann DFKI GmbH - PDF document

Shallow Text Generation Stephan Busemann DFKI GmbH Stuhlsatzenhausweg 3 D-66123 Saarbrcken Stephan.Busemann@dfki.de http://www.dfki.de/~busemann Application Systems for NLG Must be Developed Quickly and in a User-Oriented Way

10 slides that always work Simple text boxes (I) Sample text Sample text Sample text

The Problem of Shallow Worship The Book of Malachi Here is some test text Here is some test text

CONTENT TITLE Insert Subtitle Here Enter Text Here Enter Text Here Enter Text Here

GEOTHERMAL SYSTEMS AND TECHNOLOGIES 5. SHALLOW GEOTHERMAL SYSTEMS 5. SHALLOW GEOTHERMAL SYSTEMS

Post-Conference Presentation Sunday Oladayo Oladejo Table of Content A Introduction B

Enhancing ICANN Text Accountability 26 June 2014 Text #ICANN50 Text #ICANN50 Text #ICANN50

Add Your Title Here Replace your text here! Replace your text here! Insert your title here 1

Text Text #ICANN51 15 October 2014 Text Text IDN Root Zone LGR Sarmad Hussain IDN Program

Text Text #ICANN51 Contractual Compliance Text Text Contractual Compliance Update

Text Text #ICANN50 Contractual Compliance Text Text GNSO Council Meeting Wednesday, Jun 25

GANocracy Outline Background: Text Generation Latent-Variable Generation Learning

God Rescues Daniel from the Lions Daniel 6 Here is some test text Here is some test text Here

5. Text CHAPTER HIGHLIGHTS Text tradition. Codes for computer text. C d f t t t

Stack Stack Heap Heap Data Data Text Text Program A Program B Stack Stack Text Heap

Business Proposal Infographic Style Your Text Here Your Text Here Your Text Here Your Text

How to Stay Faithful in Exile Daniel 1 Here is some test text Here is some test text Here is

adaptability of RelaxAJ and expressiveness of StrongAspectJ Tomoyuki Aotani Manabu Touyama and

Implementing & Operating a Coordinated Entry System Project May 6 th , 2020 Abby Miller, HUD

Self-healing systems What are they? Tiina Niklander Seminar introduction, 2007 Earlier

Using Bittorrent and SVC for Efficient Video Sharing and Streaming Abdelhalim Amer 1 Toufik Ahmed

Lenia: Biology of Artificial Life Bert CHAN Wang-Chak Independent Researcher, Hong Kong 4:30pm,

Autonomic Systems Autonomic Systems Autonomic : adaptive : adaptive Autonomic Self

Evolutionary Architectures @neal4d nealford.com with Rebecca Parsons & Pat Kua 1 Rebecca

Definition of Immunity Protection against disease, usually Introduction to Immunology

Sambuz

Useful Links

Newsletter

Mail Us

Shallow Text Generation Stephan Busemann DFKI GmbH - PDF document

Shallow Text Generation Stephan Busemann DFKI GmbH Stuhlsatzenhausweg 3 D-66123 Saarbrcken Stephan.Busemann@dfki.de http://www.dfki.de/~busemann Application Systems for NLG Must be Developed Quickly and in a User-Oriented Way

10 slides that always work Simple text boxes (I) Sample text Sample text Sample text

The Problem of Shallow Worship The Book of Malachi Here is some test text Here is some test text

CONTENT TITLE Insert Subtitle Here Enter Text Here Enter Text Here Enter Text Here

GEOTHERMAL SYSTEMS AND TECHNOLOGIES 5. SHALLOW GEOTHERMAL SYSTEMS 5. SHALLOW GEOTHERMAL SYSTEMS

Post-Conference Presentation Sunday Oladayo Oladejo Table of Content A Introduction B

Enhancing ICANN Text Accountability 26 June 2014 Text #ICANN50 Text #ICANN50 Text #ICANN50

Add Your Title Here Replace your text here! Replace your text here! Insert your title here 1

Text Text #ICANN51 15 October 2014 Text Text IDN Root Zone LGR Sarmad Hussain IDN Program

Text Text #ICANN51 Contractual Compliance Text Text Contractual Compliance Update

Text Text #ICANN50 Contractual Compliance Text Text GNSO Council Meeting Wednesday, Jun 25

GANocracy Outline Background: Text Generation Latent-Variable Generation Learning

God Rescues Daniel from the Lions Daniel 6 Here is some test text Here is some test text Here

5. Text CHAPTER HIGHLIGHTS Text tradition. Codes for computer text. C d f t t t

Stack Stack Heap Heap Data Data Text Text Program A Program B Stack Stack Text Heap

Business Proposal Infographic Style Your Text Here Your Text Here Your Text Here Your Text

How to Stay Faithful in Exile Daniel 1 Here is some test text Here is some test text Here is

adaptability of RelaxAJ and expressiveness of StrongAspectJ Tomoyuki Aotani Manabu Touyama and

Implementing &amp; Operating a Coordinated Entry System Project May 6 th , 2020 Abby Miller, HUD

Self-healing systems What are they? Tiina Niklander Seminar introduction, 2007 Earlier

Using Bittorrent and SVC for Efficient Video Sharing and Streaming Abdelhalim Amer 1 Toufik Ahmed

Lenia: Biology of Artificial Life Bert CHAN Wang-Chak Independent Researcher, Hong Kong 4:30pm,

Autonomic Systems Autonomic Systems Autonomic : adaptive : adaptive Autonomic Self

Evolutionary Architectures @neal4d nealford.com with Rebecca Parsons &amp; Pat Kua 1 Rebecca

Definition of Immunity Protection against disease, usually Introduction to Immunology

Sambuz

Useful Links

Newsletter

Mail Us

Implementing & Operating a Coordinated Entry System Project May 6 th , 2020 Abby Miller, HUD

Evolutionary Architectures @neal4d nealford.com with Rebecca Parsons & Pat Kua 1 Rebecca