Aligning Discourse and Argumentation Structures using Subtrees and - - PowerPoint PPT Presentation

aligning discourse and argumentation structures using
SMART_READER_LITE
LIVE PREVIEW

Aligning Discourse and Argumentation Structures using Subtrees and - - PowerPoint PPT Presentation

. Methodology . . . . . . . . . Introduction Corpus Results . Conclusion Acknowledgments and references Aligning Discourse and Argumentation Structures using Subtrees and Redescription Mining Laurine Huber 1 , Yannick Toussaint 1 ,


slide-1
SLIDE 1

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Aligning Discourse and Argumentation Structures using Subtrees and Redescription Mining

Laurine Huber1, Yannick Toussaint1, Charlotte Roze1, Mathilde Dargnat2 and Chloé Braud1

1LORIA, Université de Lorraine, France 2ATILF, Université de Lorraine, France

ArgMining August 1, 2019

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 1 / 35

slide-2
SLIDE 2

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Discourse structure

▶ Semantic and pragmatic relations between text segments (reason, cause, concession ...) ▶ Rhetorical Structure Theory [Mann and Thompson, 1988] (RST) ▶ Distinction between nucleus and satellite

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 2 / 35

slide-3
SLIDE 3

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Argumentation Structure

▶ Argumentation relations between text segments (support, attack, ...) ▶ Macro-structure of argumentation [Freeman, 2011] ▶ Distinction between premisse and conclusion

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 3 / 35

slide-4
SLIDE 4

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

So what?

Goal: Understand the similarities between discourse and argumentation structures. ▶ Building bridges between theories ▶ Improve Argument Mining systems

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 4 / 35

slide-5
SLIDE 5

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Corpus

▶ ArgMicroTexts corpus [Peldszus and Stede, 2015] * ▶ 112 short argumentative texts ▶ 18 controversial questions ”Should Germany introduce the death penalty?”

1: The death penalty is a legal means that as such is not practicable in Germany. 2: For one thing, inviolable human dignity is anchored in our constitution, 3: and furthermore no one may have the right to adjudicate upon the death of another human being. 4: Even if many people think that a murderer has already decided on the life or death

  • f another person,

5: this is precisely the crime that we should not repay with the same. * available online

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 5 / 35

slide-6
SLIDE 6

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Corpus

▶ ArgMicroTexts corpus [Peldszus and Stede, 2015] * ▶ 112 short argumentative texts ▶ 18 controversial questions ”Should Germany introduce the death penalty?”

1: The death penalty is a legal means that as such is not practicable in Germany. 2: For one thing, inviolable human dignity is anchored in our constitution, 3: and furthermore no one may have the right to adjudicate upon the death of another human being. 4: Even if many people think that a murderer has already decided on the life or death

  • f another person,

5: this is precisely the crime that we should not repay with the same. * available online

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 5 / 35

slide-7
SLIDE 7

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Corpus

▶ ArgMicroTexts corpus [Peldszus and Stede, 2015] * ▶ 112 short argumentative texts ▶ 18 controversial questions ”Should Germany introduce the death penalty?”

1: The death penalty is a legal means that as such is not practicable in Germany. 2: For one thing, inviolable human dignity is anchored in our constitution, 3: and furthermore no one may have the right to adjudicate upon the death of another human being. 4: Even if many people think that a murderer has already decided on the life or death

  • f another person,

5: this is precisely the crime that we should not repay with the same. * available online

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 5 / 35

slide-8
SLIDE 8

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Corpus

▶ ArgMicroTexts corpus [Peldszus and Stede, 2015] * ▶ 112 short argumentative texts ▶ 18 controversial questions ”Should Germany introduce the death penalty?”

1: The death penalty is a legal means that as such is not practicable in Germany. 2: For one thing, inviolable human dignity is anchored in our constitution, 3: and furthermore no one may have the right to adjudicate upon the death of another human being. 4: Even if many people think that a murderer has already decided on the life or death

  • f another person,

5: this is precisely the crime that we should not repay with the same. * available online

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 5 / 35

slide-9
SLIDE 9

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Corpus

▶ Macro-structure of argumentation [Peldszus and Stede, 2016] ▶ RST

▶ (SDRT [Lascarides and Asher, 2007])

[e1] The death penalty is a legal means that as such is not practicable in Germany . [e2] For one thing, inviolable human dignity is anchored in our constitution,

1

[e3] and furthermore no one may have the right to adjudicate upon the death of another human being.

2

[e4] Even if many people think that a murderer has already decided on the life

  • r death of another

person,

3

[e5] this is precisely the crime that we should not repay with the same.

4 5

c6 c7 c9 c8

(a) ARG annotation (b) RST annotation

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 6 / 35

slide-10
SLIDE 10

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Overview of the approach

Goal: can we align ARG and RST at the subtree level ?

  • 1. Representing ARG and RST structures as trees
  • 2. Building two descriptions of each text

▶ ARG and RST descriptions ▶ A description is a set of subtrees

  • 3. Aligning set of subtrees that describe almost the same set of

texts

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 7 / 35

slide-11
SLIDE 11

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Representing ARG and RST structures as trees

Goal: Unify and anonymise the structures. ▶ Transform ARG and RST structures into labeled trees ▶ Keep only structure, no text

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 8 / 35

slide-12
SLIDE 12

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Representing ARG and RST structures as trees

Goal: Unify and anonymise the structures. ▶ Transform ARG and RST structures into labeled trees ▶ Keep only structure, no text

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 8 / 35

slide-13
SLIDE 13

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Representing ARG and RST structures as trees : ARG

[e1] The death penalty is a legal means that as such is not practicable in Germany . [e2] For one thing, inviolable human dignity is anchored in our constitution,

1

[e3] and furthermore no one may have the right to adjudicate upon the death of another human being.

2

[e4] Even if many people think that a murderer has already decided on the life

  • r death of another

person,

3

[e5] this is precisely the crime that we should not repay with the same.

4 5

c6 c7 c9 c8

ARG annotation

CC _ _ und _ _ sup sup reb

ARG tree derivation ▶ Root: central claim ▶ Parent: conclusion ▶ Child: premisse

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 9 / 35

slide-14
SLIDE 14

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Representing ARG and RST structures as trees : RST

RST annotation

CC _ _ conces _ _ conj reason reason

RST tree derivation ▶ Root: most central nucleus ▶ Parent: nucleus ▶ Child: satellite

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 10 / 35

slide-15
SLIDE 15

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Building two descriptions of the corpus

Goal: Produce 2 descriptions of each texts in term of subtrees

  • 1. Extract all subtrees of ARG
  • 2. Extract all subtrees of RST

Frequent subgraph mining: gSpan [Yan and Han, 2002]

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 11 / 35

slide-16
SLIDE 16

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Building two descriptions of the corpus

Goal: Produce 2 descriptions of each texts in term of subtrees

  • 1. Extract all subtrees of ARG
  • 2. Extract all subtrees of RST

Frequent subgraph mining: gSpan [Yan and Han, 2002]

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 11 / 35

slide-17
SLIDE 17

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Building two descriptions of the corpus: subtrees extraction

CC _ _ und _ _ sup sup reb

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 12 / 35

slide-18
SLIDE 18

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Building two descriptions of the corpus: subtrees extraction

CC _ _ und _ _ sup sup reb

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 13 / 35

slide-19
SLIDE 19

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Building two descriptions of the corpus: subtrees extraction

▶ f is the frequency of occurrence of subtrees in the corpus

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 14 / 35

slide-20
SLIDE 20

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Building two descriptions of the corpus: subtrees extraction

▶ keep subtrees with f ≥ 2

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 15 / 35

slide-21
SLIDE 21

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Redescription mining

Goal: Find an ARG description and a RST description that characterize almost the same set of objects ▶ Two difgerent descriptions of the each text

▶ ARG = {a0, a1, ..., a98} ▶ RST = {r0, r1, ..., r311}

▶ A set of objects: a set of texts from the corpus ▶ A text ti is described by

▶ a subset of ARG ▶ a subset of RST

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 16 / 35

slide-22
SLIDE 22

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Redescription mining

Goal: Find an ARG description and a RST description that characterize almost the same set of objects ▶ Two difgerent descriptions of the each text

▶ ARG = {a0, a1, ..., a98} ▶ RST = {r0, r1, ..., r311}

▶ A set of objects: a set of texts from the corpus ▶ A text ti is described by

▶ a subset of ARG ▶ a subset of RST

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 16 / 35

slide-23
SLIDE 23

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Redescription mining

Rd1 : a57 ↔ ∅

CC _ _ sup sup

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 17 / 35

slide-24
SLIDE 24

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Redescription mining

Rd1 : a57 ← → r123

CC _ _ sup sup CC _ _ list reason

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 18 / 35

slide-25
SLIDE 25

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Redescription mining

Rd1 : a57 ← → r123 ∨ r65

CC _ _ sup sup CC _ _ list reason CC _ _ reason reason

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 19 / 35

slide-26
SLIDE 26

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Redescription mining

Rd1 : a57 ← → r123 ∨ r65 ∨ r40

CC _ _ sup sup CC _ _ list reason CC _ _ reason reason CC _ motivation

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 20 / 35

slide-27
SLIDE 27

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Redescription mining

▶ A redescription is pair of queries

▶ qArg a logical formulae over the Arg subtrees ▶ qRst a logical formulae over the Rst subtrees

▶ qArg and qRst should describe almost the same set of texts ▶ ”Almost”: given a similarity threshold calculated with Jaccard index

Jacc(qArg, qRst) = supp(qArg∧qRst)

supp(qArg∨qRst)

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 21 / 35

slide-28
SLIDE 28

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Experiment setup

▶ Algorithm: ReRemi ▶ Conjunctions and disjunctions allowed ▶ Length of the query limited to 4 ▶ Output: 35 redescriptions

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 22 / 35

slide-29
SLIDE 29

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Results

id q1 q2 J(q1,q2) # texts Rd1 a57 r123 ∨ r65 ∨ r40 0.691 54 Rd2 a58 r61 ∨ r119 ∨ r125 0.351 13 Rd3 a23 ∨ a59 r125 0.3 8 3 over 35 obtained redescriptions aX and rX correspond to ARG and RST subtrees respectively.

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 23 / 35

slide-30
SLIDE 30

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Results

Rd1 : a57 ← → r123 ∨ r65 ∨ r40

CC _ _ sup sup CC _ _ list reason CC _ _ reason reason CC _ motivation a57 r123 r65 r40

RST is more fjne grained than ARG

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 24 / 35

slide-31
SLIDE 31

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Well captured information

(a) ARG annotation (b) RST annotation

CC _ _ sup sup CC _ _ reason reason

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 25 / 35

slide-32
SLIDE 32

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Anonymization lead to wrong captured patterns

(a) ARG annotation (b) RST annotation

CC _ _ sup sup CC _ _ reason reason

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 26 / 35

slide-33
SLIDE 33

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Results

Rd2 : a58 ← → r61 ∨ r119 ∨ 125

CC _ _ _ sup sup sup CC _ _ _ conj reason reason CC _ _ joint reason CC _ _ _ list list reason a58 r61 r119 r125

Rd2 is a specialization of Rd1

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 27 / 35

slide-34
SLIDE 34

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Results

Rd3 : a23 ∨ a59 ← → r125

CC _ _ und _ _ sup sup reb CC _ _ _ _ sup sup sup sup CC _ _ _ list list reason a23 a59 r125

2 ̸= ARG representations of the one RST subtree

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 28 / 35

slide-35
SLIDE 35

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Conclusion

▶ Turn a linguistic problem into a Data Mining problem ▶ Systematic, generic and automatic comparison ▶ Understand the links between ̸= theories

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 29 / 35

slide-36
SLIDE 36

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Future work

▶ Take segments into account ▶ Play with parameters of ReReMi ▶ Propose an exhaustive analysis of the redescriptions ▶ Investigate other Data Mining formalisms (e.g. FCA, association rules) ▶ Extend to other formalisms (e.g. SDRT)

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 30 / 35

slide-37
SLIDE 37

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Thank you!

laurine.huber yannick.toussaint charlotte.roze chloe.braud        @ loria . fr mathilde.dargnat @ atilf . fr

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 31 / 35

slide-38
SLIDE 38

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

Acknowledgments

This work was supported partly by the french PIA project ”Lorraine Université d’Excellence“, reference ANR-15-IDEX-04-LUE, and the PEPS blanc from CNRS (INS2I).

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 32 / 35

slide-39
SLIDE 39

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

References I

Freeman, J. B. (2011). Argumentation Structure: Representation and Theory. Springer, Dordrecht. Lascarides, A. and Asher, N. (2007). Segmented Discourse Representation Theory: Dynamic Semantics With Discourse Structure. In Bunt, H. and Muskens, R., editors, Computing Meaning, volume 3. Springer Netherlands, Dordrecht. Mann, W. and Thompson, S. (1988). Rhetorical structure theory: Towards a functional theory of text organization. TEXT, 8:243–281.

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 33 / 35

slide-40
SLIDE 40

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

References II

Peldszus, A. and Stede, M. (2015). An annotated corpus of argumentative microtexts. In Proceedings of the First European Conference on Argumentation: Argumentation and Reasoned Action, volume 2, pages 801–816. Peldszus, A. and Stede, M. (2016). Rhetorical structure and argumentation structure in monologue text. In Proceedings of the Third Workshop on Argument Mining (ArgMining2016), pages 103–112, Berlin, Germany. Association for Computational Linguistics.

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 34 / 35

slide-41
SLIDE 41

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Introduction Corpus Methodology Results Conclusion Acknowledgments and references

References III

Yan, X. and Han, J. (2002). gSpan: graph-based substructure pattern mining. In 2002 IEEE International Conference on Data Mining, 2002. Proceedings., pages 721–724, Maebashi City, Japan. IEEE.

  • L. Huber (LORIA)

Aligning Disc. and Arg. structures 35 / 35