Content & Context in Argumentative Relation Classification - - PowerPoint PPT Presentation

content context
SMART_READER_LITE
LIVE PREVIEW

Content & Context in Argumentative Relation Classification - - PowerPoint PPT Presentation

Content & Context in Argumentative Relation Classification ------- ArgMining 2019 Juri Opitz & Anette Frank Heidelberg University 1 Argumentative Relation Classification Marijuana should be legalized. con pro


slide-1
SLIDE 1

Content & Context

in Argumentative Relation Classification

  • ArgMining 2019

Juri Opitz & Anette Frank Heidelberg University

1

slide-2
SLIDE 2

Argumentative Relation Classification

Legalizing marijuana can increase use by teens, with harmful results. Legalization allows the government to set age-restrictions on buyers. Marijuana should be legalized. “attack” “con” “pro” However, Admittedly, On the other hand, ...

2

slide-3
SLIDE 3

Intermediate insight

  • in some cases, inspection of shallow discourse clues can help predict

argumentative relations with high accuracy

3

slide-4
SLIDE 4

(....)

AU-1 Moreover, AU-2

(....) AU-2 supports AU-1 (....)

AU-1

(....) (....)

Moreover, AU-2

(....) AU-2 supports AU-1 single

  • doc

multi- doc

4

slide-5
SLIDE 5

Research questions

  • we want to investigate to what extent systems rely on shallow discourse clues
  • where do we stand in content-based argumentative relation classification?

○ necessary for large scale cross-document argumentative relation mining ■ argumentative units for many debates can be mined from millions of documents scattered across the www ■ to assess relations between them we cannot rely on discourse clues but need systems which learn the content/meaning of argumentative units

5

slide-6
SLIDE 6

Methodology

1. we replicate a competitive argumentative relation classifier: SVM (Stab and Gurevych, 2017) with

i. discourse features ii. sentiment features iii. bag-of-word features iv. bag-of-production-rule features v. GloVe features vi. structural features

2. we extract these features from different spans

a. features extracted from the argumentative unit span (“content”) b. features extracted from the unit’s embedding context (“context”) c. features extracted from both (“full-access”)

6

slide-7
SLIDE 7

Data

  • 402 Student essays (Stab and Gurevych, 2017)
  • annotated with argumentative units and more than 3,000 relations
  • class distribution: ca. 10% ‘attack’, ca. 90% ‘support’
  • annotated unit spans correspond to argumentative clauses

○ “On the one hand, [AU: Legalization can increase use by teens, with harmful effects]” “context features” “content features” “full access”

7

slide-8
SLIDE 8

AU-1 AU-2 CTX AU-1 AU-2 CTX

8

slide-9
SLIDE 9

F1 Results: Attack vs. Support

14.5 pp

20.2 pp

9

majority

slide-10
SLIDE 10

F1 Results: Attack vs. Support vs. Neither

24.9 pp

15.9 pp

11.5 pp

10

majority

slide-11
SLIDE 11

Is everything lost for ?

  • No!

○ still outperforms majority baseline by a good margin ■ +9.5 pp. macro F1 in support vs. attack ■ +10.5 pp. macro F1 in support attack vs. neither

11

slide-12
SLIDE 12

Cross document potential of , and

To investigate, how the three systems port to a cross document scenario, we conduct two simulation studies:

  • random context: we shuffle the contexts of testing instances to simulate

porting to open world where AUs may appear in arbitrary contexts

  • no context: we mask the contexts of testing instances for all three systems

12

slide-13
SLIDE 13

No-context

“On the one hand, [AU: Legalization can increase use by teens, with harmful effects]”

13

<MASK>

slide-14
SLIDE 14

Randomized context

“On the one hand, [AU: Legalization can increase use by teens, with harmful effects]”

14

Moreover, However, Therefore,

slide-15
SLIDE 15

Macro F1 Results

15

“bar > 0: content-based is better”

slide-16
SLIDE 16

Results

  • we see the reverse picture:

○ models which access context (full-feature model and context-only model) fall behind the content-based system

16

Problem Context can be exploited by systems in single-document contexts but can lead to confusion when discourse markers are missing

  • r cannot be trusted (cross-document)

Insight Context-focused systems are not safe for porting to cross-document scenarios Recommendation Develop content-based systems for cross-document scenarios

slide-17
SLIDE 17

Take-Aways

We have shown that

  • shallow discourse clues are very strong indicators for argumentative relations
  • a very naive system that only sees context can strongly outperform a system

which sees the content and also outperforms a system which sees everything

17

Insight 1 Good scores may not reflect capacity to model argumentative content Insight 2 Argumentative relation classification needs better modeling of content

slide-18
SLIDE 18

Conclusions

Need work towards content-based argumentative relation classification

○ to address large scale argumentative relation mining across document boundaries ○ Student essay data can serve as a first benchmark ■ task: predict relations based on the content of argumentative units, mask context ○ Our results may serve as a baseline

18

slide-19
SLIDE 19

Thank you for your attention!

19