Factuality Prediction over Unified Datasets Gabriel Stanovsky, - - PowerPoint PPT Presentation

factuality prediction over unified datasets
SMART_READER_LITE
LIVE PREVIEW

Factuality Prediction over Unified Datasets Gabriel Stanovsky, - - PowerPoint PPT Presentation

Factuality Prediction over Unified Datasets Gabriel Stanovsky, Judith Eckle-Kohler, Yevgeniy Puzikov, Ido Dagan and Iryna Gurevych Bar-Ilan University, UKP - TU Darmstadt ACL 2017 Factuality Task Definition Authors commitment towards a


slide-1
SLIDE 1

Factuality Prediction over Unified Datasets

Gabriel Stanovsky, Judith Eckle-Kohler, Yevgeniy Puzikov, Ido Dagan and Iryna Gurevych

Bar-Ilan University, UKP - TU Darmstadt

ACL 2017

slide-2
SLIDE 2

Factuality

Task Definition

Author’s commitment towards a proposition

  • Factual
  • It is not surprising that the Cavaliers lost the championship
  • Uncertain
  • She still has to check whether the experiment succeeded
  • Counter-factual
  • Don was dishonest when he said he paid his taxes
  • Useful for
  • Knowledge base population
  • Question answering
  • Recognizing textual entailment
slide-3
SLIDE 3

In this talk

  • Problem: Limited Generality
  • Previous work focused on specific flavors of factuality
  • Approach
  • Build a unified dataset
  • Train a new model
  • Contributions
  • Normalized annotations
  • Large aggregated corpus
  • Improving performance across datasets
slide-4
SLIDE 4

Problem: Limited Generality

slide-5
SLIDE 5

Datasets

  • Many annotation efforts
  • FactBank (Saur´ı and Pustejovsky, 2009)
  • UW (Lee et al., 2015)
  • Meantime (Minard et al., 2016)
  • … and more
  • Datasets differ in various aspects
  • Discrete vs. continuous values
  • Expert vs. crowdsourced annotation
  • Point of view
slide-6
SLIDE 6

Annotated Examples

FactBank vs. UW

Officials have been careful not to draw any firm conclusions

CT+

  • 1.2

CT-

FactBank UW

Doesn’t annotate adjectival predicates Continuous scale [-3, +3]

slide-7
SLIDE 7

Annotated Examples

FactBank vs. UW

Kavan said the Czech would no longer become “the powerless victim of an invasion.”

3.0 CT+

  • 0.6

CT- 3.0

FactBank UW

8 discrete values Doesn’t annotate hypotheticals Continuous scale [-3, +3] Doesn’t annotate adjectival predicates

slide-8
SLIDE 8

Previous Work: Factuality Prediction

  • Models were designed and evaluated on specific datasets
  • For example, Lee et al. (2015):
  • Used SVM on syntactic features
  • lemma, POS, dependency paths
  • Tested on the UW corpus

 Non-comparable results  Limited portability

slide-9
SLIDE 9

Solution: Unified Corpus Extending TruthTeller Evaluation

slide-10
SLIDE 10

Simple Normalization

  • Mapping discrete values to the continuous UW scale
  • Simple mapping based on overlapping annotations
slide-11
SLIDE 11

Unified Factuality Corpus

slide-12
SLIDE 12

Biased Distribution

  • Corpus skewed towards factual
  • Inherent trait of the news domain?
slide-13
SLIDE 13

Solution: Unified Corpus Model: Extending TruthTeller Evaluation

slide-14
SLIDE 14

TruthTeller (Lotan et al., 2013)

  • Rule based approach on dependency trees
  • Karttunen implicative signatures
  • Syntactic cues (modality, negation, etc.)
  • Hand-written lexicon of 1,700 predicates
slide-15
SLIDE 15

Extending TruthTeller

  • Semi automatic extension of lexicon by 40%
  • Translated from German verb classes (Eckle-Kohler, ACL 2016)
  • Supervised learning: TruthTeller as signal
  • Application of implicative signatures on PropS
slide-16
SLIDE 16

Solution: Unified Corpus Extending TruthTeller Evaluation

slide-17
SLIDE 17

Metrics (lee et al., 2015)

  • 1. Mean Absolute Error
  • Range: [0, 6]
  • Smaller is better!
  • 2. Pearson Correlation
  • How good is a system in recovering the variation
  • Well-suited for the biased news domain
slide-18
SLIDE 18

Evaluations

slide-19
SLIDE 19

Evaluations

Marking all propositions as factual Is a strong baseline on this dataset

slide-20
SLIDE 20

Evaluations

Dependency features correlate well

slide-21
SLIDE 21

Evaluations

Applying implicative signatures on AMR did not work well

slide-22
SLIDE 22

Evaluations

Hard coded rules aren't robust Enough across datasets

slide-23
SLIDE 23

Evaluations

Our extension of TruthTeller gets good results across all datasets

slide-24
SLIDE 24

Conclusions and Future Work

  • Resources made publicly available
  • Unified Factuality corpus
  • Conversion code and trained models
  • Future work
  • Annotate diverse domains
  • Integrate TruthTeller with more lexical-syntactic feats.
  • Try our online demo:

http://u.cs.biu.ac.il/~stanovg/factuality.html

Thanks for listening!