CrowdTruth Metrics for Capturing Ambiguity Interlinking Workers, - - PowerPoint PPT Presentation

crowdtruth metrics for capturing ambiguity
SMART_READER_LITE
LIVE PREVIEW

CrowdTruth Metrics for Capturing Ambiguity Interlinking Workers, - - PowerPoint PPT Presentation

CrowdTruth Metrics for Capturing Ambiguity Interlinking Workers, Annotations and Input Data Traditional Humans provide annotations establishing the ground Human truth = the correct output for each example (the gold standard ) Annotation


slide-1
SLIDE 1

CrowdTruth Metrics for Capturing Ambiguity

Interlinking Workers, Annotations and Input Data

slide-2
SLIDE 2
  • Humans provide annotations establishing the ground

truth = the correct output for each example (the gold standard)

  • Machines learn the ground truth
  • Ground Truth Quality: typically measured by

inter-annotator agreement (e.g. majority vote); founded

  • n the ideal for single, universally constant truth
  • which means - ambiguity of textual interpretation is
  • ften lost

Traditional Human Annotation

CrowdTruth.org #CrowdTruth Anca Dumitrache @anca_dmtrch

slide-3
SLIDE 3
  • Annotator disagreement is signal, not noise.
  • It can indicate of the variation in human semantic

interpretation

  • Can be used to capture ambiguity, vagueness,

similarity, over-generality, as well as quality

CrowdTruth Methodology

CrowdTruth.org #CrowdTruth Anca Dumitrache @anca_dmtrch

What causes disagreement to happen?

slide-4
SLIDE 4

Disagreement because of Low Quality Workers

Do the sentences express a relation?

CrowdTruth.org #CrowdTruth Anca Dumitrache @anca_dmtrch

slide-5
SLIDE 5

Disagreement because of Sentence Clarity

Do the sentences express a relation between and ?

CrowdTruth.org #CrowdTruth Anca Dumitrache @anca_dmtrch

slide-6
SLIDE 6

Disagreement because of Sentence Clarity

Do the sentences express a relation between and ? → agreement 95% → agreement 75% → agreement 50%

CrowdTruth.org #CrowdTruth Anca Dumitrache @anca_dmtrch

slide-7
SLIDE 7

Disagreement because of an Ambiguous Annotation Task

What is the relation expressed?

  • r

?

CrowdTruth.org #CrowdTruth Anca Dumitrache @anca_dmtrch

slide-8
SLIDE 8

Triangle of disagreement as model for crowdsourcing systems Ambiguity at any corner disseminates in the other corners

CrowdTruth.org #CrowdTruth Anca Dumitrache @anca_dmtrch

e.g. sentence, paragraph, image, sound etc.

slide-9
SLIDE 9

CrowdTruth quality metrics

  • CrowdTruth.org

#CrowdTruth Anca Dumitrache @anca_dmtrch

slide-10
SLIDE 10

CrowdTruth.org github.com/CrowdTruth/CrowdTruth-core pypi.org/project/CrowdTruth data.CrowdTruth.org