vox populi annotation measuring intensity of ideological
play

Vox Populi Annotation: Measuring Intensity of Ideological - PowerPoint PPT Presentation

Vox Populi Annotation: Measuring Intensity of Ideological Perspectives by Aggregating Group Judgments LREC, Marrakech, Morocco, May 28-30, 2008 Wei-Hao Lin and Alexander Hauptmann Language Technologies Institute School of Computer Science


  1. Vox Populi Annotation: Measuring Intensity of Ideological Perspectives by Aggregating Group Judgments LREC, Marrakech, Morocco, May 28-30, 2008 Wei-Hao Lin and Alexander Hauptmann Language Technologies Institute School of Computer Science Carnegie Mellon University 1

  2. Goal: Annotating Intensity of Expressing Ideology at the Sentence Level 2

  3. Sentence of High Intensity • In the first weeks of the Intifada, for example, Palestinian public protests and civilian demonstrations were answered brutally by Israel, which killed tens of unarmed protesters. 3

  4. Sentence of Low Intensity • The Rhodes aggrements of 1949 set them as the ceasefire lines between Israel and the Arab states. 4

  5. Annotating Intensity is Hard • Hard to define Strong, Medium, and Weak • Hard to train annotators • Hard to achieve high inter-rater agreement 5

  6. Solution: Vox Populi Annotation • Aggregate group judgments on a simple, forced binary question • “Which side do you think the sentence was written from?” 6

  7. Two Problems • How many annotators are needed? • Are these group judgments random? 7

  8. Number of Annotators • A statistical testing problem • The more annotators, the finer difference in intensity we can discern. 8

  9. Number of Annotators 1.0 � 0.9 � 0.75 0.8 0.6 0.6 p value � 0.4 � 0.2 � � � 0.0 � � � � � � � � � � � � � � � � � � � 5 10 15 20 25 sample size 9

  10. Reliability • Reliable = two groups agree with each other • Measure Pearson’s correlation coefficient 10

  11. Annotation Study • 250 sentences from editorials on the Israeli- Palestinian conflict • 18 participants • “Do you think the sentence is written from the Israeli or Palestinian perspective?” 11

  12. Distribution of Intensity 50 40 Frequency 30 20 10 0 0.0 0.2 0.4 0.6 0.8 1.0 Vox Populi Intensity 12

  13. Reliability Assessment 0.5 Vox Populi � � � random 0.5 0.4 � random 0.99 � 0.3 correlation 0.2 � � 0.1 0.0 � 0.1 1 2 3 4 5 6 group size 13

  14. Where to recruit many annotators? 14

  15. Conclusion • Vox Populi Annotation for hard annotation tasks • Solution to two problems in VPA • Positive correlation observed in an empirical annotation study 15

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend