Bridging the ROUGE/Human Evaluation Gap in Multi-Document Summarization
John M. Conroy Judith D. Schlesinger
IDA Center for Computing Sciences, USA
Dianne P. O’Leary
University of Maryland, College Park, USA
Outline CLASSY 07
\[
P_{sq\rho}(t \mid \tau) = \frac{1}{4}\, s(t) + \frac{1}{4}\, q(t) + \frac{1}{2}\, \rho(t)
\]
for selection.
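The weighted combination above (weights 1/4, 1/4, and 1/2) can be sketched in a few lines of Python. This is only an illustration, not the authors' implementation: the slide does not define the terms, so the sketch assumes s(t) and q(t) are 0/1 indicators of membership in a signature-term set and a query-term set, and that the third term is a per-term probability estimate supplied as a dictionary. The names `term_weight`, `signature_terms`, `query_terms`, and `third_term` are hypothetical.

```python
def term_weight(t, signature_terms, query_terms, third_term):
    """Mix three per-term scores with weights 1/4, 1/4, and 1/2.

    Assumptions (not stated on the slide):
      - s(t) is 1 if t is in signature_terms, else 0
      - q(t) is 1 if t is in query_terms, else 0
      - third_term maps a term to a probability estimate (0 if absent)
    """
    s = 1.0 if t in signature_terms else 0.0
    q = 1.0 if t in query_terms else 0.0
    return 0.25 * s + 0.25 * q + 0.5 * third_term.get(t, 0.0)


if __name__ == "__main__":
    # Toy inputs, invented purely for illustration.
    sig = {"summarization"}
    qry = {"summarization", "rouge"}
    est = {"rouge": 0.5}
    for term in ("summarization", "rouge", "other"):
        print(term, term_weight(term, sig, qry, est))
```

Under these assumptions the weight always lies between 0 and 1, and a term that is neither a signature term nor a query term can reach at most 1/2.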
– Person writing summary judges all summaries?
– Personal interest (bias?) affects assessment.
– With self-assessments removed, the score was 4.7; a t-test indicates that humans like their own summaries more than other human summaries.