Peer-review analysis
Comprehensive exam
Presentered by : Wenting Xiong Committees: Diane Litman Rebecca Hwa Jingtao Wang
1
Peer-review analysis Comprehensive exam Presentered by : Wenting - - PowerPoint PPT Presentation
Peer-review analysis Comprehensive exam Presentered by : Wenting Xiong Diane Litman Committees: Rebecca Hwa Jingtao Wang 1 Motivation Goal Mine useful information in peers feedback and represent them in a intuitive and concise way
1
Mine useful information in peers’ feedback and represent them in a intuitive and concise way
– Identify review helpfulness
– Summarize reviewers’ comments
– Sense-making of review comments interactive review exploration
2
3
4
5
2007>,<Ghose 2010>, <O'Mahony 2010>
6
– Controversial results about the effectiveness of subjectivity features
shows positive words correlate with greater helpfulness <Ghose, et. al, 2010>
– Data sparsity issues?
Category Feature type Linguistic Unigrams, bigrams Low level Structural Syntactic Semantic: 1) domain lexicons 2) Subjectivity Sentiment analysis Readability metrics High level Social factors Reviewer profile Product ratings
7
e.g. ¡<Kim 2006>
Error rate of preference pair < 0.5 <Liu 2007>
8
9
10
dimensions
11
12
13
– Most frequent noun-phrase + sentiment-pivot expansion <Liu, 2004> – PMI (pointwise Mutual information) with meronymy discriminators + WordNet <Popescu 2005>
– Multiple-aspect sentiment model <Titov 2008> – Content-attitude model <Sauper 2011>
14
– Manually constructed subjective lexicons
– Adj & adv <Turney 2001> – opinion-bearing words <Liu 2004>
– Relaxiation labeling <Popescu 2005> – Scoring <Brody 2010>
– SCL algorithm <Blitzer 2007>
– MAS -- aspect-independent + aspect-dependent <Titov 2008> – Content-attitude models -- predicted posterior of sentiment distribution <Sauper, 2011>
15
– Bag-of-Words vs. Bag-of-Opinions <Qu 2010>
16
17
18
19
<Malakasiotis 2009>
20
21
22
arguments <Lin 2001>
23
24
25
I. Content selection
26
– Word-level
<Wang 2011>, external domain knowledge <Zhuang 2006>
– Sentence-level
– Summary-level
models <wang 2011>
27
– Rank-based sentence selection
<Wang, 2011>
– Topic-based selection
– Structured evaluative summary
– Generate evaluative arguments based on aggregation of extracted information <Carenini, 2006> – Graph-based summarization using adjacently matrix to model dialogue structure <Wang, 2011>
28
– Multiple human wrote gold-standards – SCU <Ani 2007>
– Automatically compare with gold-standard – Consider correlation based on unigram, bigram, longest common subsequence <Lin
2004>
– Good summary should be similar to the input – KL divergence, JS divergence <Ani 2009> Manual summary Manual rating Responsiv eness ✔ ✔ Pyramid ✗ ✔ ROUGE ✔ ✗ Fully auto ✗ ✗
model's distribution divergence
30
31
32
33
reviewSpotlight <Yatani, 2011>)
34
2009>
– Requires NLP techniques for opinion mining and sentiment analysis
(Aspect that opinions are most (dis)consisitant) <Carenini 2006>
35
36
37
– Compare single product reviews feature-by-feature <Liu 2005> – Connect interesting events of different period of times (Continuum, <Andre
2007>)
– Explore the connection of entities across documents (Jigsaw, <Stasko
2010>)
– Faceted metadata for image browsing <Yee 2003> – Facetbox for presenting filtering by facet-data <Lee 2009> – Exploring term-based language patterns across document <Don 2007>
– Themail <Viegas 2006>, Contitunn <Andre 2007> Tiara <Liu 2009>, TwitInfo
<Marcus 2011> etc.
38
– Paired <Liu 2005> – Matrix <Orlke 2009>
– Discover the entity-coverage in documents (Jigsaw <Stasko 2010>) – Visual DL search result with categorical and hierarchical axes <Shneiderman 2000>
– Exploring relationships between entities and documents (Jigsaw
<Stasko 2010>)
– *Diagram of social network (TIARA <Liu 2009>)
– Triangle scatter-plot of opinions (OpinionSeer <Wu 2010>) – *Opinion space <Faridani 2010>
39
– ThemeScapes <Wise 1995>
– OpinionSeer <Wu 2010>
+ stacked graph + triangle scatter plot
– TIARA <Liu 2010>
– Discover unperceivable interactions among multiple factors
– Concise but hard to interpret – Interaction is more complex and hard to design
40
Design principles and examples
From the data perspective
– Overview then detailed view
From the interaction perspective
– zoom-in and zoom-out for exploration – Hierarchic filtering for search and browse – Detail information as tooltip in explanatory visualization
– View switching for interest-specific visualization techniques – Query-based content browsing – Pivot action for navigating between related items
– Overview + detailed view – Support local interactions (hierarchically structured data) – A view of selection history of browsing
41
42
43
In terms of my own research interest
– How to model the real helpfulness of peer-reviews
– How to identify common themes and aggregate comments from different reviewers
– How to create informative representation of reviews – And design intuitive interactive-exploration for students or teachers to mind useful information
Challenges and contributions
44