MultiConVis: A Visual Text Analytics System for Exploring a - - PowerPoint PPT Presentation

multiconvis a visual text analytics system for exploring
SMART_READER_LITE
LIVE PREVIEW

MultiConVis: A Visual Text Analytics System for Exploring a - - PowerPoint PPT Presentation

Department of Computer Science University of British Columbia MultiConVis: A Visual Text Analytics System for Exploring a Collection of Online Conversations Enamul Hoque, Giuseppe Carenini {enamul, carenini}@cs.ubc.ca NLP group @ UBC Rise of


slide-1
SLIDE 1

MultiConVis: A Visual Text Analytics System for Exploring a Collection of Online Conversations

Enamul Hoque, Giuseppe Carenini

{enamul, carenini}@cs.ubc.ca NLP group @ UBC Department of Computer Science University of British Columbia

slide-2
SLIDE 2

Rise of Text Conversations

  • People engage in asynchronous conversations frequently
  • e.g., blogs, forums.

Blogs:

  • More than 100 millions of blogs
  • The audience is rising exponentially
  • Many different categories: politics, technology, business, sports,…

2

slide-3
SLIDE 3

Problem Scenario

  • Lot of articles and comments were posted on Macrumors.
  • John is interested about buying iPhone6.
  • He decides to explore blogs about this issue to verify whether the bending issue

is serious.

3

slide-4
SLIDE 4

Problem Scenario

4

4

Existing Interfaces

  • Lack of high-level abstraction
  • Only show conversations/comments

as paginated lists ordered by recency

  • Too many conversations
  • Too many comments

=> Information Overload Users

  • Focus on most recent

conversations/comments

  • Generate short responses
  • Leave conversations prematurely
slide-5
SLIDE 5

Our Solution

tightly integrate text analysis and interactive visualization to support users in exploring collection of online conversations. NLP Interactive visualization

5

slide-6
SLIDE 6

ConVis: Exploring a Long Conversation

Conversation Overview Topics Authors Conversation view

6

highly negative

highly positive comment length

Enamul Hoque and Giuseppe Carenini (EuroVis 2014, IUI 2015).

slide-7
SLIDE 7

MultiConVis: Exploring a Collection of Conversations

  • Large number of topics-> organize topics into hierarchy
  • Designed on top of ConVis: switch from exploring a collection of conversations to a single

conversation

7

slide-8
SLIDE 8

Contributions

  • Hierarchical topic modeling method
  • organizes large set of topics from multiple

conversations

  • User-centered design of MultiConVis.
  • multi-scale exploration of a collection of

conversations

  • Evaluation of MultiConVis :
  • user performance and subjective opinions

compared to a traditional interface

slide-9
SLIDE 9

Contributions

  • Hierarchical topic modeling method
  • organizes large set of topics from multiple

conversations

  • User-centered design of MultiConVis.
  • multi-scale exploration of a collection of

conversations

  • Evaluation of MultiConVis :
  • user performance and subjective opinions

compared to a traditional interface

slide-10
SLIDE 10

Topic Hierarchy Generation for Multiple Conversations

Bottom-up approach:

10

Collection-level topics

Conversation C1 Conversation Ci Conversation Cn

… … … … T1 Ti Tn Generate topics for each conversation Taking conversational features into account (Joty et al., 2013) The sets of topics {T1, Ti, Tn}are clustered into a hierarchical topic structure

1 2

slide-11
SLIDE 11

1) Create a weighted undirected graph:

Nodes: Topics from conversations Edge weight w(x,y): Similarity between two topics x and y Sum of the pairwise similarity between their sentences

Smaller iPhone Structural parts

Topic Hierarchy Generation for Multiple Conversations

Apple customer care Thin metal Apple responses

11

(, )

slide-12
SLIDE 12

1) Create a weighted undirected graph: 2) Apply Graph based clustering

  • Normalized cut criteria (Shi & Malik, 2000)
  • Num. of topics:

Maximize:

3) Label each cluster

Smaller iPhone Structural parts Apple customer care Thin metal Apple responses

Customer care

12

Structural issues

Topic Hierarchy Generation for Multiple Conversations

(, )

= ∑ ,

,∈

,

∈,∈

− ( ∑ (, )

,∈

∑ (, )

∈,∈

)

  • (Newman and Girvan, 2004)
slide-13
SLIDE 13

Contributions

  • Hierarchical topic modeling method
  • organizes large set of topics from multiple

conversations

  • User-centered design of MultiConVis.
  • multi-scale exploration of a collection of

conversations

  • Evaluation of MultiConVis :
  • user performance and subjective opinions

compared to a traditional interface

slide-14
SLIDE 14

User Requirements Analysis

14

Why and how people explore a collection of conversations?

  • Information seeking
  • Fact checking
  • Guidance seeking
  • Keep track of arguments and evidences
  • When aspect: Find out what are people

thinking or feeling about X over time”

  • Have fun and enjoyment

(Hearst 08)

Topics Sentiment Time Authors

slide-15
SLIDE 15

User Requirements Analysis

15

Why and how people explore a collection of conversations?

  • Variety seeking behaviour:
  • Read various sub-topics of a topic
  • Skimming behaviour: Explore vs. focused

reading

  • Switching between multiple-levels of

granularity:

Various levels All Conversations Subset of relevant Conversations One Conversation

  • > Comments
slide-16
SLIDE 16

Data Abstractions

Levels Facets

Collection of Conversations One Conversation Topics Hierarchy with all topics from all conversations List of topics Time

  • Start day/time
  • Volume of comments over time

comments are ordered chronologically Sentiment

  • Sentiment distribution for each

conversation

  • Sentiment evolution over time

for each conversation Sentiment distribution for each comment Authors Number of authors for each conversation List of authors

16

slide-17
SLIDE 17

Topic hierarchy

Visual Encoding: Set of Conversations

Conversation List Timeline Search

17

slide-18
SLIDE 18

Visual Encoding: Set of Conversations

Conversation List Timeline

  • Topic hierarchy
  • node labels are more important,
  • Links are less important,
  • Indented tree representation: compact
  • Can show 50 nodes without vertical scrolling,

sufficient for most datasets

  • font size: How much this topic has been discussed

18

slide-19
SLIDE 19

Conversation List

Sentiment distribution Title Text snippet Count (topics) Count (authors) Volume of comments over time

Visual Encoding: Set of Conversations

19

Information scent

slide-20
SLIDE 20

Video Demo

20

slide-21
SLIDE 21

Contributions

  • Hierarchical topic modeling method
  • organizes large set of topics from multiple

conversations

  • User-centered design of MultiConVis.
  • multi-scale exploration of a collection of

conversations

  • Evaluation of MultiConVis :
  • user performance and subjective opinions

compared to a traditional interface

slide-22
SLIDE 22

User Evaluation

Case studies:

  • Participants explored the datasets according to their information needs
  • Regular blog reader: iPhone bending
  • Journalist: ObamaCare health reform
  • Business analyst: iWatch release
  • In follow-up interviews: topic hierarchy was extremely useful

Laboratory study:

  • Compare with a traditional interface
  • Task: Explore the given set of conversations, write a summary of major

keypoints

22

slide-23
SLIDE 23

Evaluation: Lab Study

  • 16 subjects (aged 18-37, 6 females)
  • Within subjects

Traditional interface MultiConVis

23

slide-24
SLIDE 24

User Study: Selected Results

  • Time-to-task completion: No significant difference
  • Subjective ratings:
  • Preference:
  • MultiConVis (75%): topic organization, visual overview of conversations
  • Traditional interface (25%): simplicity and familiarity

24

1 2 3 4 5

Usefulness Ease of use Enjoyable Find major points Find more insightful comments Write a more informative summary

MultiConVis Traditional Interface

slide-25
SLIDE 25

Conclusions

1) Hierarchical topic modeling for a collection of online conversations

  • consider unique features of conversations.

2) Design of MultiConVis.

  • Multi-scales exploration of a collection of conversation
  • Consistency of encoding among various scales

3) Evaluation

  • MultiConVis was preferred by majority of participants
  • Assessment of different interface features

25

slide-26
SLIDE 26

Future Work

  • Interactive topic hierarchy revisions
  • Allow user to modify topic hierarchy
  • Apply and tailor to specific conversational genres
  • Community question answering forums
  • MOOC forums
  • ….
  • Online longitudinal study
  • For ecologically validity

26

slide-27
SLIDE 27

For More Information…

www.cs.ubc.ca/cs-research/lci/research-groups/natural-language-processing/

27

Raymond T. Ng Tamara Munzner

Thanks: