Visual Text Analytics for Online Conversations Enamul Hoque PhD - - PowerPoint PPT Presentation

visual text analytics for online conversations
SMART_READER_LITE
LIVE PREVIEW

Visual Text Analytics for Online Conversations Enamul Hoque PhD - - PowerPoint PPT Presentation

Visual Text Analytics for Online Conversations Enamul Hoque PhD Candidate, Computer Science, UBC enamul@cs.ubc.ca Problem Scenario Lot of articles and comments were posted on Macumers. John is interested about buying iPhone6. He


slide-1
SLIDE 1

Visual Text Analytics for Online Conversations

Enamul Hoque

PhD Candidate, Computer Science, UBC enamul@cs.ubc.ca

slide-2
SLIDE 2

Problem Scenario

  • Lot of articles and comments were posted on Macumers.
  • John is interested about buying iPhone6.
  • He decides to explore blogs about this issue to verify whether the bending issue

is serious.

2

slide-3
SLIDE 3

Problem Scenario

3 3

Existing Interfaces

  • Lack of high-level abstraction
  • Only show conversations/comments

as paginated lists ordered by recency

  • Too many conversations
  • Too many comments

=> Information Overload Users

  • Focus on most recent

conversations/comments

  • Generate short responses
  • Leave conversations prematurely
slide-4
SLIDE 4

Our Goal

tightly integrate text analysis and interactive visualization to support users in exploring collection of online conversations. NLP Interactive visualization

4

slide-5
SLIDE 5

Tools for Exploring Online Conversations

5

ConVis, Eurovis 2014 MultiConVis, IUI 2016 CQAVis, IUI 2017

slide-6
SLIDE 6

Overall Approach

6

Characterizing the Domain of Blogs Blog Data and Tasks Abstractions Interactive Visualization of Conversations Mining Blog Conversations

slide-7
SLIDE 7

Characterizing the Domain of Blogs

7

Why and how people read blogs?

Tasks Data

  • Computer mediated communications
  • Social media
  • Human computer interactions (HCI)
  • Information retrieval

Information seeking Guidance seeking Keep track of arguments and evidences Have fun and enjoyment Variety seeking behaviour Skimming behaviour

slide-8
SLIDE 8

Blog Data and Tasks Abstractions

TASKS What this conversation is about? Which topics are generating more discussions? What do people say about topic X? How controversial was the conversation? Were there substantial differences in opinion? Why are people supporting/ opposing an opinion? …

8

Topic Author Opinion Thread Comment

x x x x x x x x x x x x x … … … … …

Data Variables

slide-9
SLIDE 9

Text Analysis for Conversations

  • Topic modeling

– Take advantage of the conversational structure – Graph based clustering (normalized n-cut) – Generate keyphrases for each cluster

  • Co-ranking
  • Sentiment analysis

– So-CAL: Lexicon-based approach – Compute polarity distribution for each comment

9

(Taboada et al., JCL 2011) (Joty et al., 2013)

slide-10
SLIDE 10

Designing ConVis: High-Fidelity Prototype

Conversation Overview Topics Authors Conversation view

10

For particular tasks such as document comprehension, overview + details has been found more

  • effective. (Cockburn et al. 2008)

highly negative

highly positive comment length

slide-11
SLIDE 11

MultiConVis: Exploring a Collection of Conversations

  • Large number of topics-> organize topics into hierarchy
  • Designed on top of ConVis: switch from exploring a collection of conversations to a single

conversation

11

slide-12
SLIDE 12

Topic Hierarchy Generation for Multiple Conversations

Bottom-up approach:

12

Collection-level topics

Conversation C1 Conversation Ci Conversation Cn

… … … … T1 Ti Tn Generate topics for each conversation Taking conversational features into account (Joty et al., 2013) The sets of topics {T1, Ti, Tn}are clustered into a hierarchical topic structure

1 2

slide-13
SLIDE 13

1) Create a weighted undirected graph

Nodes: Topics from conversations Edge weight w(x,y): Similarity between two topics x and y

2) Apply Graph based clustering

  • N-cut criteria

3) Label each cluster

Smaller iPhone Structural parts

Topic Hierarchy Generation for Multiple Conversations

Apple customer care Thin metal Apple responses

(Shi & Malik, 2000)

Structural issues Customer care

slide-14
SLIDE 14

User-centered Design of MultiConVis

14

Topic hierarchy Conversation List Timeline Search

slide-15
SLIDE 15

Further Information

UBC @ NLP

15