Argument Search with Voice Assistants Master's Thesis by Kevin Lang - - PowerPoint PPT Presentation

argument search with voice assistants
SMART_READER_LITE
LIVE PREVIEW

Argument Search with Voice Assistants Master's Thesis by Kevin Lang - - PowerPoint PPT Presentation

Argument Search with Voice Assistants Master's Thesis by Kevin Lang Referees: Advisor: Prof. Dr. Benno Stein Johannes Kiesel Prof. Dr. Ing. Eva Hornecker Outline Motivation Study 1: Online Survey Study 2: Wizard of Oz


slide-1
SLIDE 1

Argument Search with Voice Assistants

Master's Thesis by Kevin Lang

Referees:

  • Prof. Dr. Benno Stein
  • Prof. Dr. Ing. Eva Hornecker

Advisor: Johannes Kiesel

slide-2
SLIDE 2

Outline

  • Motivation
  • Study 1: Online Survey
  • Study 2: Wizard of Oz Experiment
  • Summary
  • Conclusion
  • Future Work

2

slide-3
SLIDE 3

Why…

  • adopting a pet?
  • buying a car?
  • voting for this candidate?

Motivation

3

slide-4
SLIDE 4

Where can I find arguments?

  • Sources?
  • Trustworthy?
  • Convincing?
  • Counter-arguments?

Motivation

4

slide-5
SLIDE 5

Motivation

  • Argument Search Engine for the Web (Wachsmuth et al., 2017)

5

slide-6
SLIDE 6

Motivation

2001: A Space Odyssey, 1968 Star Trek IV: The Voyage Home, 1986

6

slide-7
SLIDE 7

Conversational Voice Assistant

  • convenient to use and hands-free
  • used for many small tasks
  • Future goals:

○ Voice assistant as discussion partner ○ Decision making

Motivation

7

➔ Search for arguments

slide-8
SLIDE 8

Core Questions in this Thesis

Why people want to use a voice assistant for argument search? How does the user interact with the novel system? Which responses do they expect from it?

Motivation

8

slide-9
SLIDE 9

Online Survey

  • Asking about the acceptance of:

○ Motivations ○ Situations (Locations, Audiences) ○ Possible Features

Study 1

9

slide-10
SLIDE 10

Process

Study 1

  • 67 participants
  • 39 English, 28 German
  • 18~30 years(49), 31~49(11), 50~64(5), 65+ (1)

10

slide-11
SLIDE 11

Motivations

Study 1

11

slide-12
SLIDE 12

Motivations

Study 1

11

slide-13
SLIDE 13

Motivations

Study 1

11

slide-14
SLIDE 14

Motivations

Study 1 Fun motivations derived from: "Like Having a Really Bad PA": The Gulf Between User Expectation and Experience of Conversational Agents (Luger and Sellen, 2016)

11

slide-15
SLIDE 15

Situations

Study 1

  • Similar insights: “Evaluating the Social Acceptability of Voice Based Smartwatch Search” (Efthymiou and Halvey, 2016)

12

slide-16
SLIDE 16

Situations

Study 1

  • Similar insights: “Evaluating the Social Acceptability of Voice Based Smartwatch Search” (Efthymiou and Halvey, 2016)

12

slide-17
SLIDE 17

Situations

Study 1

  • Similar insights: “Evaluating the Social Acceptability of Voice Based Smartwatch Search” (Efthymiou and Halvey, 2016)

12

slide-18
SLIDE 18

Features

Study 1

13

slide-19
SLIDE 19

Features

Study 1

13

slide-20
SLIDE 20

Features

Study 1

13

slide-21
SLIDE 21

Features

Study 1

13

slide-22
SLIDE 22

Features

Study 1

13

slide-23
SLIDE 23

Features

Study 1

13

slide-24
SLIDE 24

Ranking Criteria

Study 1

14

slide-25
SLIDE 25

Ranking Criteria

Study 1

14

slide-26
SLIDE 26

Implementation and Evaluation

  • Argument search engine not reliable enough
  • Bad voice recognition
  • Wrong matching of intents

15

Study 2

slide-27
SLIDE 27

Wizard of Oz Experiment

  • Mock-up prototype
  • Avoid problems in

○ speech recognition ○ intent matching ○ system errors

Study 2

16

slide-28
SLIDE 28

Variables

Study 2

Motivations: Behaviour of the system:

  • Making a decision
  • Convince somebody
  • Without category-guideline*
  • With category-guideline

* “Investigating how conversational search agents affect user's behaviour, performance and search experience” (Dubiel et al., 2018)

17

slide-29
SLIDE 29

Topics

Study 2

18

slide-30
SLIDE 30

Experimental set-up

Study 2

19

slide-31
SLIDE 31

Agent-side

Study 2

  • Prepared topics with arguments

○ Splitted in categories ○ Annotated with total numbers

  • Behaviour rules

○ Conversational rules ○ Utterances for intents ○ How to present arguments

20

slide-32
SLIDE 32

User-side

Study 2

  • Set-up:

○ Comfortable sofa ○ Voice interface on armrest

  • Participants:

○ 12 male, 6 female ○ 18~30 years (13), 31~49 (5) ○ English level intermediate or proficient

21

slide-33
SLIDE 33

Transcript

Study 2

  • Transcribed 72 audio records, classified with action tags
  • 936 turns by the agent, 956 turns by the users
  • 1.808 classified actions by the agent, 1.033 by the users

22

slide-34
SLIDE 34

Results

Study 2

actions by agent # Read pro arguments No arguments left Ask category Count arguments Ask pro or con arguments Read con arguments Ask more arguments ... 204 178 170 165 161 160 158 actions by users # Affirmation Request pro arguments Negation Open topic Request additional information Request con arguments Activate ... 247 126 105 77 65 62 55

23

slide-35
SLIDE 35

Results

Study 2

actions by agent # Read pro arguments No arguments left Ask category Count arguments Ask pro or con arguments Read con arguments Ask more arguments ... 204 178 170 165 161 160 158 actions by users # Affirmation Request pro arguments Negation Open topic Request additional information Request con arguments Activate ... 247 126 105 77 65 62 55

23

slide-36
SLIDE 36

Results

Study 2

actions by agent # Read pro arguments No arguments left Ask category Count arguments Ask pro or con arguments Read con arguments Ask more arguments ... 204 178 170 165 161 160 158 actions by users # Affirmation Request pro arguments Negation Open topic Request additional information Request con arguments Activate ... 247 126 105 77 65 62 55

23

slide-37
SLIDE 37

Results

Study 2

actions by agent # Read pro arguments No arguments left Ask category Count arguments Ask pro or con arguments Read con arguments Ask more arguments ... 204 178 170 165 161 160 158 actions by users # Affirmation Request pro arguments Negation Open topic Request additional information Request con arguments Activate ... 247 126 105 77 65 62 55

23

slide-38
SLIDE 38

Study 2

24

Additional Information requests for...

Definitions:

“What does WWF stand for?” → encyclopedia

slide-39
SLIDE 39

Study 2

24

Additional Information requests for...

Definitions:

“What does WWF stand for?” → encyclopedia

Product details:

“How much is the average cost of an electric car?” → shops & product databases

slide-40
SLIDE 40

Study 2

24

Additional Information requests for...

Definitions:

“What does WWF stand for?” → encyclopedia

Product details:

“How much is the average cost of an electric car?” → shops & product databases

Other resources:

“Do you know how many people will be at the Zoo Erfurt tomorrow?” → blogs, scientific paper, statistics

slide-41
SLIDE 41

Study 2

24

Definitions:

“What does WWF stand for?” → encyclopedia

Product details:

“How much is the average cost of an electric car?” → shops & product databases

Other resources:

“Do you know how many people will be at the Zoo Erfurt tomorrow?” → blogs, scientific paper, statistics

Agent:

“What do you think of this topic?” → decision-making ability

Additional Information requests for...

slide-42
SLIDE 42

Making a Decision (D) vs. Convincing Somebody (C)

Study 2

25

slide-43
SLIDE 43

Study 2

25

Making a Decision (D) vs. Convincing Somebody (C)

slide-44
SLIDE 44

Study 2

25

Making a Decision (D) vs. Convincing Somebody (C)

slide-45
SLIDE 45

Study 2

25

Making a Decision (D) vs. Convincing Somebody (C)

slide-46
SLIDE 46

Category-Guideline

Quantitative data: + slightly better ratings in every aspect Qualitative data: + +

  • verview

comparison instruction number of categories felt limited

Study 2

26

slide-47
SLIDE 47

Overall Impression

+ +

  • fresh and new experience, very comfortable to do it hands-free

Nice flexible input

  • verview + memory problems

Skipping and navigation missing additional information speech synthesis

Study 2

27

slide-48
SLIDE 48
  • First work which combines argument mining, explorative search and

voice-based interface

Summary

28

  • 1st study: online survey about motivational & situational aspects +

possible features

  • 2nd study: design of a mock-up prototype + evaluation with Wizard of

Oz experiment

  • User ratings and measurements of the experiments
  • Transcript of 72 sessions between the human agent and the users
slide-49
SLIDE 49

Insights

  • Pre-Analysis:

○ Situation: at home, mostly alone or with friends ○ Motivation: preferred for tasks with low impact

  • The Application:

○ Missing overview of the arguments ○ Memory and navigational problems ○ Possibility to request additional information

Conclusion

29

slide-50
SLIDE 50

Future Work

  • Comparison to argument search with web-interface
  • Definition of the goal for exploratory tasks
  • Including displays in form of home devices or smartphones
  • Missing evaluation:

○ States and transitions between user and agent (Markov model) ○ Sentiment of the requests ○ Obstacles and solutions ○ Category selection

30

slide-51
SLIDE 51

Thank you for your attention!

31