Towards Personalized Review Summarization via User-Aware Sequence - - PowerPoint PPT Presentation

β–Ά
towards personalized review summarization via user aware
SMART_READER_LITE
LIVE PREVIEW

Towards Personalized Review Summarization via User-Aware Sequence - - PowerPoint PPT Presentation

Towards Personalized Review Summarization via User-Aware Sequence Network ADVISOR: JIA-LING, KOH SOURCE: AAAI-19 SPEAKER: WEI, LAI DATE: 2020/5/25 OUTLINE 01 03 INTRODUCTION EXPERIMENTS 02 04 METHOD CONCLUSION 2 4 User-aware


slide-1
SLIDE 1

ADVISOR: JIA-LING, KOH SOURCE: AAAI-19 SPEAKER: WEI, LAI DATE: 2020/5/25

Towards Personalized Review Summarization via User-Aware Sequence Network

slide-2
SLIDE 2

OUTLINE

03

INTRODUCTION EXPERIMENTS METHOD CONCLUSION

02 04 01

2

slide-3
SLIDE 3

4

slide-4
SLIDE 4

INTRODUCTION

Review summarization aims to generate a condensed summary for a review or multiple reviews. Different users may:

  • Care about different contents
  • Have their own writing styles

User-aware Sequence Network (USN)

3

slide-5
SLIDE 5

METHOD

User-aware Sequence Network (USN)

5

slide-6
SLIDE 6

Problem Formulation

User

  • Review
  • Summary

𝑣 𝑦 𝑧 {𝑦!, 𝑦",… , 𝑦%} {𝑧!, 𝑧",… , 𝑧&}

*m

) (

𝑾𝒕 𝑾𝒖

Source vocabulary (#:30,000) Target vocabulary (#:30, 000)

6

slide-7
SLIDE 7

Sequence-to-sequence attentional model

Get To The Point: Summarization with Pointer-Generator Networks

7

slide-8
SLIDE 8

8

slide-9
SLIDE 9

review embedding (dim=128) single-layer encoder hidden state (dim=256)

User-aware Encoder

9

slide-10
SLIDE 10

User-aware Encoder

𝑕𝑏𝑒𝑓# = 𝜏(𝑋

$ β„Ž#; 𝑣 + 𝑐$)

β„Žβ€²# = β„Ž# βŠ™ 𝑕𝑏𝑒𝑓# User Selection Strategy

(dim=128) User embedding (dim=256) Encoder hidden state

10

slide-11
SLIDE 11

Context vector

11

slide-12
SLIDE 12

Context Vector

Decoder hidden state 𝑓%,# = π‘Š'tanh(𝑋

(β„Žβ€²# + 𝑋)(𝑑% + 𝑐()

𝑏%,# = exp(𝑓%,#) βˆ‘# exp(𝑓%,#) 𝑑′% = =

#

𝛽%,#β„Žβ€²#

(dim=256) 12

slide-13
SLIDE 13

Vocabulary state

13

slide-14
SLIDE 14

Vocabulary State

Decoder hidden state 𝑻𝒖 User-specific vocabulary memory (K most user-specific words)

𝑕%,$ = π‘Š'tanh(𝑋

*𝑉$ + 𝑋)*𝑑% + 𝑐*)

𝛾%,$ = exp(𝑕%,$) βˆ‘$ exp(𝑕%,$) 𝑛% = =

$

𝛾%,$𝑉$

14

slide-15
SLIDE 15

Vocabulary distribution

15

slide-16
SLIDE 16

Vocabulary Distribution

𝑻𝒖 𝑻𝒖

𝑠′% = 𝑋

+) 𝑑)%; 𝑑%; 𝑣; 𝑛% + 𝑐+)

𝑄′,-. = 𝑑𝑝𝑔𝑒𝑛𝑏𝑦(𝑋

  • 𝑠)% + 𝑐-)

User Prediction Strategy User Memory Prediction Strategy

Readout state

16

slide-17
SLIDE 17

Final distribution

17

slide-18
SLIDE 18

Pointer-generator Model

Get To The Point: Summarization with Pointer-Generator Networks

18

slide-19
SLIDE 19

Vocabulary Distribution

𝑄

*/0 = 𝜏(𝑋 */[𝑑)%; 𝑑%; 𝑛%] + 𝑐*/)

User Memory Generation Strategy 𝑄 π‘₯ = 1 βˆ’ π‘ž*/0 𝑄),-. π‘₯ + π‘ž*/0 =

$:2'34

𝛾%,$

19

slide-20
SLIDE 20

Loss

𝑀 = βˆ’ 1 π‘š =

%35 6

π‘šπ‘π‘•π‘„

,-.(𝑧%)

20

slide-21
SLIDE 21

EXPERIMENTS

Data

Training: 536255-(5000+5000) = 526255 Validation: 5000 Test: 5000 21

slide-22
SLIDE 22

Comparison Methods

  • Lead-1 : extractive the first sentence
  • LexRank

: extractive approach based on PageRank algorithm

  • S2S+Att : abstractive model
  • SEASS : selective network + S2S + Att in sentence summarization
  • PGN : copy mechanism + S2S + Att in document summarization

22

slide-23
SLIDE 23

EXPERIMENTS

Results

23

slide-24
SLIDE 24

EXPERIMENTS

Aspect-level Coverage ho hote tel

24

slide-25
SLIDE 25

EXPERIMENTS

Aspect-level Coverage

25

slide-26
SLIDE 26

EXPERIMENTS

Different User-based Strategies

  • 1. User selection
  • 2. User prediction
  • 3. User memory prediction
  • 4. User memory generation

26

slide-27
SLIDE 27

EXPERIMENTS

User-specific vocabulary size

27

slide-28
SLIDE 28

EXPERIMENTS

User-based Selective Gate Visualization

  • Output summary: excellent service, comfy be
  • True Summary: excellent service, very comfortable bed

28

slide-29
SLIDE 29

EXPERIMENTS

Case study

29

slide-30
SLIDE 30

Conclusion

  • Propose a User-aware Sequence Network (USN) to consider user

information into personalized review summarization.

  • Propose 4 user-based strategies.
  • Construct a new dataset (Trip).
  • Future work: extend model to the multi-review scenario.

30