Expediting Search Trend Detection via Prediction of Query Counts - - PowerPoint PPT Presentation

expediting search trend detection via prediction of query
SMART_READER_LITE
LIVE PREVIEW

Expediting Search Trend Detection via Prediction of Query Counts - - PowerPoint PPT Presentation

Expediting Search Trend Detection via Prediction of Query Counts Nadav Golbandi Liran Katzir* Yehuda Koren* Ronny Lempel Yahoo! Labs, Haifa, Microsoft Research, Google, Haifa, Israel Yahoo! Labs, Haifa, Israel ATL-Israel Israel


slide-1
SLIDE 1

Expediting Search Trend Detection via Prediction of Query Counts

Nadav Golbandi

Yahoo! Labs, Haifa, Israel

Liran Katzir*

Microsoft Research, ATL-Israel

Yehuda Koren*

Google, Haifa, Israel

Ronny Lempel

Yahoo! Labs, Haifa, Israel

International Conference on Web Search and Data Mining, February 6th - 8th, 2013 Rome, Italy

slide-2
SLIDE 2

Search Trends

Search queries with a sudden popularity surge

0.2 0.4 0.6 0.8 1 12 24 36 48 60

#queries Time in Hours

Amanda Peet 0.2 0.4 0.6 0.8 1 12 24 36 48 60

#queries Time in Hours

Hybrid Cars

slide-3
SLIDE 3

Trending Topics

Microsoft MSN biggest movers Yahoo! Trending Now Google Trends

slide-4
SLIDE 4

Prior-art Detection Algorithm

1. Create a language model at time . 2. Estimate trendiness score

, = log | −max

  • log | .

3. Output top-k queries.

  • A. Dong, Y. Chang, Z. Zheng, G. Mishne, J. Bai, R. Zhang, K. Buchner, C. Liao, and F.
  • Diaz. Towards recency ranking in web search. In WSDM, pages 11-20, 2010.
  • A. Dong, Y. Chang, Z. Zheng, G. Mishne, J. Bai, R. Zhang, K. Buchner, C. Liao, and F.
  • Diaz. Towards recency ranking in web search. In WSDM, pages 11-20, 2010.

lowers the score of periodic queries

slide-5
SLIDE 5

Proposed Method:

New Algorithm

query counts during [t-1,t) Prediction Procedure predicted counts for [t,t+1) Prior-art Algorithm trending searches for time t+1 query counts during [t-1,t) Prior-art Algorithm trending searches for time t

slide-6
SLIDE 6

Goal: Keep precision and recall, reduce detection time !

New Algorithm

query counts during [t-1,t) Prediction Procedure predicted counts for [t,t+1) Prior-art Algorithm trending searches for time t+1 query counts during [t-1,t) Prior-art Algorithm trending searches for time t

slide-7
SLIDE 7

Intuition – counts time series

() () () !() Query q’s count … time

1m 1m 1m 1m

slide-8
SLIDE 8

Intuition – prediction input

#(, 30)

() () () !() Query q’s count … time

slide-9
SLIDE 9

Intuition – prediction target

#(, 30)

() () () !() Query q’s count … time

31, 32 … 90 What will be the query volume during following hour?

slide-10
SLIDE 10

Intuition – prediction target value

#(, 30)

() () () !() Query q’s count … time

31, 32 … 90 What will be the query volume during following hour?

& , 30 =

+ + ⋯ + !

slide-11
SLIDE 11

#(, ′) & *(, ′)

Explanatory and Response variables

, , … , ! + + ⋯ + ! #(, 30) &(, 30) #(, ) &(, ) #(, 31) &(, 31) #(, 32) &(, 32)

  • .

?

slide-12
SLIDE 12

#(, ′) & *(, ′)

Explanatory and Response variables

, , … , ! + + ⋯ + ! #(, 30) &(, 30) #(, ) &(, ) #(, 31) &(, 31) #(, 32) &(, 32)

  • .

? Training Phase

(only on trends)

Runtime Phase

(on all queries)

slide-13
SLIDE 13

Prediction Models

  • .

Auto-regressive Time-smoothed, single model

This method fits a vector - such that

.

slide-14
SLIDE 14

Prediction Models

  • (ℎ)

.

Auto-regressive Time-smoothed, per-hour model

This method fits a vector -(ℎ)for every hour ℎ such that

.

slide-15
SLIDE 15

Prediction Models

60 62 64 66 68 70 72 15 30 45 60 75 90 Per-Hour Model Single Model

% of good predictions Length of #(, )

  • (ℎ)

.

Auto-regressive Time-smoothed, per-hour model

This method fits a vector -(ℎ)for every hour ℎ such that

.

slide-16
SLIDE 16

Example of a specific hour model

2 4 6 8 10

  • 30
  • 20
  • 10

Coefficient

Index

  • .(18:00-19:00)
slide-17
SLIDE 17

Proposed Method

New Algorithm

query counts during [t-1,t) Prediction Procedure predicted counts for [t,t+1) Prior-art Algorithm trending searches for time t+1 query counts during [t-1,t) Prior-art Algorithm trending searches for time t

slide-18
SLIDE 18

Experimental Precision and Recall

0.95 0.97 0.99 1.01 1.03 1.05 10 20 %improvement Precision 0.95 0.97 0.99 1.01 1.03 1.05 10 20 %improvement Recall Number of search trends Number of search trends

slide-19
SLIDE 19

Experimental detection time

10 20 30 5 10 15 20 Number of search trends Detection time Average reduction in detection time

slide-20
SLIDE 20

An alternative perspective

Building a language model in a temporal corpus involves a trade off. Taking a long history creates bias. Taking a short history creates variance. The new method can be viewed as a time- sensitive language model.

slide-21
SLIDE 21

Conclusion/Discussion

  • We presented a new scheme for building a

language model as a building block in a temporal dynamic environment.

  • The new scheme relatively maintains system

performance (precision and recall), with reduction in detection time.

slide-22
SLIDE 22

Thanks