Cross-Domain Learning-to-rank with SVM Erheng Zhong 1 1 Department of - - PowerPoint PPT Presentation

cross domain learning to rank with svm
SMART_READER_LITE
LIVE PREVIEW

Cross-Domain Learning-to-rank with SVM Erheng Zhong 1 1 Department of - - PowerPoint PPT Presentation

Preliminary Cross-domain Learning-to-Rank Summary Cross-Domain Learning-to-rank with SVM Erheng Zhong 1 1 Department of Computer Science and Technology, HKUST COMP621U Presentation, 04/07/2011 Erheng Zhong Cross-Domain Learning-to-rank


slide-1
SLIDE 1

Preliminary Cross-domain Learning-to-Rank Summary

Cross-Domain Learning-to-rank with SVM

Erheng Zhong1

1Department of Computer Science and Technology, HKUST

COMP621U Presentation, 04/07/2011

Erheng Zhong Cross-Domain Learning-to-rank

slide-2
SLIDE 2

Preliminary Cross-domain Learning-to-Rank Summary

Outline

1

Preliminary Ranking Learning-to-rank Transfer Learning

2

Cross-domain Learning-to-Rank Motivations Approach: RankSVM Main Results

Erheng Zhong Cross-Domain Learning-to-rank

slide-3
SLIDE 3

Preliminary Cross-domain Learning-to-Rank Summary Ranking Learning-to-rank Transfer Learning

Outline

1

Preliminary Ranking Learning-to-rank Transfer Learning

2

Cross-domain Learning-to-Rank Motivations Approach: RankSVM Main Results

Erheng Zhong Cross-Domain Learning-to-rank

slide-4
SLIDE 4

Preliminary Cross-domain Learning-to-Rank Summary Ranking Learning-to-rank Transfer Learning

Definition

A relationship between a set of items. A weak order or total preorder of objects. (mathematics) A central part of many information retrieval problems!

Erheng Zhong Cross-Domain Learning-to-rank

slide-5
SLIDE 5

Preliminary Cross-domain Learning-to-Rank Summary Ranking Learning-to-rank Transfer Learning

Applications

Search Engine

Erheng Zhong Cross-Domain Learning-to-rank

slide-6
SLIDE 6

Preliminary Cross-domain Learning-to-Rank Summary Ranking Learning-to-rank Transfer Learning

Applications

Recommendation System

Erheng Zhong Cross-Domain Learning-to-rank

slide-7
SLIDE 7

Preliminary Cross-domain Learning-to-Rank Summary Ranking Learning-to-rank Transfer Learning

Applications

Computational Advertising

Erheng Zhong Cross-Domain Learning-to-rank

slide-8
SLIDE 8

Preliminary Cross-domain Learning-to-Rank Summary Ranking Learning-to-rank Transfer Learning

Outline

1

Preliminary Ranking Learning-to-rank Transfer Learning

2

Cross-domain Learning-to-Rank Motivations Approach: RankSVM Main Results

Erheng Zhong Cross-Domain Learning-to-rank

slide-9
SLIDE 9

Preliminary Cross-domain Learning-to-Rank Summary Ranking Learning-to-rank Transfer Learning

Concepts

Learning-to-rank [1] is to automatically construct a ranking model from training data. Training Data:

Lists of <query,item> pairs with some partial order specified between pairs < X, y >; where X = {xi = (qk, tkj)}ℓ

i=1 and y = {yi}ℓ i=1

Ranking Model:

A function computing relevance of items for actual queries f

  • x = (q, t)
  • = ¯

y

Erheng Zhong Cross-Domain Learning-to-rank

slide-10
SLIDE 10

Preliminary Cross-domain Learning-to-Rank Summary Ranking Learning-to-rank Transfer Learning

Features

http://research.microsoft.com/en-us/projects/ mslr/feature.aspx

Erheng Zhong Cross-Domain Learning-to-rank

slide-11
SLIDE 11

Preliminary Cross-domain Learning-to-Rank Summary Ranking Learning-to-rank Transfer Learning

Framework

Erheng Zhong Cross-Domain Learning-to-rank

slide-12
SLIDE 12

Preliminary Cross-domain Learning-to-Rank Summary Ranking Learning-to-rank Transfer Learning

Approaches

Three groups with different input representations and loss functions: Pointwise Approach:

Each query-document pair in the training data has a numerical or ordinal score. A regression problem.

Pairwise Approach:

A binary classifier which can tell which document is better in a given pair of documents. The goal is to minimize average number of inversions in ranking.

Listwise Approach:

They directly optimize the value of one evaluation measure.

Erheng Zhong Cross-Domain Learning-to-rank

slide-13
SLIDE 13

Preliminary Cross-domain Learning-to-Rank Summary Ranking Learning-to-rank Transfer Learning

Approaches

Three groups with different input representations and loss functions: Pointwise Approach:

Each query-document pair in the training data has a numerical or ordinal score. A regression problem.

Pairwise Approach:

A binary classifier which can tell which document is better in a given pair of documents. The goal is to minimize average number of inversions in ranking.

Listwise Approach:

They directly optimize the value of one evaluation measure.

Erheng Zhong Cross-Domain Learning-to-rank

slide-14
SLIDE 14

Preliminary Cross-domain Learning-to-Rank Summary Ranking Learning-to-rank Transfer Learning

Approaches

Three groups with different input representations and loss functions: Pointwise Approach:

Each query-document pair in the training data has a numerical or ordinal score. A regression problem.

Pairwise Approach:

A binary classifier which can tell which document is better in a given pair of documents. The goal is to minimize average number of inversions in ranking.

Listwise Approach:

They directly optimize the value of one evaluation measure.

Erheng Zhong Cross-Domain Learning-to-rank

slide-15
SLIDE 15

Preliminary Cross-domain Learning-to-Rank Summary Ranking Learning-to-rank Transfer Learning

Approaches

Three groups with different input representations and loss functions: Pointwise Approach:

Each query-document pair in the training data has a numerical or ordinal score. A regression problem.

Pairwise Approach:

A binary classifier which can tell which document is better in a given pair of documents. The goal is to minimize average number of inversions in ranking.

Listwise Approach:

They directly optimize the value of one evaluation measure.

Erheng Zhong Cross-Domain Learning-to-rank

slide-16
SLIDE 16

Preliminary Cross-domain Learning-to-Rank Summary Ranking Learning-to-rank Transfer Learning

Outline

1

Preliminary Ranking Learning-to-rank Transfer Learning

2

Cross-domain Learning-to-Rank Motivations Approach: RankSVM Main Results

Erheng Zhong Cross-Domain Learning-to-rank

slide-17
SLIDE 17

Preliminary Cross-domain Learning-to-Rank Summary Ranking Learning-to-rank Transfer Learning

Concepts and Notations

Transfer learning [2] refers to the machine learning framework in which one extracts knowledge from some auxiliary domains to help boost the learning performance in a target domain. Auxiliary domain: Ds = {Xs, ys} Target domain: Dt = {Xℓ, yℓ; Xu} Ps((x), y)=Pt((x), y)

Erheng Zhong Cross-Domain Learning-to-rank

slide-18
SLIDE 18

Preliminary Cross-domain Learning-to-Rank Summary Ranking Learning-to-rank Transfer Learning

Approaches

“what to transfer” [2] Model-based Transfer:

Discover shared parameters or prior between cross-domain models.

Feature-based Transfer:

Find a “good” feature representation that reduces the difference and prediction error between domains.

Instance-based Transfer:

Re-weight some labeled data in the auxiliary domain for use in the target domain.

Erheng Zhong Cross-Domain Learning-to-rank

slide-19
SLIDE 19

Preliminary Cross-domain Learning-to-Rank Summary Ranking Learning-to-rank Transfer Learning

Approaches

“what to transfer” [2] Model-based Transfer:

Discover shared parameters or prior between cross-domain models.

Feature-based Transfer:

Find a “good” feature representation that reduces the difference and prediction error between domains.

Instance-based Transfer:

Re-weight some labeled data in the auxiliary domain for use in the target domain.

Erheng Zhong Cross-Domain Learning-to-rank

slide-20
SLIDE 20

Preliminary Cross-domain Learning-to-Rank Summary Ranking Learning-to-rank Transfer Learning

Approaches

“what to transfer” [2] Model-based Transfer:

Discover shared parameters or prior between cross-domain models.

Feature-based Transfer:

Find a “good” feature representation that reduces the difference and prediction error between domains.

Instance-based Transfer:

Re-weight some labeled data in the auxiliary domain for use in the target domain.

Erheng Zhong Cross-Domain Learning-to-rank

slide-21
SLIDE 21

Preliminary Cross-domain Learning-to-Rank Summary Ranking Learning-to-rank Transfer Learning

Approaches

“what to transfer” [2] Model-based Transfer:

Discover shared parameters or prior between cross-domain models.

Feature-based Transfer:

Find a “good” feature representation that reduces the difference and prediction error between domains.

Instance-based Transfer:

Re-weight some labeled data in the auxiliary domain for use in the target domain.

Erheng Zhong Cross-Domain Learning-to-rank

slide-22
SLIDE 22

Preliminary Cross-domain Learning-to-Rank Summary Ranking Learning-to-rank Transfer Learning

Applications

Text classification Sentiment analysis Image classification Name-entity recognition WiFi localization Spam Filtering . . . Ranking!

Erheng Zhong Cross-Domain Learning-to-rank

slide-23
SLIDE 23

Preliminary Cross-domain Learning-to-Rank Summary Ranking Learning-to-rank Transfer Learning

Applications

Text classification Sentiment analysis Image classification Name-entity recognition WiFi localization Spam Filtering . . . Ranking!

Erheng Zhong Cross-Domain Learning-to-rank

slide-24
SLIDE 24

Preliminary Cross-domain Learning-to-Rank Summary Motivations Approach: RankSVM Main Results

Outline

1

Preliminary Ranking Learning-to-rank Transfer Learning

2

Cross-domain Learning-to-Rank Motivations Approach: RankSVM Main Results

Erheng Zhong Cross-Domain Learning-to-rank

slide-25
SLIDE 25

Preliminary Cross-domain Learning-to-Rank Summary Motivations Approach: RankSVM Main Results

Sparsity Problem

No enough labeled data in the current domain. Heterogeneous feature spaces? Text search ⇒ Image search? Out-of-date data? Log data past years ⇒ Search task this year? Heterogeneous tasks? Web page ranking ⇒ Expert finding? . . .

Erheng Zhong Cross-Domain Learning-to-rank

slide-26
SLIDE 26

Preliminary Cross-domain Learning-to-Rank Summary Motivations Approach: RankSVM Main Results

Outline

1

Preliminary Ranking Learning-to-rank Transfer Learning

2

Cross-domain Learning-to-Rank Motivations Approach: RankSVM Main Results

Erheng Zhong Cross-Domain Learning-to-rank

slide-27
SLIDE 27

Preliminary Cross-domain Learning-to-Rank Summary Motivations Approach: RankSVM Main Results

Basic RankSVM

RankSVM [3] is a pairwise approach which aims to learn a linear function f(x) = wTx min

w,ξ

1 2||w||2

2 + λ

  • i,j

ξij (1) s.t. zijwT(xi − xj) ≥ 1 − ξij, ξij ≥ 0, i, j = 1, . . . , ℓ where zij is the binary preference defined as follows, zij =

  • +1

if ti ≻ tj, −1 if ti ≺ tj.

Erheng Zhong Cross-Domain Learning-to-rank

slide-28
SLIDE 28

Preliminary Cross-domain Learning-to-Rank Summary Motivations Approach: RankSVM Main Results

Model-based Transfer: M-SVM

Schölkopf et al. [4] incorporate knowledge from auxiliary domain using biased regularization. min

w,ξ

1 2||w − w0||2

2 + λ

  • i,j

ξij (2) s.t. zij wT(xℓ

i − xℓ j ) ≥ 1 − ξij, ξij ≥ 0, i, j = 1, . . . , ℓ

Erheng Zhong Cross-Domain Learning-to-rank

slide-29
SLIDE 29

Preliminary Cross-domain Learning-to-Rank Summary Motivations Approach: RankSVM Main Results

Instance-based Transfer: I-SVM

Chen et al. [5] pick those relevant instances from auxiliary domain and eliminate others, by adding weights for instances in the auxiliary domain. min

w,ξ,ξ0

1 2||w||2

2 + λ

  • i,j

ξij + λ

  • i,j

ρijξ0

ij

(3) s.t. zij wT(xℓ

i − xℓ j ) ≥ 1 − ξij, ξij ≥ 0, i, j = 1, . . . , ℓ

z0

ij wT(xs i − xs j ) ≥ 1 − ξ0 ij , ξ0 ij ≥ 0, i, j = 1, . . . , s

where ρij is the weight on the labeled data pairs in the auxiliary domain.

Erheng Zhong Cross-Domain Learning-to-rank

slide-30
SLIDE 30

Preliminary Cross-domain Learning-to-Rank Summary Motivations Approach: RankSVM Main Results

Feature-based Transfer: F-SVM

Chen et al. [5] transform instances into a common feature space by learning a projection matrix θ ∈ Rd×d min

w,ξ,ξ0,θ

1 2||w||2

2 + λ

  • i,j

ξij + λ

  • i,j

ξ0

ij

(4) s.t. zij wTθT(xℓ

i − xℓ j ) ≥ 1 − ξij, ξij ≥ 0, i, j = 1, . . . , ℓ

z0

ij wTθT(xs i − xs j ) ≥ 1 − ξ0 ij , ξ0 ij ≥ 0, i, j = 1, . . . , s

Erheng Zhong Cross-Domain Learning-to-rank

slide-31
SLIDE 31

Preliminary Cross-domain Learning-to-Rank Summary Motivations Approach: RankSVM Main Results

Outline

1

Preliminary Ranking Learning-to-rank Transfer Learning

2

Cross-domain Learning-to-Rank Motivations Approach: RankSVM Main Results

Erheng Zhong Cross-Domain Learning-to-rank

slide-32
SLIDE 32

Preliminary Cross-domain Learning-to-Rank Summary Motivations Approach: RankSVM Main Results

Measures

NDCG (Normalized Discounted Cumulative Gain) MAP (Mean Average Precision)

Erheng Zhong Cross-Domain Learning-to-rank

slide-33
SLIDE 33

Preliminary Cross-domain Learning-to-Rank Summary Motivations Approach: RankSVM Main Results

Datasets: Model-based Transfer

Erheng Zhong Cross-Domain Learning-to-rank

slide-34
SLIDE 34

Preliminary Cross-domain Learning-to-Rank Summary Motivations Approach: RankSVM Main Results

Results: Model-based Transfer

Erheng Zhong Cross-Domain Learning-to-rank

slide-35
SLIDE 35

Preliminary Cross-domain Learning-to-Rank Summary Motivations Approach: RankSVM Main Results

Datasets: Feature-based and Instance-based Transfer

Erheng Zhong Cross-Domain Learning-to-rank

slide-36
SLIDE 36

Preliminary Cross-domain Learning-to-Rank Summary Motivations Approach: RankSVM Main Results

Results: Instance-based and Feature-based Transfer

Erheng Zhong Cross-Domain Learning-to-rank

slide-37
SLIDE 37

Preliminary Cross-domain Learning-to-Rank Summary

Summary

M-SVM: Adapt a trained model to fit the data in the target domain. F-SVM: Transform the feature space to well bridge auxiliary and target domains. I-SVM: Leverage relevant instances in the auxiliary domains to increase the training data pool in the target domain. M-SVM is efficient while F-SVM and I-SVM are flexible.

Erheng Zhong Cross-Domain Learning-to-rank

slide-38
SLIDE 38

Preliminary Cross-domain Learning-to-Rank Summary

Summary

Methods Pointwise Pairwise Listwise Model-based

√ Feature-based

√ Instance-based

Y?

Erheng Zhong Cross-Domain Learning-to-rank

slide-39
SLIDE 39

Appendix Reference

Reference I

Tie-Yan Liu Learning to Rank for Information Retrieval. Foundations and Trends in Information Retrieval: 3(3):225-331, 2009 Sinno Jialin Pan and Qiang Yang. A Survey on Transfer Learning. IEEE Transactions on Knowledge and Data Engineering, 22(10):1345-1359, 2010. Thorsten Joachims. Optimizing Search Engines Using Clickthrough Data. Proceedings of the ACM SIGKDD International Conference

  • n Knowledge Discovery and Data Mining (KDD’2002),

133-142, 2002.

Erheng Zhong Cross-Domain Learning-to-rank

slide-40
SLIDE 40

Appendix Reference

Reference II

Bernhard Schölkopf, Ralf Herbrich, and Alex J. Smola. Ranking Model Adaptation for Domain-specific Search. Proceeding of the ACM Conference on Information and Knowledge Management (CIKM’2009), 197-206, 2009. Depin Chen, Yan Xiong, Jun Yan, Gui-Rong Xue, Gang Wang, and Zheng Chen. Knowledge Transfer for Cross Domain Learning to Rank. Information Retrival, 13:236-253, 2010.

Erheng Zhong Cross-Domain Learning-to-rank