Cost-sensitive Dynamic Feature Selection e III 1 and Jason Eisner 2 - - PowerPoint PPT Presentation

cost sensitive dynamic feature selection
SMART_READER_LITE
LIVE PREVIEW

Cost-sensitive Dynamic Feature Selection e III 1 and Jason Eisner 2 - - PowerPoint PPT Presentation

Cost-sensitive Dynamic Feature Selection e III 1 and Jason Eisner 2 He He 1 , Hal Daum 1 University of Maryland, College Park 2 Johns Hopkins University June 30, 2012 He He, Hal Daum e III and Jason Eisner Cost-sensitive Dynamic Feature


slide-1
SLIDE 1

Cost-sensitive Dynamic Feature Selection

He He1, Hal Daum´ e III1 and Jason Eisner2

1University of Maryland, College Park 2Johns Hopkins University

June 30, 2012

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 1 / 14

slide-2
SLIDE 2

Dynamic Feature Selection

Feature selection in real life is a sequential decision-making process.

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 2 / 14

slide-3
SLIDE 3

Dynamic Feature Selection

Feature selection in real life is a sequential decision-making process. Observation cost Decision coughing free cold flu H1N1 sore throat free

cold flu

H1N1

headache free cold

flu

H1N1

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 2 / 14

slide-4
SLIDE 4

Dynamic Feature Selection

Feature selection in real life is a sequential decision-making process. Observation cost Decision coughing free cold flu H1N1 sore throat free

cold flu

H1N1

headache free cold

flu

H1N1

temperature (101◦) $1

cold

flu H1N1

nasal swab test (pos.) $10

cold

flu

H1N1

viral culture test (pos.) $50

cold flu

H1N1

molecular test (pos.) $100

cold flu

H1N1

blood test (pos.) $100

cold flu

H1N1

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 2 / 14

slide-5
SLIDE 5

Dynamic Feature Selection

Feature selection in real life is a sequential decision-making process. Observation cost Decision coughing free cold flu H1N1 sore throat free

cold flu

H1N1

headache free cold

flu

H1N1

temperature (101◦) $1

cold

flu H1N1

nasal swab test (pos.) $10

cold

flu

H1N1

viral culture test (pos.) $50

cold flu

H1N1

molecular test (pos.) $100

cold flu

H1N1

blood test (pos.) $100

cold flu

H1N1

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 2 / 14

slide-6
SLIDE 6

Dynamic Feature Selection

Feature selection in real life is a sequential decision-making process. Observation cost Decision coughing free cold flu H1N1 sore throat free

cold flu

H1N1

headache free cold

flu

H1N1

temperature (101◦) $1

cold

flu H1N1

nasal swab test (neg.) $10

cold

flu

H1N1

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 2 / 14

slide-7
SLIDE 7

Dynamic Feature Selection

Feature selection in real life is a sequential decision-making process. Observation cost Decision coughing free cold flu H1N1 sore throat free

cold flu

H1N1

headache free cold

flu

H1N1

temperature (101◦) $1

cold

flu H1N1

nasal swab test (neg.) $10

cold

flu

H1N1

viral culture test (pos.) $50

cold

flu

H1N1

molecular test (pos.) $100

cold flu

H1N1

blood test (pos.) $100

cold flu

H1N1

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 2 / 14

slide-8
SLIDE 8

Cost-sensitive Dynamic Feature Selection

Feature Cost

  • Computation time
  • Data acquisition expense

Dynamic Selection

  • Based on previous selected features and their values
  • Compute features on-the-run

Given a pretrained classifier and feature cost, Goal

  • Sequentially select features for each instance at test time
  • Achieve a user-specified accuracy-cost trade-off

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 3 / 14

slide-9
SLIDE 9

Dynamic Feature Selection as an MDP

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 4 / 14

slide-10
SLIDE 10

Dynamic Feature Selection as an MDP

At time step t, for one example, State st Selected features and their values Action at ∈ At Acquire some features or stop Policy π Map from state to action: π(st) = at Reward r r(st, at) = margin(st, at) − λ · cost(st, at) margin: score of the true class - highest score of other classes λ: trade-off parameter

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 4 / 14

slide-11
SLIDE 11

Imitation Learning

Oracle

  • Demonstrate optimal actions π∗(s) = a∗

t

Agent

  • Learn a policy to mimic the oracle’s behavior
  • π(st) = at

Imitation via Supervised Classification

  • Training examples {(φ(sπ∗), π∗(s))}
  • Feature: φ(s)

label: π∗(s) classifier: ˆ π

  • Minimize a surrogate loss ℓ(s, π) w.r.t. to π∗, e.g. hinge loss in

SVM.

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 5 / 14

slide-12
SLIDE 12

Forward Selection Oracle

  • Select the feature that yields the maximum immediate reward

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 6 / 14

slide-13
SLIDE 13

Forward Selection Oracle

  • Select the feature that yields the maximum immediate reward

r(st, at) = margin(st, at) − cost(st, at) λ = 1, cost scaled to [0, 1], H1N1=positive

  • rder

feat. marg. cost reward 1 coughing, sore throat, headache

  • 0.20

0.00

  • 0.10

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 6 / 14

slide-14
SLIDE 14

Forward Selection Oracle

  • Select the feature that yields the maximum immediate reward

r(st, at) = margin(st, at) − cost(st, at) λ = 1, cost scaled to [0, 1], H1N1=positive

  • rder

feat. marg. cost reward 1 coughing, sore throat, headache

  • 0.20

0.00

  • 0.10

temperature nasal swab test viral culture test molecular test blood test

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 6 / 14

slide-15
SLIDE 15

Forward Selection Oracle

  • Select the feature that yields the maximum immediate reward

r(st, at) = margin(st, at) − cost(st, at) λ = 1, cost scaled to [0, 1], H1N1=positive

  • rder

feat. marg. cost reward 1 coughing, sore throat, headache

  • 0.20

0.00

  • 0.10

temperature (101◦)

  • 0.10

0.01

  • 0.11

nasal swab test viral culture test molecular test blood test

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 6 / 14

slide-16
SLIDE 16

Forward Selection Oracle

  • Select the feature that yields the maximum immediate reward

r(st, at) = margin(st, at) − cost(st, at) λ = 1, cost scaled to [0, 1], H1N1=positive

  • rder

feat. marg. cost reward 1 coughing, sore throat, headache

  • 0.20

0.00

  • 0.10

temperature nasal swab test (pos.) 0.50 0.04 0.46 viral culture test molecular test blood test

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 6 / 14

slide-17
SLIDE 17

Forward Selection Oracle

  • Select the feature that yields the maximum immediate reward

r(st, at) = margin(st, at) − cost(st, at) λ = 1, cost scaled to [0, 1], H1N1=positive

  • rder

feat. marg. cost reward 1 coughing, sore throat, headache

  • 0.20

0.00

  • 0.10

temperature nasal swab test viral culture test (pos.) 0.60 0.19 0.41 molecular test blood test

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 6 / 14

slide-18
SLIDE 18

Forward Selection Oracle

  • Select the feature that yields the maximum immediate reward

r(st, at) = margin(st, at) − cost(st, at) λ = 1, cost scaled to [0, 1], H1N1=positive

  • rder

feat. marg. cost reward 1 coughing, sore throat, headache

  • 0.20

0.00

  • 0.10

temperature nasal swab test viral culture test molecular test (pos.) 0.70 0.38 0.32 blood test

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 6 / 14

slide-19
SLIDE 19

Forward Selection Oracle

  • Select the feature that yields the maximum immediate reward

r(st, at) = margin(st, at) − cost(st, at) λ = 1, cost scaled to [0, 1], H1N1=positive

  • rder

feat. marg. cost reward 1 coughing, sore throat, headache

  • 0.20

0.00

  • 0.10

temperature nasal swab test viral culture test molecular test blood test (pos.) 0.65 0.38 0.27

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 6 / 14

slide-20
SLIDE 20

Forward Selection Oracle

  • Select the feature that yields the maximum immediate reward

r(st, at) = margin(st, at) − cost(st, at) λ = 1, cost scaled to [0, 1], H1N1=positive

  • rder

feat. marg. cost reward 1 coughing, sore throat, headache

  • 0.20

0.00

  • 0.10

temperature 2 nasal swab test (pos.) 0.50 0.04 0.46 viral culture test molecular test blood test

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 6 / 14

slide-21
SLIDE 21

Forward Selection Oracle

  • Select the feature that yields the maximum immediate reward

r(st, at) = margin(st, at) − cost(st, at) λ = 1, cost scaled to [0, 1], H1N1=positive

  • rder

feat. marg. cost reward 1 coughing, sore throat, headache

  • 0.20

0.00

  • 0.10

3 temperature (101◦) 0.55 0.05 0.50 2 nasal swab test (pos.) 0.50 0.04 0.46 viral culture test molecular test blood test

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 6 / 14

slide-22
SLIDE 22

Forward Selection Oracle

  • Select the feature that yields the maximum immediate reward

r(st, at) = margin(st, at) − cost(st, at) λ = 1, cost scaled to [0, 1], H1N1=positive

  • rder

feat. marg. cost reward 1 coughing, sore throat, headache

  • 0.20

0.00

  • 0.10

3 temperature (101◦) 0.55 0.05 0.50 2 nasal swab test (pos.) 0.50 0.04 0.46 4 viral culture test (pos.) 0.80 0.24 0.56 molecular test blood test

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 6 / 14

slide-23
SLIDE 23

Forward Selection Oracle

  • Select the feature that yields the maximum immediate reward

r(st, at) = margin(st, at) − cost(st, at) λ = 1, cost scaled to [0, 1], H1N1=positive

  • rder

feat. marg. cost reward 1 coughing, sore throat, headache

  • 0.20

0.00

  • 0.10

3 temperature (101◦) 0.55 0.05 0.50 2 nasal swab test (pos.) 0.50 0.04 0.46 4 viral culture test (pos.) 0.80 0.24 0.56 molecular test 5 blood test (pos.) 0.90 0.62 0.28

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 6 / 14

slide-24
SLIDE 24

Forward Selection Oracle

  • Select the feature that yields the maximum immediate reward

r(st, at) = margin(st, at) − cost(st, at) λ = 1, cost scaled to [0, 1], H1N1=positive

  • rder

feat. marg. cost reward 1 coughing, sore throat, headache

  • 0.20

0.00

  • 0.10

3 temperature (101◦) 0.55 0.05 0.50 2 nasal swab test (pos.) 0.50 0.04 0.46 4 viral culture test (pos.) 0.80 0.24 0.56 6 molecular test (pos.) 0.95 1.00

  • 0.05

5 blood test (pos.) 0.90 0.62 0.28

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 6 / 14

slide-25
SLIDE 25

Forward Selection Oracle

  • Select the feature that yields the maximum immediate reward
  • Stop in the global maximum-reward state

free → nasal swab test → temperature → viral culture test → stop r(st, at) = margin(st, at) − cost(st, at) λ = 1, cost scaled to [0, 1], H1N1=positive

  • rder

feat. marg. cost reward 1 coughing, sore throat, headache

  • 0.20

0.00

  • 0.10

3 temperature (101◦) 0.55 0.05 0.50 2 nasal swab test (pos.) 0.50 0.04 0.46 4 viral culture test (pos.) 0.80 0.24 0.56 6 molecular test (pos.) 0.95 1.00

  • 0.05

5 blood test (pos.) 0.90 0.62 0.28

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 6 / 14

slide-26
SLIDE 26

Forward Selection Oracle

  • Select the feature that yields the maximum immediate reward
  • Stop in the global maximum-reward state

free → nasal swab test → temperature → viral culture test → stop

  • Have access to the ground truth, only available during training

r(st, at) = margin(st, at) − cost(st, at) λ = 1, cost scaled to [0, 1], H1N1=positive

  • rder

feat. marg. cost reward 1 coughing, sore throat, headache

  • 0.20

0.00

  • 0.10

3 temperature (101◦) 0.55 0.05 0.50 2 nasal swab test (pos.) 0.50 0.04 0.46 4 viral culture test (pos.) 0.80 0.24 0.56 6 molecular test (pos.) 0.95 1.00

  • 0.05

5 blood test (pos.) 0.90 0.62 0.28

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 6 / 14

slide-27
SLIDE 27

Policy Features

At step 2, compute φ(s2) feat. cold flu H1N1 cost free 0.20 0.50 0.30 0.00 swab 0.04 0.23 0.73 0.04

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 7 / 14

slide-28
SLIDE 28

Policy Features

  • selected features

e.g. free = coughing, sore throat, headache; nasal swab test = pos. At step 2, compute φ(s2) feat. cold flu H1N1 cost free 0.20 0.50 0.30 0.00 swab 0.04 0.23 0.73 0.04

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 7 / 14

slide-29
SLIDE 29

Policy Features

  • selected features

e.g. free = coughing, sore throat, headache; nasal swab test = pos.

  • confidence score and its change

e.g. 0.04, 0.23, 0.73; −0.16, −0.27, 0.43 At step 2, compute φ(s2) feat. cold flu H1N1 cost free 0.20 0.50 0.30 0.00 swab 0.04 0.23 0.73 0.04

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 7 / 14

slide-30
SLIDE 30

Policy Features

  • selected features

e.g. free = coughing, sore throat, headache; nasal swab test = pos.

  • confidence score and its change

e.g. 0.04, 0.23, 0.73; −0.16, −0.27, 0.43

  • Does the guess change?

e.g. Yes At step 2, compute φ(s2) feat. cold flu H1N1 cost free 0.20 0.50 0.30 0.00 swab 0.04 0.23 0.73 0.04

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 7 / 14

slide-31
SLIDE 31

Policy Features

  • selected features

e.g. free = coughing, sore throat, headache; nasal swab test = pos.

  • confidence score and its change

e.g. 0.04, 0.23, 0.73; −0.16, −0.27, 0.43

  • Does the guess change?

e.g. Yes

  • cost and its change

e.g. 0.04; 0.04 At step 2, compute φ(s2) feat. cold flu H1N1 cost free 0.20 0.50 0.30 0.00 swab 0.04 0.23 0.73 0.04

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 7 / 14

slide-32
SLIDE 32

Policy Features

  • selected features

e.g. free = coughing, sore throat, headache; nasal swab test = pos.

  • confidence score and its change

e.g. 0.04, 0.23, 0.73; −0.16, −0.27, 0.43

  • Does the guess change?

e.g. Yes

  • cost and its change

e.g. 0.04; 0.04

  • cost divided by confidence score

e.g. 1.00, 5.75, 18.25 At step 2, compute φ(s2) feat. cold flu H1N1 cost free 0.20 0.50 0.30 0.00 swab 0.04 0.23 0.73 0.04

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 7 / 14

slide-33
SLIDE 33

Policy Features

  • selected features

e.g. free = coughing, sore throat, headache; nasal swab test = pos.

  • confidence score and its change

e.g. 0.04, 0.23, 0.73; −0.16, −0.27, 0.43

  • Does the guess change?

e.g. Yes

  • cost and its change

e.g. 0.04; 0.04

  • cost divided by confidence score

e.g. 1.00, 5.75, 18.25

  • current guess

e.g. H1N1 At step 2, compute φ(s2) feat. cold flu H1N1 cost free 0.20 0.50 0.30 0.00 swab 0.04 0.23 0.73 0.04

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 7 / 14

slide-34
SLIDE 34

Imitation Learning via Classification

sπ: states visited by π T: task horizon J(π): task loss (negative reward) of π

Theorem

Ross & Bagnell (2010) Let Esπ∗[ℓ(s, π)] = ǫ, then J(π) ≤ J(π∗) + T 2ǫ.

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 8 / 14

slide-35
SLIDE 35

Imitation Learning via Classification

sπ: states visited by π T: task horizon J(π): task loss (negative reward) of π

Theorem

Ross & Bagnell (2010) Let Esπ∗[ℓ(s, π)] = ǫ, then J(π) ≤ J(π∗) + T 2 ǫ. Why do we have quadratically increasing loss?

  • He He, Hal Daum´

e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 8 / 14

slide-36
SLIDE 36

Imitation Learning via Classification

sπ: states visited by π T: task horizon J(π): task loss (negative reward) of π

Theorem

Ross & Bagnell (2010) Let Esπ∗[ℓ(s, π)] = ǫ, then J(π) ≤ J(π∗) + T 2 ǫ. Why do we have quadratically increasing loss?

  • Trains only under states the oracle visited
  • Ignores the difference between the oracle’s and the agent’s state

distribution

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 8 / 14

slide-37
SLIDE 37

Dataset Aggregation (DAgger) (Ross et al. (2011))

Let π1 = π∗. In iteration i, execute policy πi and collect dataset Di = {(φ(sπi), π∗(s))}; learn πi+1 from the aggregated dataset D1 D2 · · · Di. Return the best policy evaluated on validation set.

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 9 / 14

slide-38
SLIDE 38

Dataset Aggregation (DAgger) (Ross et al. (2011))

Let π1 = π∗. In iteration i, execute policy πi and collect dataset Di = {(φ(sπi), π∗(s))}; learn πi+1 from the aggregated dataset D1 D2 · · · Di. Return the best policy evaluated on validation set. Qπ′

t (s, π): t-step cost of executing π initially then running π′

ǫN = minπ∈Π 1

N

N

i=1 Esπi [ℓ(s, π)]

Theorem

Ross et al. (2011) For DAgger, if Qπ∗

T−t+1(s, π) − Qπ∗ T−t+1(s, π∗) ≤ u and

N is ˜ O(uT), there is a policy π ∈ π1:N s.t. J(π) ≤ J(π∗) + uTǫN + O(1).

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 9 / 14

slide-39
SLIDE 39

Coaching

The oracle’s policy can be too good to learn!

  • Far from the agent’s learning space
  • Policy features are insufficient

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 10 / 14

slide-40
SLIDE 40

Coaching

The oracle’s policy can be too good to learn!

  • Far from the agent’s learning space

use kernels

  • Policy features are insufficient

more descriptive features

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 10 / 14

slide-41
SLIDE 41

Coaching

The oracle’s policy can be too good to learn!

  • Far from the agent’s learning space

use kernels Overhead!

  • Policy features are insufficient

more descriptive features

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 10 / 14

slide-42
SLIDE 42

Coaching

The oracle’s policy can be too good to learn!

  • Far from the agent’s learning space

use kernels Overhead!

  • Policy features are insufficient

more descriptive features “Hope” action (McAllester et al. (2010); Chiang et al. (2009))

Combines the current policy’s preference and the reward: ˜ a∗

t

= arg max

a∈At

η · scoreπi(a) + r(st, a) instead of a∗

t

= arg max

a∈At

r(st, a)

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 10 / 14

slide-43
SLIDE 43

Experimental Results

Baselines (|w|/cost, Forward): add feature statically from a ranked list Trade-off: λ = (0, 0.1, 0.25, 0.5, 1, 1.5, 2) Coaching: initialize η to 1 and decrease by e−1 in each iteration

0.0 0.2 0.4 0.6 0.8 1.0

average cost per example

0.60 0.65 0.70 0.75 0.80 0.85 0.90 0.95 1.00

accuracy

|w|/cost Forward DAgger Coaching Oracle

Figure: Radar (binary).

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 11 / 14

slide-44
SLIDE 44

Experimental Results

0.0 0.2 0.4 0.6 0.8 1.0

average cost per example

0.4 0.5 0.6 0.7 0.8 0.9

accuracy

|w|/cost Forward DAgger Coaching Oracle

Figure: Digit (10 classes).

0.0 0.2 0.4 0.6 0.8 1.0

average cost per example

0.60 0.65 0.70 0.75 0.80 0.85 0.90

accuracy

|w|/cost Forward DAgger Coaching Oracle

Figure: Segmentation (7 classes).

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 12 / 14

slide-45
SLIDE 45

Conclusion and Future Work

Conclusion

  • Feature selection as an MDP
  • Imitation learning techniques
  • Iterative policy training
  • Coaching as a ”local update”

method Future Work

  • Include feature dependency using

feature templates

  • Learn feature weights jointly with

the policy

  • Apply to ensemble learning (select

model dynamically)

  • Structured prediction problem

where − policy features might require inference under features selected so far − feature cost may need to be inferred at runtime

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 13 / 14

slide-46
SLIDE 46

Bibliography

Chiang, David, Knight, Kevin, and Wang, Wei. 11,001 new features for statistical machine translation. In NAACL-HLT, 2009. McAllester, D., Hazan, T., and Keshet, J. Direct loss minimization for structured prediction. In NIPS, 2010. Ross, St´ ephane and Bagnell, J. Andrew. Efficient reductions for imitation

  • learning. In AISTATS, 2010.

Ross, St´ ephane., Gordon, Geoffrey J., and Bagnell, J. Andrew. A reduction

  • f imitation learning and structured prediction to no-regret online
  • learning. In AISTATS, 2011.

He He, Hal Daum´ e III and Jason Eisner Cost-sensitive Dynamic Feature Selection June 30, 2012 14 / 14