EFFICIENT NONMYOPIC ACTIVE SEARCH
Shali Jiang, Gustavo Malkomes, Geoff Converse, Alyssa Shofner, Roman Garnett, Benjamin Moseley Washington University in St. Louis 12.10.16
EFFICIENT NONMYOPIC ACTIVE SEARCH Shali Jiang, Gustavo Malkomes, - - PowerPoint PPT Presentation
EFFICIENT NONMYOPIC ACTIVE SEARCH Shali Jiang, Gustavo Malkomes, Geoff Converse, Alyssa Shofner, Roman Garnett, Benjamin Moseley Washington University in St. Louis 12.10.16 1. ACTIVE SEARCH Finding interesting points Active search 1 In
Shali Jiang, Gustavo Malkomes, Geoff Converse, Alyssa Shofner, Roman Garnett, Benjamin Moseley Washington University in St. Louis 12.10.16
unusual goal: locating as many members of a particular class as possible.
1Garnett, Krishnamurthy, Xiong, Schneider (CMU), Mann (Uppsala).
ICML 2012.
Active Search Active Search 3
Active Search Active Search 4
Active search is Bayesian optimization with binary rewards and cumulative regret.
Active Search Active Search 5
We approach this problem via Bayesian decision theory.
maximizing the expected utility.
Active Search Active Search 6
The natural utility function for this problem is the number of interesting points found.
Active Search Expected utility 7
The optimal policy may be derived by sequentially maximizing the expected utility of the final dataset. With a budget of B, at time t, we select arg max
xt
E
xt
[expected utility starting from point xt].
Active Search Expected utility 8
This may be written recursively: [expected utility starting from point] = [current utility] + [expected utility of point]
+ Eyt
. Automatic dynamic tradeoff between exploration and exploitation!
Active Search Expected utility 9
effectively pretend there is only a small number of
Active Search Expected utility 10
[expected utility starting from point] = [current utility] + [expected utility of point]
+ Eyt
.
Active Search Expected utility 11
[expected utility of next few points] = [current utility] + [expected utility of point]
+ Eyt
. (ℓ is normally 2–3).
Active Search Expected utility 12
Active Search Expected utility 13
Let ℓ, m ∈ N+, ℓ < m. For any q > 0, there exists a search problem P such that ED
> q; that is, the m-step active-search policy can outperform the ℓ-step policy by any arbitrary degree.
Active Search Expected utility 14
selected simultaneously in one big batch.
context (and in this case, exact and efficient).
tradeoff restored!
Active Search Expected utility 15
in the CiteSeer database.
← − − − − − − →
cites/cited by
paper A paper B
Results 17
that single positive observation.
compared with our method (ENS).
Results 18
100 200 300 400 500 50 100 150 200
number of queries number of targets found 1-step 2-step
ENS
Results 19
20 40 60 5 10 15 20
number of queries number of targets found 1-step 2-step
ENS
Results 20
query number policy 100 300 500 700 900
25.5 80.5 141 209 273 two-step 24.9 89.8 155 220 287
ENS–900
25.9 94.3 163 239 308
ENS–700
28.0 105 188 259
ENS–500
28.7 112 189
ENS–300
26.4 105
ENS–100
30.7
Results 21