SLIDE 20
5.3. Growing proportion of devices dynamic devices
Dependence on D/(S + D)
Proportion of dynamic devices (%)
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9
Gain compared to random channel selection
0.02 0.04 0.06 0.08 0.1 0.12 0.14 0.16 Optimal strategy
UCB1, α=0.5 Thomson-sampling
Figure 4: Almost optimal, for any proportion of dynamic devices, after a short learning time. Up-to 16% gain over the naive approach!
Lilian Besson (CentraleSupélec & Inria) MAB Learning in IoT Networks CROWNCOM 2017 15 / 18