Multi-armed Bandit Policies
Lower Bounds and Selectivity of Weak-Consistent Policies in Stochastic Multi-Armed Bandit Problem
21
Optimizing Adaptive Marketing Experiments with the Multi-Armed Bandit
148
The Sample Complexity of Exploration in the Multi-Armed Bandit Problem
26
The Non-stationary Stochastic Multi-armed Bandit Problem
21
Slow Fading Channel Selection: A Restless Multi-Armed Bandit Formulation
5
Customer Acquisition via Display Advertising Using Multi-Armed Bandit Experiments
69
Selecting Multiple Web Adverts - a Contextual Multi-armed Bandit with State Uncertainty
31
Action Elimination and Stopping Conditions for the Multi-Armed Bandit and Reinforcement Learning Problems
27
Transfer restless multi armed bandit policy for energy efficient heterogeneous cellular network
19
Transfer Restless Multi-Armed Bandit Policy for Energy Efficient Heterogeneous Cellular Network
28
Investigación Operativa. Multi-armed restless bandits, index policies, and dynamic priority allocation
10
Algorithms for the multi-armed bandit problem
32
Monotone multi-armed bandit allocations
5
Multi-armed Bandit Problems with History
11
Mechanisms with learning for stochastic multi armed bandit problems
44
Scalable Discrete Sampling as a Multi-Armed Bandit Problem
17
Towards an Improved Strategy for Solving Multi Armed Bandit Problem
5
A multi-armed bandit approach for exploring partially observed networks
18
muMAB. A multi-armed bandit model for wireless network selection
22
On the Complexity of Best-Arm Identification in Multi-Armed Bandit Models
42