two-armed bandit problem
The Finite Horizon Two Armed Bandit Problem with Binary Responses: A Multidisciplinary Survey of the History, State of the Art, and Myths
45
Multi-Armed Bandit Algorithms for a Mobile Service Robot's Spare Time in a Structured Environment
13
Lower Bounds and Selectivity of Weak-Consistent Policies in Stochastic Multi-Armed Bandit Problem
21
Multi-Armed Bandit in Action: Optimizing Performance in Dynamic Hybrid Networks
14
Training a Quantum Neural Network to Solve the Contextual Multi Armed Bandit Problem
11
Transfer restless multi armed bandit policy for energy efficient heterogeneous cellular network
19
How To Solve Two Sided Bandit Problems
6
Kernel Estimation and Model Combination in A Bandit Problem with Covariates
37
A multi-armed bandit approach for batch mode active learning on information networks
40
Enhancing Evolutionary Conversion Rate Optimization via Multi-Armed Bandit Algorithms
8
Combinatorial Multi-Armed Bandit and Its Extension to Probabilistically Triggered Arms
33
Reward Maximization Under Uncertainty: Leveraging Side-Observations on Networks
34
The Sample Complexity of Exploration in the Multi-Armed Bandit Problem
26
Overlapping Multi-Bandit Best Arm Identification
5
Towards an Improved Strategy for Solving Multi Armed Bandit Problem
5
Modified Action Value Method Applied to ‘n’-Armed Bandit Problems Using Reinforcement Learning
7
Cost sensitive decision tree learning using a multi armed bandit framework
201
Action Elimination and Stopping Conditions for the Multi-Armed Bandit and Reinforcement Learning Problems
27
The effectiveness of a low intensity problem solving intervention for common adolescent mental health problems in New Delhi, India: protocol for a school based, individually randomized controlled trial with an embedded stepped wedge, cluster randomized controlled recruitment trial
18
Learning Structured Predictors from Bandit Feedback for Interactive NLP
11