Bandit Problems
The consequences of behavioural bias: Bandit problems and product liability law
243
Optimistic Bayesian Sampling in Contextual-Bandit Problems
38
MODIFIED ACTION VALUE METHOD APPLIED TO ‘n’-ARMED BANDIT PROBLEMS USING REINFORCEMENT LEARNING
7
Klein, Nicolas (2010): Learning and Experimentation in Strategic Bandit Problems. Dissertation, LMU München: Volkswirtschaftliche Fakultät
149
Optimal Policies for Observing Time Series and Related Restless Bandit Problems
93
Mechanisms with learning for stochastic multi armed bandit problems
44
Approximations of the Restless Bandit Problem
37
BinaryBandit:An Efficient Julia Package for Optimization and Evaluation of the Finite Horizon Bandit Problem with Binary Responses
15
Exploration vs Exploitation with Partially Observable Gaussian Autoregressive Arms
8
Parallelizing Exploration-Exploitation Tradeoffs in Gaussian Process Bandit Optimization
51
Optimizing Adaptive Marketing Experiments with the Multi-Armed Bandit
148
Regret Bounds and Minimax Policies under Partial Monitoring
52
Counterfactual Learning from Bandit Feedback under Deterministic Logging : A Case Study in Statistical Machine Translation
11
On Bandit Organizations and Their (IL)Legitimacy: Concept Development and Illustration
45
The Sample Complexity of Exploration in the Multi-Armed Bandit Problem
26
Bandit Structured Prediction for Neural Sequence to Sequence Learning
11
Kernel Estimation and Model Combination in A Bandit Problem with Covariates
37
Hyperband: A Novel Bandit-Based Approach to Hyperparameter Optimization
52
LIMSI Submission for WMT’17 Shared Task on Bandit Learning
6
A multi-armed bandit approach for exploring partially observed networks
18