The Bandit
Bandit Learning with Concurrent Transmissions for Energy-Efficient Flooding in Sensor Networks
14
Approximations of the Restless Bandit Problem
37
Bandit learning in concave N player games
11
Using Confidence Bounds for Exploitation-Exploration Trade-offs
26
On Multilabel Classification and Ranking with Bandit Feedback
37
Profile-Based Bandit with Unknown Profiles
40
Uncertainties Related To Structural Model Outputs As A Function Of The Engineering Demand Parameter And Of The Computational Method
10
On the Complexity of Best-Arm Identification in Multi-Armed Bandit Models
42
Regret Bounds and Minimax Policies under Partial Monitoring
52
Kernel Estimation and Model Combination in A Bandit Problem with Covariates
37
On Bandit Organizations and Their (IL)Legitimacy: Concept Development and Illustration
45
Training a Quantum Neural Network to Solve the Contextual Multi Armed Bandit Problem
11
Bandit Structured Prediction for Neural Sequence to Sequence Learning
11
A multi-arm bandit neighbourhood search for routing and scheduling problems
34
Towards an Improved Strategy for Solving Multi Armed Bandit Problem
5
Counterfactual Learning from Bandit Feedback under Deterministic Logging: A Case Study in Statistical Machine Translation
11
The consequences of behavioural bias: Bandit problems and product liability law
243
A multi-armed bandit approach for exploring partially observed networks
18
LIMSI Submission for WMT’17 Shared Task on Bandit Learning
6
BinaryBandit: An Efficient Julia Package for Optimization and Evaluation of the Finite Horizon Bandit Problem with Binary Responses
15