Value and Policy Iteration
Value-Iteration Based Fitted Policy Iteration: Learning with a Single Trajectory
9
Value-iteration based fitted policy iteration: learning with a single trajectory
8
CertRL: Formalizing Convergence Proofs for Value and Policy Iteration in Coq
55
Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming
32
On policy iteration as a Newton’s method and polynomial policy iteration algorithms
6
Policy Iteration for Factored MDPs
9
Approximate Modified Policy Iteration
22
Least-Squares Policy Iteration
43
Approximate Modified Policy Iteration
9
Policy Iteration (Ch. 17.3)
28
Least-Squares Methods for Policy Iteration
38
Rollout, Approximate Policy Iteration, and Distributed Reinforcement Learning. Chapter 4 Approximate Policy Iteration for Infinite Horizon Problems
40
Approximate Policy Iteration for Markov Control Revisited
6
Analysis of Classification-based Policy Iteration Algorithms
30
Regularized Policy Iteration with Nonparametric Function Spaces
66
The divergence of reinforcement learning algorithms with value-iteration and function approximation
9
Approximate policy iteration: A survey and some new methods
50
Finite-Sample Analysis of Least-Squares Policy Iteration
34
Approximate Policy Iteration for Semi-Markov Control Revisited
7