Policy gradient
Policy Gradient in Continuous Time
21
Chinese Grammatical Error Diagnosis Based on Policy Gradient LSTM Model
6
Learning of Soccer Player Agents Using a Policy Gradient Method: Pass Selection
5
Policy-Gradient Algorithms for Partially Observable Markov Decision Processes
303
Large-Scale Interactive Recommendation with Tree-Structured Policy Gradient
9
Diverse Exploration via Conjugate Policies for Policy Gradient Methods
8
Multi Task Semantic Dependency Parsing with Policy Gradient for Learning Easy First Strategies
11
Policy Gradient as a Proxy for Dynamic Oracles in Constituency Parsing
8
Robust Multi-Agent Reinforcement Learning via Minimax Deep Deterministic Policy Gradient
8
Comparing policy gradient and value function based reinforcement learning methods in simulated electrical power trade
8
Bayesian Policy Gradient and Actor-Critic Algorithms
53
Policy Gradient Methods: Variance Reduction and Stochastic Convergence
224
Temporal difference Learning with Sampling Baseline for Image Captioning
8
Towards Coherent and Cohesive Long form Text Generation
11
A kernel based true online Sarsa(λ) for continuous space control problems
16
Risk-Constrained Reinforcement Learning with Percentile Risk Criteria
51
Comprehensive Review of Deep Reinforcement Learning Methods and Applications in Economics
43
Research and Application of the Novel Deep Plugging Method in the Oilfield
10
The weighted gradient: A color image gradient applied to morphological segmentation
11
The gradient of a graph
17