Top PDF discounted Markov decision processes

Simplex Algorithm for Countable state Discounted Markov Decision Processes

... We consider discounted Markov Decision Processes (MDPs) with countably-infinite state spaces, finite action spaces, and unbounded rewards. Typical examples of such MDPs are inventory ...

36

Strategy improvement algorithm for singularly perturbed discounted Markov decision processes

... perturbed Markov decision process with the discounted reward ...and discounted factor are perturbed ...irreducible processes. We introduce the limit Markov control problem which ...

7

Strategy iteration algorithms for games and Markov decision processes

... bound holds for games and for MDPs. For many years, people were unable to find examples upon which strategy improvement equipped with the greedy policy took significantly more than a linear number of iterations to ...

226

Variance Optimization for Continuous Time Markov Decision Processes

... the discounted MDP in infinite stage and the average reward problem in infinite stage [2] ...the decision maker’s expected reward is often assumed to be a constant, and then the investor chooses a policy ...

15

Randomized and Relaxed Strategies in Continuous-Time Markov Decision Processes

... The following remark explains the novelty of the current work and its connection to the previous results and the known methods. As was mentioned (see also section 5), the discounted cost is a special case of the ...

31

Policy-Gradient Algorithms for Partially Observable Markov Decision Processes

... around the correct phoneme, the more likely the agent is to receive a positive reward. The effect of past transitions on the current Viterbi probabilities become progressively smaller as time goes on, possibly at an ...

303

Some contributions to Markov decision processes

... standard discounted MDP model can be equivalently viewed as an undiscounted MDP ...general discounted MDP model with a state-action-dependent ...the discounted MDP model would immediately follow ...

160

Markov Decision Processes and Approximate Dynamic Programming Methods for Optimal Treatment Design.

... Maxwell et al. [67] use an ADP approach to determine the best strategy for dynamic repositioning of ambulances in metropolitan areas in order to maximize the number of calls reached within a designated length of time. ...

149

ON THE FIRST PASSAGE g-MEAN-VARIANCE OPTIMALITY FOR DISCOUNTED CONTINUOUS-TIME MARKOV DECISION PROCESSES

... on discounted continuous-time MDPs in a finite or countable state space and with a bounded reward ...risk-averse decision maker might prefer a policy with a reasonable mean performance g (not necessarily the ...

19

Discrete Time Hybrid Decision Processes: The Discounted Case

... a Markov-type hybrid process from stochastic kernel and credibilistic ker- ...hybrid processes in the near future, it is meaningful to consider the case where the behavior of hybrid processes given ...

5

What if the World Were Different? Gradient-Based Exploration for New Optimal Policies

... as Markov decision processes, this problem can be modeled as a constrained optimization problem, in which the agent balances the benefits arising from changing the world with the potential costs ...

14

Approximate Newton Methods for Policy Search in Markov Decision Processes

... An avenue of research that has received less attention is the application of Newton’s method to Markov decision processes. Although such an extension of the GPOMDP algorithm is provided in the work ...

51

A POMDP Framework to Find Optimal Inspection and Maintenance Policies via Availability and Profit Maximization for Manufacturing Systems

... to decision maker's ...Observable Markov Decision Process (POMDP) framework for a partially observable and stochastically deteriorating system in which inspection and maintenance optimal policies of ...

8

Continuous Observation Partially Observable Semi Markov Decision Processes for Machine Maintenance

Partially Observable Markov Decision Processes for Prostate Cancer Screening.

Continuous-observation partially observable semi-Markov decision processes for machine maintenance

A Bayesian Approach for Learning and Planning in Partially Observable Markov Decision Processes

Adaptive Layer Approach For Power Management In Wireless Communication

Robust Approximate Bilinear Programming for Value Function Approximation

Optimal Control of Customers to the Service Facility with Two Types of Customers
