Setting the Markov Decision Processes (MDP) Algorithms

Strategy iteration algorithms for games and Markov decision processes

... For Markov Decision Processes We will begin by describing the optimality equations for the average-reward criterion in ...uni-chain setting, the structure of the MDP guarantees that every ...

226

Efficient algorithms for budget-constrained Markov decision processes

... action-space Markov decision processes (MDPs) form a classical topic in control, game theory, and learning, and as a result are widely applied, increasingly, in very large-scale ... Many ...

7

Markov Decision Processes

... Figure 3.1: The agent–environment interaction in a Markov decision process. More specifically, the agent and environment interact at each of a sequence of discrete time steps, t = 0, 1, 2, 3, ...

15

Sufficient Markov Decision Processes.

... Data-driven decision support systems are being deployed across a wide range of application domains including medicine, engineering, and ...data-driven decision problems with an infinite or indefinite time ...

121

Configurable Markov Decision Processes

... both Markov decision processes with imprecise probabilities and non-stationary Markov decision processes do not admit the possibility to dynamically alter the environmental ...

10

Robust Markov Decision Processes

... Although transition sampling has theoretical appeal, it is often prohibitively costly or even infeasible in practice. To obtain independent samples for each state-action pair, one needs to repeatedly direct the MDP into ...

48

Some contributions to Markov decision processes

... For every t = 0, 1, 2, ..., let H_t denote the space of admissible trajectories up to time t. To put it precisely, H_0 := S, and H_t := K × H_{t−1} when t ≥ 1. Generally speaking, an MDP is a discrete-time ...

160

One-Counter Markov Decision Processes

... given probability. Such questions are similar in spirit to questions asked in the rich literature on “adversarial queueing theory” (see, e.g., [4]), although this is a somewhat different setting. These ...

36

Scalable Verification of Markov Decision Processes

... Introduction Markov decision processes (MDP) describe systems that interleave nondeterministic actions and probabilistic transitions, possibly with rewards or costs assigned to the actions ...

13

Compositional Reasoning for Markov Decision Processes

... The rest of this paper is organised as follows. In Section 2 we introduce the model of weighted MDPs, the notation of hyper-derivations and some important properties. Then we define a behavioural preorder based on ...

16

Hedging Bets in Markov Decision Processes

... 2 Model We describe here the model of Markov decision processes with alternative objectives (MDPAO). As in a traditional MDP, the process consists of a set of states and a set of actions. At a state, ...

20

Markov decision processes with uncertain parameters

... a Markov decision process with coupling constraints, which is often motivated by combining several MDPs into ...combined decision of all agents ...

136

Augmenting Markov Decision Processes with Advising

... for setting the balance between “following desirable” and “avoiding undesirable”, a slider for balancing the importance between achieving goals and the tradeoff ...

8

Multiple-Environment Markov Decision Processes

... Markov decision processes (MDP) are a standard formalism for modeling systems that exhibit both stochastic and non-deterministic ... far. Algorithms for finite state MDPs are known for a ...

13

Solving Hybrid Markov Decision Processes

... Abstract. Markov decision processes (MDPs) have developed as a standard for representing uncertainty in decision-theoretic planning. However, MDPs require an explicit representation of the ...

11

Bounded-parameter Markov decision processes

... on Markov decision processes; in the following, we mention just a few of the closest such ...expensive algorithms for ...special algorithms to exploit this ...

39

Analysis and Simplex-type Algorithms for Countably Infinite Linear Programming Models of Markov Decision Processes.

... two algorithms converge to ...those algorithms proceed, they require transition probabilities and rewards in more ...the algorithms require to ...

172

Structural Results for Constrained Markov Decision Processes

... INTRODUCTION Markov Decision Processes (MDPs) have proven to be a useful tool in modelling the dynamic control of service ...inform decision-makers of the class of policies they should ...

159

Compositional reasoning for weighted Markov decision processes

... Weighted Markov decision processes (MDPs) have long been used to model quantitative aspects of systems in the presence of uncertainty. However, much of the literature on such MDPs takes a monolithic ...

43

Hazard Avoidance Alerting With Markov Decision Processes

... particular Markov state and transition model might describe its behavior well most of the time, but badly on rare occasions, such as when a failure occurs in the ...

141
