Top PDF Related Work – Markov Decision Processes and Control

Markov Decision Processes for Control of a Sensor Network-based Health Monitoring System

... Related Work Markov decision processes have been used elsewhere for control of physical ...our work, the ability to compute the entire policy offline and thus make only ...

6

Markov Decision Processes

... 1 We use the terms agent, environment, and action instead of the engineers’ terms controller, controlled system (or plant), and control signal because they are meaningful to a wider audience. 2 We restrict ...

15

Sufficient Markov Decision Processes.

... 3.2 Related Work Trafficking detection: There have been several softwares developed to combat human trafficking using statistics and machine learning on online ...our work is the Human Trafficking ...

121

Configurable Markov Decision Processes

... both Markov decision processes with imprecise probabilities and non-stationary Markov decision processes do not admit the possibility to dynamically alter the environmental ...

10

Robust Control of Uncertain Markov Decision Processes with Temporal Logic Specifications

... robust control policy that maximizes the worst-case probability of satisfying the specification over all transition matrices in the uncertainty ...robust control policy on the original dynamical ...future ...

10

Robust, risk-sensitive, and data-driven control of Markov Decision Processes

... Specifically, we estimate the expected performance of any given policy (and its gradient with respect to certain policy parameters) from a training set comprising obser ...

211

Results in stochastic control: optimal prediction problems and Markov decision processes

... discrete-time Markov decision processes, and summarizes a set of results of use in order to analyse an MDP derived from a portfolio optimization problem in Chapter ...the work of Bäuerle and Rieder ...

142

Risk-sensitive Markov Decision Processes

... gradual-impulse control problem of continuous-time Markov decision processes, where the system performance is measured by the expectation of the exponential utility of the total ...

183

Some contributions to Markov decision processes

... present work we provide reasonably verifiable conditions, which, on the one hand, guarantee the transformed undiscounted model to be absorbing, and on the other hand, also allow the state-action-dependent discount ...

160

One-Counter Markov Decision Processes

... with control, and thus to ...existing work in the QBD literature on MDPs does not establish any results about the computational complexity, or even decidability, of basic analysis problems for general ...

36

Scalable Verification of Markov Decision Processes

... 2 Related Work The Kearns algorithm [13] is the classic ‘sparse sampling algorithm’ for large, infinite horizon, discounted ...can work with large, potentially infinite state MDPs because it explores ...

13

Compositional Reasoning for Markov Decision Processes

... future work to provide a coinductive formulation of the preorder and study its logical ...of Markov decision processes particularly in the presence of ...for Markov chains; see Chapter 10 of ...

16

Hedging Bets in Markov Decision Processes

... optimal control, finance, and robotic motion planning, it cannot capture scenarios where there are multiple objectives to optimize but only one of these gets realized in the ...

20

Markov decision processes with uncertain parameters

... Theoretical results In the beginning, we pursued the theoretical implications of uncertainty in Markov decision processes, including the aforementioned models, but also parametric formalisms and ...

136

Augmenting Markov Decision Processes with Advising

... Robots are given missions such as live observations (camera feed, establishing maps, monitoring threats) and situational actions (e.g. open a door, disarm traps). The robots we work with are NERVA robots ...

8

Multiple-Environment Markov Decision Processes

... a control strategy that exhibits good performances under several hypotheses formalized by different models for the environment, and those environments may not be distinguishable or we may not want to distinguish ...

13

Solving Hybrid Markov Decision Processes

... 5 Conclusions and Future Work In this paper, a novel approach for solving continuous and hybrid MDPs is described. In the first phase we use an exploration strategy of the environment and a machine learning ...

11

Bounded-parameter Markov decision processes

... 8. Related work and conclusions Our definition for bounded-parameter MDPs is related to a number of other ideas appearing in the literature on Markov decision processes; in the ...

39

Structural Results for Constrained Markov Decision Processes

... INTRODUCTION Markov Decision Processes (MDPs) have proven to be a useful tool in modelling the dynamic control of service ...formulating control problems as MDPs can be beneficial in ...

159

Policy gradient in Lipschitz Markov Decision Processes

... automatic control problems, natural resource management, ...the Markov Decision Process (MDP) and the policy model ...constant related to each component of the gradient and we show how such ...

29
