` MDPs @ CS department

Markov decision processes

Markov decision processes (MDPs) offer an elegant mathematical framework for representing planning and decision problems in the presence of uncertainty. However, a simple textbook MDP uses discrete state, discrete time and it does not consider structure when modeling the process dynamics. Such a representation is very limited in its scope to represent real-world domains, which are often factorized, include continuous quantities (such as, temperature, speed, position, etc.) and/or imperfect observations. The aim of our research is (1) to devise MDP models that offer more natural representations of complex real-world decision problems, and (2) to develop algorithmic solutions that let us solve these problems more efficiently.

Research projects:

Our most recent research work focuses on the development of Approximate Linear Programming (ALP) methods for solving large factored MDP with continuous or hybrid state and action spaces. We have experimentally shown that we can solve large temporal optimization problems with high-dimensional state-spaces and outperform existing discretization approaches, both in terms of result quality and the efficiency of computation. Our current research work in this area follows three objectives: First, we plan to show the applicability of the new framework in solving real-world control problems with large-scale topologies (sensory networks, traffic control, etc.). Our preliminary experimental results on simulated traffic networks are very encouraging and indicate that solving of complex real-world planning problems with distributed actions is indeed possible. Second, a limitation of the ALP approach is that basis functions that effect the quality of the solution must be provided to the solver in advance. We study methods for automatic basis-functions learning to alleviate the problem. Third, we explore the application of ALP solutions to MDP problems with continuous-time transitions and partly observable states.

CS Members:

Publications:

Recent presentations:


The web page is updated by milos.