
Exploiting Causality for Efficient Monitoring in POMDPs


Academic year: 2022


Stefano V. Albrecht, School of Informatics, University of Edinburgh, Edinburgh EH8 9AB, UK, s.v.albrecht@sms.ed.ac.uk

Subramanian Ramamoorthy, School of Informatics, University of Edinburgh, Edinburgh EH8 9AB, UK, s.ramamoorthy@ed.ac.uk

Abstract

Partially observable Markov decision processes (POMDPs) are a useful model for decision making in systems with uncertain states. One of the core tasks in a POMDP is the monitoring task, in which the belief state (i.e. the probability distribution over system states) is updated based on incomplete and noisy observations. This can be a hard problem in complex real-world systems due to the often very large state space. In this article, we explore the idea of accelerating the monitoring task by automatically exploiting causality in the system. We consider a specific type of causal relation, called passivity, which pertains to how system variables cause changes in other variables.

Specifically, a system variable is called passive if it changes its value only if it is directly acted upon, or if at least one of the variables that directly affect it (i.e. parent variables) changes its value. This property can be readily determined from the conditional probability table of the system variable. We present a novel monitoring method, called Passivity-based Monitoring (PM), which maintains a factored belief state representation and exploits passivity to perform selective updates over the factored beliefs. PM produces exact belief states under certain assumptions and approximate belief states otherwise, where the approximation error is bounded by the degree of uncertainty in the process. We show empirically, in synthetic processes with varying sizes and degrees of passivity, that PM is faster than two standard monitoring methods while achieving competitive accuracy. Furthermore, we demonstrate how passivity occurs naturally in a real-world system such as a multi-robot warehouse, and how PM can exploit this to accelerate the monitoring task.
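As a rough illustration of how passivity might be read off a conditional probability table, the following Python sketch checks a simplified version of the property described above. It is not the article's formal definition or algorithm. The CPT layout (a dictionary keyed by the variable's previous value, its parents' previous values, and its parents' next values), the function names `is_passive` and `build_cpt`, and the probability 0.2 are all assumptions made for this example. The CPT is taken to describe a single action that does not act on the variable directly.

```python
from itertools import product


def is_passive(cpt, tol=1e-9):
    """Return True if the variable keeps its value whenever no parent changes.

    Assumed (hypothetical) CPT layout:
        cpt[(x_prev, parents_prev, parents_next)] = {x_next: probability}
    where parents_prev and parents_next are tuples of parent values.
    """
    for (x_prev, pa_prev, pa_next), dist in cpt.items():
        parents_unchanged = pa_prev == pa_next
        # Under the simplified reading, a passive variable must stay at
        # x_prev with probability 1 when none of its parents changed.
        if parents_unchanged and dist.get(x_prev, 0.0) < 1.0 - tol:
            return False
    return True


def build_cpt(stay_prob_when_parent_fixed):
    """Build a toy CPT for a binary variable with one binary parent."""
    cpt = {}
    for x_prev, s_prev, s_next in product((0, 1), repeat=3):
        if s_prev == s_next:
            p_stay = stay_prob_when_parent_fixed
        else:
            p_stay = 0.2  # arbitrary value: the parent changed, so X may flip
        cpt[(x_prev, (s_prev,), (s_next,))] = {
            x_prev: p_stay,
            1 - x_prev: 1.0 - p_stay,
        }
    return cpt


passive_cpt = build_cpt(1.0)  # never changes unless the parent changes
noisy_cpt = build_cpt(0.9)    # may change spontaneously, so not passive

print(is_passive(passive_cpt), is_passive(noisy_cpt))
```

In this toy setting the check is a single pass over the table rows, which is consistent with the remark that the property can be determined directly from the conditional probability table.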

The full article can be found at: http://arxiv.org/abs/1401.7941
