• Aucun résultat trouvé

Evaluating probabilistic reasoning systems

N/A
N/A
Protected

Academic year: 2022

Partager "Evaluating probabilistic reasoning systems"

Copied!
2
0
0

Texte intégral

(1)

Evaluating Probabilistic Reasoning Systems

Adnan Darwiche and Rina Dechter

Theprobabilistic reasoning evaluationtook place during the month preceding the UAI conference. Its results will be presented at the applications workshop, and a full report on the methodology, benchmarks and results will be posted at the evaluation webpage following the UAI conference.

Motivation

Over the past two decades a variety of exact and approximate algorithms were developed across several communities (e.g. UAI, NIPS, SAT/CSPs) for an- swering optimization and likelihood queries over probabilistic graphical models.

Since all these tasks are NP-hard, theoretical guarantees are rare and empirical evaluation becomes a central evaluation tool. Yet, the empirical comparison be- tween algorithms requires agreement on representations, benchmarks and eval- uation criteria which is challenging, especially in the context of approximation algorithms.

Some communities have already addressed similar challenges through yearly empirical evaluations and competitions (e.g. SAT, CSP and planning) which proved effective, leading to algorithmic advances and to software development and dissemination. We believe that such an effort could benefit probabilistic inference algorithms as well. Probabilistic reasoning presents additional chal- lenges, however, as it tends to be harder, requires heterogenous knowledge rep- resentation frameworks, and must deal with the issue of evaluating approximate inference algorithms.

Goals

Our goal is to use the evaluation as a process that will help establish some standards for evaluating probabilistic reasoning systems based on both exact and approximate algorithms. Another long term goal is to reinforce a tradition of building and sharing probabilistic reasoning systems that allow easy access to state-of-the-art inference algorithms by members of the broader scientific and engineering communities. We hope to achieve a number of objectives:

Increase the utilization of probabilistic inference algorithms in real-world applications by reducing the investment needed for building applications based on probabilistic reasoning.

1

(2)

Allow newer members of the inference community to quickly capitalize on the expertise of more senior members of the community by providing broader access to existing code.

Foster an environment where reported empirical results are accompanied by the very systems used to obtain them.

The actual UAI’08probabilistic reasoning evaluation took place during the month preceding the conference and its results are presented and discussed during the applications workshop. The evaluation includes both Bayesian and Markov networks and consider three inference tasks: probability of evidence (partition function), most probable explanations (also called MPE or energy minimization), and node marginals. The evaluation will consider both exact and approximate algorithms, especially anytime algorithms that improve their approximations with time. Details of the evaluation can be found at:

http://graphmod.ics.uci.edu/uai08/Evaluation

A full report on the methodology, benchmarks and the results will be posted at the evaluation webpage following the conference.

Organizing Committee

Fahiem Bacchus: http://www.cs.toronto.edu/∼fbacchus/

Jeff Bilmes: http://ssli.ee.washington.edu/people/bilmes/

(co-chair) Adnan Darwiche: http://www.cs.ucla.edu/∼darwiche/

(co-chair) Rina Dechter: http://www.ics.uci.edu/∼dechter/

Hector Geffner: http://www.tecn.upf.es/∼hgeffner/

Alexander Ihler: http://www.ics.uci.edu/∼ihler/

Joris Mooij: http://www.jorismooij.nl/

Kevin Murphy: http://www.cs.ubc.ca/∼murphyk/

2

Références

Documents relatifs

The five keynote presenters at RuleML+RR 2018 were: Hannah Bast (University of Freiburg) presenting “Efficient and Convenient Search on Very Large Knowledge Bases”; Georg Gottlob

– Probabilty theory as calculus of uncertainty – Learning probabilistic models form data. Events,

The main differences are as follows: (i) in this paper we have added a parallel composition operator to our probabilistic process calculus; (ii) to define the operational semantics

We are given (i) a streaming application whose dependence graph is a linear chain; (ii) a one-to-many mapping of the application onto a fully heterogeneous target platform, where

In the previous section, we have shown how to compute the throughput when all communication times and all processing times are exponential variables (and this even in polynomial

From the theoretical point of view, our work is the first contribution on probabilistic contracts that consider both reliability and availability with compositional reasoning

Since the case of checkerboards (in dimension two) is interesting and challenging, following a question by Nicolas Burq, in the next section we investigate the case of

When the probabilistic choice is only partially known, the constraint choose is introduced into the propagation queue of the constraint solver and then, a filtering algorithm