• Aucun résultat trouvé

Heuristic Search Value Iteration for zero-sum Stochastic Games

N/A
N/A
Protected

Academic year: 2021

Partager "Heuristic Search Value Iteration for zero-sum Stochastic Games"

Copied!
13
0
0

Texte intégral

Figure

TABLE II: An example game Γ α (α ∈ [−1, +1]) with example upper and lower bound games Γ up and Γ lo , their NESs and their (NE) values.
TABLE IV: Running time (in seconds) with a 3600s timeout (TO), number of playouts (P), iterations (I) and visited states (N) for hard instances of the Soccer problem, solved with serialized and simultaneous versions of HSVI, ShGp and ShBR, using γ = 0.95 a
Table I only presents a limited selection of operators that can be used to compute upper and lower bounds of the optimal value function
Fig. 1: Alesia’s game field/board (copied from [32])

Références

Documents relatifs

However, a central question in the theory of 2-person zero-sum stochastic differential games is that of sufficient conditions, under which the game ad- mits a value, that is,

Keywords Zero-sum stochastic games, Shapley operator, o-minimal structures, definable games, uniform value, nonexpansive mappings, nonlinear Perron-Frobenius theory,

• The policy iteration algorithm for ergodic mean-payoff games is strongly polynomial when restricted to the class of ergodic games such that the expected first return (or hitting)

data covering the whole eruption (grey) and the intrusion determined for the first part of the eruption using the Projected Disk method with cGNSS data, InSAR S1 D1 data or

diffusion processes, finding that a notion of thermodynamic consistency and of symmetry of the thermodynamic forces is useful to produce and interpret a tighter bound on a class

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des

Concernant le processus de privatisation à atteint 668 entreprises privatisées à la fin de l’année 2012, ce total la privatisation des entreprise économique vers les repreneurs

Integrating the power spectral densities of the magnetic and electric fields in the 10 to 500 Hz frequency range shows that the energy density of waves is higher inside the