• Aucun résultat trouvé

Deep Sets for Generalization in RL

N/A
N/A
Protected

Academic year: 2021

Partager "Deep Sets for Generalization in RL"

Copied!
16
0
0

Texte intégral

Figure

Figure 1: Modular architectures with attention. Left: policy. Right: reward function.
Figure 2: Reward function and policy learning. a: Train- Train-ing (left) and testTrain-ing (right) performances of the reward function after convergence (stars indicate significant  differ-ences w.r.t
Figure 5: Representation of possible ob- ob-jects types and categories.
Figure 6: Data distributions for the supervised learning of the reward function. Sorted counts of positive examples per training set descriptions.
+5

Références

Documents relatifs

From the preceding section, we know that the distribution of the random path is given by a Markov chain: Each left step is followed by a right step, and each right step is followed by

If business and human capital actor-networks that represent powerful specializations in the region mobilize around particular strategies, they may influence the output of

is sufficient to diffuse elements, grow grains, and develop semiconductor products • Cu rich grown absorbers contain fewer defects and have a potential to produce higher V oc

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des

GREEN yellow...

light blue dark blue.. www.lutinbazar.fr

The leaf-specific LOX2, LOX3, LOX4, and LOX6 contrib- ute to the synthesis of jasmonate precursors and play specific roles in growth restriction in response to wounding or in

Pocket Reference / Red Hat Linux Pocket Administrator / Petersen/ 222974-8 / Chapter 1 Color profile: Generic CMYK printer profile.. Composite