• Aucun résultat trouvé

Covariance Matrix Adaptation for Direct Reinforcement Learning

N/A
N/A
Protected

Academic year: 2022

Partager "Covariance Matrix Adaptation for Direct Reinforcement Learning"

Copied!
59
0
0

Texte intégral

(1)

Covariance Matrix Adaptation for Direct Reinforcement Learning

Freek Stulp

Cognitive Robotics Group – ENSTA-ParisTech FLOWERS Team – INRIA/Bordeaux

Olivier Sigaud

Institut des Systmes Intelligents et de Robotique – UPMC CNRS UMR 7222

JFPDA 23.05.2012

(2)

Motivation

PI 2 powerful algorithm for direct RL on robots

Tuning exploration parameter in PI

2

Grasping under Uncertainty

IROS’11, ICRA’11

Pick-and-Place Tasks

Humanoids’11

(3)

Motivation

PI 2 powerful algorithm for direct RL on robots

Tuning exploration parameter in PI

2

(4)

Motivation

PI 2 powerful algorithm for direct RL on robots Tuning exploration parameter in PI

2

Update rules for PI 2 and CEM / CMA-ES almost identical

But CEM and CMA-ES additionaly tune exploration parameter automatically

Goals

Analysis and comparison of PI

2

/ CEM / CMA-ES Novel algorithm PI

2

-CMA

Essentially PI

2

with automatic exploration parameter tuning

(5)

Motivation

PI 2 powerful algorithm for direct RL on robots Tuning exploration parameter in PI

2

Update rules for PI 2 and CEM / CMA-ES almost identical

But CEM and CMA-ES additionaly tune exploration parameter automatically

Goals

Analysis and comparison of PI

2

/ CEM / CMA-ES Novel algorithm PI

2

-CMA

Essentially PI

2

with automatic exploration parameter tuning

(6)

Outline

PI 2

CEM CMA-ES

Reward-Weighted Averaging

Comparison

Evaluation Task

PI 2 -CMA

Evaluation

Adaptive Exploration

(7)

Outline

PI 2 CEM

CMA-ES

Reward-Weighted Averaging

Comparison

Evaluation Task

PI 2 -CMA

Evaluation

Adaptive Exploration

(8)

Outline

PI 2 CEM CMA-ES

Reward-Weighted Averaging

Comparison

Evaluation Task

PI 2 -CMA

Evaluation

Adaptive Exploration

(9)

Outline

PI 2 CEM CMA-ES

Reward-Weighted Averaging

Comparison

Evaluation Task

PI 2 -CMA

Evaluation

Adaptive Exploration

(10)

Outline

PI 2 CEM CMA-ES

Reward-Weighted Averaging

Comparison

Evaluation Task

PI 2 -CMA

Evaluation

Adaptive Exploration

(11)

Outline

PI 2 CEM CMA-ES

Reward-Weighted Averaging

Comparison

Evaluation Task

PI 2 -CMA

Evaluation

Adaptive Exploration

(12)

Outline

PI 2 CEM CMA-ES

Reward-Weighted Averaging

Comparison

Evaluation Task

PI 2 -CMA

Evaluation

Adaptive Exploration

(13)

Outline

PI 2 CEM CMA-ES

Reward-Weighted Averaging

Comparison

Evaluation Task

PI 2 -CMA Evaluation

Adaptive Exploration

(14)

Outline

PI 2 CEM CMA-ES

Reward-Weighted Averaging

Comparison

Evaluation Task

PI 2 -CMA

Evaluation

Adaptive Exploration

(15)

PI 2 - Derivation Outline

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

From first principles of Stochastic Optimal Control Start with Hamilton Jacobi Bellman equations

1

Log transformation + benign assumption

⇒ Linear!

2

Transform PDE to path integral with Feynman-Kac theorem

⇒ Roll-outs!

3

Apply to parameterized policies

⇒ Model free!

⇒ Iterative update rule for θ

(16)

PI 2 - Algorithm

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

Words

Images

Formulae

(17)

Input:

DMP with initial parameters θ, cost function J

While (cost not converged) Explore

sample exploration vectors execute DMP

determine cost

Update compute prob. from cost prob.-weighted averaging parameter update

1 τ

¨xt=α(β(g−xt)−xt) +˙ gT t

(

θ

+t,k)

J(τi)=φtN+ ZtN

ti (qt+1 2

θT tRθt)dt

t,k∼ N(0,Σ) θk=θ+k

θ←θ+δθ

P τi,k

= e−1

λJ(τi,k)

PK k=1[e−1

λJ(τi,k) ]

δθti= K X

k=1 h

P

τi,k

Mti,kti,k i

(18)

Input: DMP with initial parameters θ

, cost function J While (cost not converged)

Explore

sample exploration vectors execute DMP

determine cost

Update compute prob. from cost prob.-weighted averaging parameter update

1 τ

¨xt=α(β(g−xt)−xt) +˙ gT t

(

θ

+t,k)

J(τi)=φtN+ ZtN

ti (qt+1 2θT

tRθt)dt

t,k∼ N(0,Σ) θk=θ+k

θ←θ+δθ

P τi,k

= e−1

λJ(τi,k)

PK k=1[e−1

λJ(τi,k) ]

δθti= K X

k=1 h

P

τi,k

Mti,kti,k i

(19)

Input: DMP with initial parameters θ, cost function J

While (cost not converged) Explore

sample exploration vectors execute DMP

determine cost

Update compute prob. from cost prob.-weighted averaging parameter update

1 τ

¨xt=α(β(g−xt)−xt) +˙ gT t

(

θ

+t,k)

J(τi)=φtN+ ZtN

ti (qt+1 2θT

tRθt)dt

t,k∼ N(0,Σ) θk=θ+k

θ←θ+δθ

P

τi,k

= e−1

λJ(τi,k)

PK k=1[e−1

λJ(τi,k) ]

δθti= K X

k=1 h

P

τi,k

Mti,kti,k i

(20)

Input: DMP with initial parameters θ, cost function J

While (cost not converged) Explore

sample exploration vectors execute DMP

determine cost

Update compute prob. from cost prob.-weighted averaging parameter update

1 τ

¨xt=α(β(g−xt)−xt) +˙ gT t

(

θ

+t,k)

J(τi)=φtN+ ZtN

ti (qt+1 2θT

tRθt)dt

t,k∼ N(0,Σ) θk=θ+k

θ←θ+δθ

P

τi,k

= e−1

λJ(τi,k)

PK k=1[e−1

λJ(τi,k) ]

δθti= K X

k=1 h

P

τi,k

Mti,kti,k i

(21)

Input: DMP with initial parameters θ, cost function J While (cost not converged)

Explore

sample exploration vectors execute DMP

determine cost

Update

compute prob. from cost prob.-weighted averaging parameter update

1 τ

¨xt=α(β(g−xt)−xt) +˙ gT t

(

θ

+t,k)

J(τi)=φtN+ ZtN

ti (qt+1 2θT

tRθt)dt

t,k∼ N(0,Σ) θk=θ+k

θ←θ+δθ

P

τi,k

= e−1

λJ(τi,k)

PK k=1[e−1

λJ(τi,k) ]

δθti= K X

k=1 h

P

τi,k

Mti,kti,k i

(22)

Input: DMP with initial parameters θ, cost function J While (cost not converged)

Explore

sample exploration vectors

execute DMP determine cost

Update

compute prob. from cost prob.-weighted averaging parameter update

1 τ

¨xt=α(β(g−xt)−xt) +˙ gT t(θ+t,k)

J(τi)=φtN+ ZtN

ti (qt+1 2

θT tRθt)dt

t,k∼ N(0,Σ) θk=θ+k

θ←θ+δθ

P τi,k

= e−1

λJ(τi,k)

PK k=1[e−1

λJ(τi,k) ]

δθti= K X

k=1 h

P

τi,k

Mti,kti,k i

(23)

Input: DMP with initial parameters θ, cost function J While (cost not converged)

Explore

sample exploration vectors execute DMP

determine cost

Update

compute prob. from cost prob.-weighted averaging parameter update

1 τ

¨xt=α(β(g−xt)−xt) +˙ gT t(θ+t,k)

J(τi)=φtN+ ZtN

ti (qt+1 2

θT tRθt)dt

t,k∼ N(0,Σ) θk=θ+k

θ←θ+δθ

P τi,k

= e−1

λJ(τi,k)

PK k=1[e−1

λJ(τi,k) ]

δθti= K X

k=1 h

P

τi,k

Mti,kti,k i

(24)

Input: DMP with initial parameters θ, cost function J While (cost not converged)

Explore

sample exploration vectors execute DMP

determine cost

Update

compute prob. from cost prob.-weighted averaging parameter update

1 τ

¨xt=α(β(g−xt)−xt) +˙ gT t(θ+t,k)

J(τi)=φtN+ ZtN

ti (qt+1 2

θT tRθt)dt

t,k∼ N(0,Σ) θk=θ+k

θ←θ+δθ

P τi,k

= e−1

λJ(τi,k)

PK k=1[e−1

λJ(τi,k) ]

δθti= K X

k=1 h

P

τi,k

Mti,kti,k i

(25)

Input: DMP with initial parameters θ, cost function J While (cost not converged)

Explore

sample exploration vectors execute DMP

determine cost

Update

compute prob. from cost prob.-weighted averaging parameter update

1 τ

¨xt=α(β(g−xt)−xt) +˙ gT t(θ+t,k)

J(τi)=φtN+ ZtN

ti (qt+1 2

θT tRθt)dt

t,k∼ N(0,Σ) θk=θ+k

θ←θ+δθ

P τi,k

= e−1

λJ(τi,k)

PK k=1[e−1

λJ(τi,k) ]

δθti= K X

k=1 h

P

τi,k

Mti,kti,k i

(26)

Input: DMP with initial parameters θ, cost function J While (cost not converged)

Explore

sample exploration vectors execute DMP

determine cost

Update compute prob. from cost

prob.-weighted averaging parameter update

1 τ

¨xt=α(β(g−xt)−xt) +˙ gT t(θ+t,k)

J(τi)=φtN+ ZtN

ti (qt+1 2

θT tRθt)dt

t,k∼ N(0,Σ) θk=θ+k

θ←θ+δθ

P τi,k

= e−1

λJ(τi,k)

PK k=1[e−1

λJ(τi,k) ]

δθti= K X

k=1 h

P

τi,k

Mti,kti,k i

(27)

Input: DMP with initial parameters θ, cost function J While (cost not converged)

Explore

sample exploration vectors execute DMP

determine cost

Update compute prob. from cost prob.-weighted averaging

parameter update

1 τ

¨xt=α(β(g−xt)−xt) +˙ gT t(θ+t,k)

J(τi)=φtN+ ZtN

ti (qt+1 2

θT tRθt)dt

t,k∼ N(0,Σ) θk=θ+k

θ←θ+δθ

P τi,k

= e−1

λJ(τi,k)

PK k=1[e−1

λJ(τi,k) ]

δθti= K X

k=1 h

P

τi,k

Mti,kti,k i

(28)

Input: DMP with initial parameters θ, cost function J While (cost not converged)

Explore

sample exploration vectors execute DMP

determine cost

Update compute prob. from cost prob.-weighted averaging parameter update

1 τ

¨xt=α(β(g−xt)−xt) +˙ gT t(θ+t,k)

J(τi)=φtN+ ZtN

ti (qt+1 2

θT tRθt)dt

t,k∼ N(0,Σ) θk=θ+k

θ←θ+δθ

P τi,k

= e−1

λJ(τi,k)

PK k=1[e−1

λJ(τi,k) ]

δθti= K X

k=1 h

P

τi,k

Mti,kti,k i

(29)

Input: DMP with initial parameters θ, cost function J While (cost not converged)

Explore

sample exploration vectors execute DMP

determine cost

Update compute prob. from cost prob.-weighted averaging parameter update

1 τ

¨xt=α(β(g−xt)−xt) +˙ gT t(θ+t,k)

J(τi)=φtN+ ZtN

ti (qt+1 2

θT tRθt)dt

t,k∼ N(0,Σ) θk=θ+k

θ←θ+δθ

P τi,k

= e−1

λJ(τi,k)

PK k=1[e−1

λJ(τi,k) ]

δθti= K X

k=1 h

P

τi,k

Mti,kti,k i

(30)

PI 2 - Algorithm

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

Advantages

No gradient ⇒ Deals with discontinous noisy cost functions Update δ within convex hull of

k=1...K

⇒ Safe update rule Arbitrary cost functions

Model-free Fast convergence

Only one open parameter: magnitude of exploration (

t,k

∼ N(0, Σ)) Disadvantage

No global convergence guarantees...

Robotics: This is where imitation comes in! π(θ

imit

) ≈ π(θ

)

Applied to very high-dimensional, complex tasks...

(31)

Cross-Entropy Method ( CEM )

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

θk=1...K

N(θ,

Σ)

∀k Jk

=

J(θk

)

θk=1...K←sortθk=1...Kw.r.tJk=1...K θnew

=

Ke

X

k=1

1

Keθk

Σ

new

=

Ke

X

k=1

1

Ke

k−θ)(θk−θ)|

This algorithm can be interpreted as performing reward-weighted averaging

(32)

Cross-Entropy Method ( CEM )

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

θk=1...K∼ N(θ,

Σ)

∀k Jk

=

J(θk

)

θk=1...K←sortθk=1...Kw.r.tJk=1...K θnew

=

Ke

X

k=1

1

Keθk

Σ

new

=

Ke

X

k=1

1

Ke

k−θ)(θk−θ)|

This algorithm can be interpreted as performing reward-weighted averaging

(33)

Cross-Entropy Method ( CEM )

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

θk=1...K∼ N(θ,

Σ)

∀k Jk

=

J(θk

)

θk=1...K←sortθk=1...Kw.r.tJk=1...K θnew

=

Ke

X

k=1

1

Keθk

Σ

new

=

Ke

X

k=1

1

Ke

k−θ)(θk−θ)|

This algorithm can be interpreted as performing reward-weighted averaging

(34)

Cross-Entropy Method ( CEM )

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

θk=1...K∼ N(θ,

Σ)

∀k Jk

=

J(θk

)

θk=1...K←sortθk=1...Kw.r.tJk=1...K

θnew

=

Ke

X

k=1

1

Keθk

Σ

new

=

Ke

X

k=1

1

Ke

k−θ)(θk−θ)|

This algorithm can be interpreted as performing reward-weighted averaging

(35)

Cross-Entropy Method ( CEM )

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

θk=1...K∼ N(θ,

Σ)

∀k Jk

=

J(θk

)

θk=1...K←sortθk=1...Kw.r.tJk=1...K θnew

=

Ke

X

k=1

1

Keθk

Σ

new

=

Ke

X

k=1

1

Ke

k−θ)(θk−θ)|

This algorithm can be interpreted as performing reward-weighted averaging

(36)

Cross-Entropy Method ( CEM )

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

θk=1...K∼ N(θ,

Σ)

∀k Jk

=

J(θk

)

θk=1...K←sortθk=1...Kw.r.tJk=1...K θnew

=

Ke

X

k=1

1

Keθk

Σ

new

=

Ke

X

k=1

1

Ke

k−θ)(θk−θ)|

This algorithm can be interpreted as performing reward-weighted averaging

(37)

Cross-Entropy Method ( CEM )

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

θk=1...K∼ N(θ,

Σ)

∀k Jk

=

J(θk

)

θk=1...K←sortθk=1...Kw.r.tJk=1...K θnew

=

Ke

X

k=1

1

Keθk

Σ

new

=

Ke

X

k=1

1

Ke

k−θ)(θk−θ)|

This algorithm can be interpreted as performing reward-weighted averaging

(38)

CMA-ES

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

Covariance Matrix Adaptation - Evolutionary Strategy Like CEM , but

Different mapping from cost to probability

More sophisticated method for updating covariance matrix:

pσ←

(1

−cσ

)

+

q

(2

−cσ

P

Σ

−1θnew−θ

σ

(1)

σnew

=

σ×

exp

cσ

kpσk

EkN

(0,

I)k−

1 (2)

pΣ

(1

−cΣ

)

pΣ

+

q

cΣ

(2

−cΣ

Pθnew−θ

σ

(3)

Σ

new

= (1

−c1−cµ

) Σ +

c1

(p

ΣpTΣ

+

δ(hσ

)Σ)

+

cµ

Ke

X

k=1

Pk

k−θ)(θk−θ)|

(4)

(39)

PI 2 / CEM / CMA-ES - Similarities

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

PI 2 / CEM / CMA-ES are all based on

Exploration: sample from a Gaussian θ

k

∼ N (θ, Σ) Parameter update: Reward-Weighted Averaging θ

new

= P

K

k=1

P

k

θ

k

Similarity striking, as algorithms derived from very different principles!

CEM is a special case of CMA-ES : proof in paper.

(maybe this was already known?)

(40)

PI 2 / CEM / CMA-ES - Similarities

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

PI 2 / CEM / CMA-ES are all based on

Exploration: sample from a Gaussian θ

k

∼ N (θ, Σ) Parameter update: Reward-Weighted Averaging θ

new

= P

K

k=1

P

k

θ

k

Similarity striking, as algorithms derived from very different principles!

CEM is a special case of CMA-ES : proof in paper.

(maybe this was already known?)

(41)

PI 2 / CEM / CMA-ES - Similarities

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

PI 2 / CEM / CMA-ES are all based on

Exploration: sample from a Gaussian θ

k

∼ N (θ, Σ) Parameter update: Reward-Weighted Averaging θ

new

= P

K

k=1

P

k

θ

k

Similarity striking, as algorithms derived from very different principles!

CEM is a special case of CMA-ES : proof in paper.

(maybe this was already known?)

(42)

PI 2 / CEM / CMA-ES - Differences

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

Also some differences Evaluated on following task:

J(τ

t

) = δ(t − 0.3) · ((x

t

− 0.5) 2 + (y

t

− 0.5) 2 ) +

P

D

d=1

(D + 1 − d)(¨ a

t

) 2 P

D

d=1

(D + 1 − d) (5)

(43)

PI 2 / CEM / CMA-ES - Differences

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

Also some differences Evaluated on following task:

J(τ

t

) = δ(t − 0.3) · ((x

t

− 0.5) 2 + (y

t

− 0.5) 2 ) +

P

D

d=1

(D + 1 − d)(¨ a

t

) 2 P

D

d=1

(D + 1 − d) (5)

(44)

PI 2 / CEM / CMA-ES - Differences

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

Exploration Noise

PI

2

θ

k,t

∼ N (θ, Σ)

CEM

θ

k

∼ N (θ, Σ)

CMA-ES

θ

k

∼ N (θ, Σ)

(45)

PI 2 / CEM / CMA-ES - Differences

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

Exploration Noise

PI

2

θ

k,t

∼ N (θ, Σ)

CEM

θ

k

∼ N (θ, Σ)

CMA-ES

θ

k

∼ N (θ, Σ)

(46)

PI 2 / CEM / CMA-ES - Differences

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

Eliteness

PI

2

CEM CMA-ES

(47)

PI 2 / CEM / CMA-ES - Differences

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

Eliteness

PI

2

CEM CMA-ES

(48)

PI 2 / CEM / CMA-ES - Differences

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

Covariance Matrix Updating

PI

2

No

CEM

Yes (simple)

CMA-ES

Yes (sophisticated)

(49)

PI 2 -CMA

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

Suggests a new algorithm: PI 2 -CMA Constant exploration noise Eliteness measure from PI

2

Covariance matrix updating from CEM / CMA-ES

Basically PI 2 , but with adaptive exploration

In PI

2

, exploration magnitude must be tuned by hand

Next evaluation demonstrates advantages of PI

2

-CMA

(50)

PI 2 -CMA

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

Suggests a new algorithm: PI 2 -CMA Constant exploration noise Eliteness measure from PI

2

Covariance matrix updating from CEM / CMA-ES Basically PI 2 , but with adaptive exploration

In PI

2

, exploration magnitude must be tuned by hand

Next evaluation demonstrates advantages of PI

2

-CMA

(51)

PI 2 -CMA- Adaptive Exploration

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

(52)

PI 2 -CMA- Adaptive Exploration

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

⇒ Exploration magnitude influences convergence speed and exploitation

(53)

PI 2 -CMA- Adaptive Exploration

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

(54)

PI 2 -CMA- Adaptive Exploration

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

(55)

PI 2 -CMA- Adaptive Exploration

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

More recent results (not in paper)

(56)

PI 2 -CMA- Adaptive Exploration

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

(57)

Summary

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

PI 2 / CEM / CMA-ES have identical update rules: reward-weighted averaging Apply covariance matrix adaptation (as in CEM / CMA-ES ) to PI 2

Novel algorithm PI 2 -CMA

With adaptive exploration (other algorithmic parameters trivial to tune) Future work

Further analysis and theoretical validation

Evaluation on real robots

(58)

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

Thank you for your attention!

Questions?

(59)

Bibliography

PI2 CEM CMA-ES

Comparison Evaluation Task

PI2-CMA Evaluation

Stulp, F. and Schaal, S. (2011).

Hierarchical reinforcement learning with motion primitives.

In11th IEEE-RAS International Conference on Humanoid Robots.

Stulp, F., Theodorou, E., Buchli, J., and Schaal, S. (2011a).

Learning to grasp under uncertainty.

InProceedings of the IEEE International Conference on Robotics and Automation (ICRA).

Stulp, F., Theodorou, E., Kalakrishnan, M., Pastor, P., Righetti, L., and Schaal, S. (2011b).

Learning motion primitive goals for robust manipulation.

InInternational Conference on Intelligent Robots and Systems (IROS).

Références

Documents relatifs

Left: Ratios of number of function evaluations performed in single runs to reach a precision 10 −10 close to the global optimal f value of the proposed approach over CMA-ES depending

Evolution strategies, covariance matrix adaptation, adaptive diagonal decoding, active covariance matrix update, default strategy parameters..

In many Evolutionary Algorithms the distribution P is implicitly defined via operators on a population, in particular, selection, recombination and mutation.. Natural template

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des

Removing the phase-space factor, event by event, we determine directly for the rst time the square of the amplitude for pion-pair emission and the corresponding correlation

However, the implementation of this technique faces several problems: (1) the large variety of situations relating to the type of crop or ploughing, sowing dates, soil and

Assuming fixed search costs (number of function evalua- tions), a small population size λ allows a larger number of generations and therefore usually leads to a faster adaptation of

Keywords: stochastic optimization, variable metric algorithm, multi-objective opti- mization, evolutionary algorithm, evolution strategy, covariance matrix adaptation,