HAL Id: hal-00394945
https://hal.archives-ouvertes.fr/hal-00394945
Preprint submitted on 16 Jun 2009
HAL is a multi-disciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.
A method to improve Standard PSO
Maurice Clerc
To cite this version:
Maurice Clerc. A method to improve Standard PSO. 2009. hal-00394945
Technical report. DRAFT MC2009-03-13
Maurice Clerc
Abstract
In this report, I present my personal method to design a more accurate version of PSO, assuming we know what kind of problems we will have to solve. To illustrate the method, I consider a class of problems that are moderately multimodal and not of very high dimensionality (typically smaller than 30). The starting version is Standard PSO 2007 (SPSO 07). A modified velocity update equation is used, specific to the considered class of problems. Then, in order to improve the robustness of the algorithm, more general modifications are added (in particular, better initialisations, and the use of two kinds of particles). On the whole, the resulting algorithm is indeed an improvement of SPSO 07, though the improvement is not always very impressive. In passing, I give a precise definition of "better than", and explain why the classical mean best performance criterion may easily be completely meaningless.
1 Why this paper?
A question I often face is this: "You are always referring to Standard PSO 2007 (SPSO 07, [1]). Isn't it possible to design a better version?". The answer is of course "yes". It is even written in the source code available on line:
"This PSO version does not intend to be the best one on the market"
On the one hand, it is indeed not very difficult to write a better PSO. On the other hand, as a reviewer, I read a lot of papers in which the authors claim they have succeeded in doing that, though it is not really true, mainly for two reasons:
• they start from a bad version of PSO (usually a global best one). Although they may really improve it, the resulting algorithm is still not as good as SPSO 07;
• or they start from a reasonably good version (or even from SPSO 07 itself), but they update it having apparently no clear idea of what the modification is for. A typical example is the usual claim "we want to improve the exploitation/exploration balance", without giving any rigorous definition of what this balance is. As a result, the modified algorithm may or may not be better, more or less in a random way, and we do not even know why.
Also, of course, there is the well known problem of the comparison of two algorithms. Often, we find something like "our new algorithm B is therefore better than algorithm A", without any clear definition of "better than". Let us first say a few words about this point.
2 My algorithm is better than yours
What does such a claim mean? First, such a claim is valid only if we agree on a common benchmark. Let us say, for example, we consider a subset of the CEC 2005 benchmark [2]. Then, we have to agree on a given criterion. For example, for each function, the criterion may be Cr = "success rate over N runs, for a given accuracy ε, after at most F fitness evaluations for each run", or, in case of a tie, "mean of the best results". Note that the criterion may be a probabilistic one (for example Cr' = "the probability that A is better than B on this problem, according to the criterion Cr, is greater than 0.5"). What is important is to clearly define the meaning of "A is better than B on one problem". A lot of researchers indeed use such an approach, and then perform nice statistical analyses (null hypothesis, p-test, and so on), in order to decide "in probability" whether an algorithm A is better than an algorithm B, on the whole benchmark.
However, in the process, they miss a simple and frustrating fact: there is no complete order in R^D, for all D > 1. Why is that important? It may be useful to express it more formally. Let us say the benchmark contains P problems. We build two comparison vectors. First C_{A,B} = (c_{A,B,1}, ..., c_{A,B,P}), with c_{A,B,i} = 1 if A is better than B on problem i (according to the unique criterion defined), and c_{A,B,i} = 0 otherwise. Second C_{B,A} = (c_{B,A,1}, ..., c_{B,A,P}), with c_{B,A,i} = 1 if B is better than A on problem i, and c_{B,A,i} = 0 otherwise. We have to compare the two numerical vectors C_{A,B} and C_{B,A}. Now, precisely because there is no complete order in R^D, we can say that A is better than B if and only if c_{A,B,i} ≥ c_{B,A,i} for all i, and there exists a j so that c_{A,B,j} > c_{B,A,j}. This is similar to the definition of the classical Pareto dominance. As we have P values of one criterion, the process of comparing A and B can be seen as a multicriterion (or multiobjective) problem. It implies that most of the time no comparison is possible, except by using an aggregation method. For example, here, we could count the number of 1s in each vector, and say that the one with the larger sum "wins". But the point is that any aggregation method is arbitrary, i.e. for each method there is another one that leads to a different conclusion^1.
Let us consider an example:
• the benchmark contains 5 unimodal functions f_1 to f_5, and 5 multimodal ones, f_6 to f_10;
• the algorithm A is extremely good on unimodal functions (which are very easy, say, for a gradient method);
• the algorithm B is quite good on multimodal functions, but not on unimodal ones.
You find c_{A,B,i} = 1 for i = 1, 2, 3, 4, 5, and also for i = 6 (just because, for example, the attraction basin of the global optimum is very large, compared to the ones of the local optima), and c_{B,A,i} = 1 for i = 7, 8, 9, 10. You then say "A is better than B". A user trusts you, and chooses A for his problems. And as most interesting real problems are multimodal, he will be very disappointed. So, we have to be both more modest and more rigorous. That is why the first step in our method of designing an improved PSO is the choice of a small benchmark. But we will say that A is better than B only if it is true for all the problems of this benchmark.
3 Step 1: a small representative benchmark
This is the most specific part of the method, for it depends on the kind of problems we do want to solve later with our improved PSO. Let us consider the following class of problems:
• moderately multimodal, or even unimodal (but of course we are not supposed to know it in advance);
• not of too high dimensionality (say no more than 30).
For this class, to which a lot of real problems belong, I have found that a good small benchmark may be the following one (see Table 1):
• CEC 2005 Sphere (unimodal);
• CEC 2005 Rosenbrock (one global optimum, at least one local optimum as soon as the dimension is greater than 3);
• Tripod (two local optima, one global optimum, very deceptive [3]).
These three functions are supposed to be representative of our class of problems. If we have an algorithm that is good on them, then it is very probably also good for a lot of other problems of the same class. Our aim is then to design a PSO variant that is better than SPSO 07 for these three functions. Our hope is that this PSO variant will indeed be also better than SPSO 07 on more problems of the same kind. And if it is true even for some highly multimodal problems, and/or for higher dimensionality, well, we can consider that as a nice bonus!
^1 For example, it is possible to assign a "weight" to each problem (which represents how "important" this kind of problem is for the user) and to linearly combine the c_{A,B,i} and c_{B,A,i}. But if, for a set of (non identical) weights, A is better than B, then there always exists another set of weights for which B is better than A.

Tab. 1: The benchmark. More details are given in 9.1
                      Search space     Required accuracy   Maximum number of fitness evaluations
CEC 2005 Sphere       [-100, 100]^30   0.000001            15000
CEC 2005 Rosenbrock   [-100, 100]^10   0.01                50000
Tripod                [-100, 100]^2    0.0001              10000

4 Step 2: a highly flexible PSO
My main tool is a PSO version (C code), which is based on SPSO 07. However, I have added a lot of options, in order to have a very flexible research algorithm. Actually, I often modify it, but you can always find the latest version (named Balanced PSO) on my technical site [4]. When I used it for this paper, the main options were:
• two kinds of randomness (KISS [5], and the standard randomness provided in the LINUX C compiler). In what follows, I always use KISS, so that the results can be more reproducible;
• seven initialisation methods for the positions (in particular a variant of the Hammersley's one [6]);
• six initialisation methods for the velocities (zero, completely random, random around a position, etc.);
• two clamping options for the position (actually, just clamping like in SPSO 07, or no clamping and no evaluation);
• possibility to define a search space greater than the feasible space. Of course, if a particle flies outside the feasible space, its fitness is not evaluated;
• six local search options (no local search as in SPSO 07, uniform in the best local area, etc.). Note that this implies a rigorous definition of what a "local area" is. See [7];
• two options for the loop over particles (sequential or at random);
• six strategies.
The strategies are related to the classical velocity update formula

v(t + 1) = w v(t) + R(c_1)(p(t) - x(t)) + R(c_2)(g(t) - x(t))    (1)

One can use different coefficients w, and different random distributions R. The most interesting point is that different particles may have different strategies. In the C source code, each option has an identifier, to easily describe the options used. For example, PSO P1V2 means: SPSO 07, in which the initialisation of the positions is done by method 1, and the initialisation of the velocities by method 2. Please refer to the on line code for more details. In our case, we will now see how an interesting PSO variant can be designed by using just three options.
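Equation 1 can be sketched dimension by dimension as below. This is not the actual Balanced PSO source: the function names are ours, R(c) is taken as U(0, c), and w and c are assumed to be the usual SPSO 07 constants 1/(2 ln 2) ≈ 0.721 and 0.5 + ln 2 ≈ 1.193.

```c
#include <stdlib.h>

static const double W = 0.721;  /* assumed inertia weight, 1/(2 ln 2) as in SPSO 07 */
static const double C = 1.193;  /* assumed acceleration, 0.5 + ln 2 as in SPSO 07  */

/* U(0, c): uniform random number in [0, c) */
static double U(double c) { return c * ((double)rand() / ((double)RAND_MAX + 1.0)); }

/* Equation 1, applied on each dimension d:
   v(t+1) = w v(t) + U(0,c1)(p(t) - x(t)) + U(0,c2)(g(t) - x(t)) */
void velocity_update(double *v, const double *x, const double *p, const double *g, int D)
{
    for (int d = 0; d < D; d++)
        v[d] = W * v[d] + U(C) * (p[d] - x[d]) + U(C) * (g[d] - x[d]);
}
```

When the particle sits exactly on both p and g, the two random terms vanish and the velocity simply decays by the factor w.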
5 Step 3: selecting the right options
First of all, we simulate SPSO 07, by setting the parameters and options to the corresponding ones. The results over 500 runs are given in Table 2. In passing, it is worth noting that the usual practice of launching only 25 or even 100 runs is not enough, for really bad runs may occur quite rarely. This is obvious for Rosenbrock, as we can see from Table 3. Any conclusion that is drawn after just 100 runs is risky, particularly if you consider the mean best value. The success rate is more stable. More details about this particular function are given in 9.5.
Tab. 2: Standard PSO 2007. Results over 500 runs

                      Success rate   Mean best
CEC 2005 Sphere       84.8%          10^-6
CEC 2005 Rosenbrock   15%            12.36
Tripod                46%            0.65
Tab. 3: For Rosenbrock, the mean best value depends heavily on the number of runs (50000 fitness evaluations for each run). The success rate is more stable

Runs   Success rate   Mean best value
100    16%            10.12
500    15%            12.36
1000   14.7%          15579.3
2000   14%            50885.18
5.1 Applying a specific improvement method
When we consider the surfaces of the attraction basins, the result for Tripod is not satisfying (the success rate should be greater than 50%). What options/parameters could we modify in order to improve the algorithm? Let us call the three attraction basins B_1, B_2, and B_3. The problem is deceptive because two of them, say B_2 and B_3, lead to only local optima. If, for a position x in B_1 (i.e. in the basin of the global optimum), the neighbourhood best g is either in B_2 or in B_3, then, according to equation 1, even if the distance between x and g is high, the position x may easily be modified such that it is not in B_1 anymore. This is because in SPSO 07 the term R(c_2)(g(t) - x(t)) is simply U(0, c_2)(g(t) - x(t)), where U is the uniform distribution.

However, we are interested in functions with a small number of local optima, and therefore we may suppose that the distance between two optima is usually not very small. So, in order to avoid the above behaviour, we use the idea that the further an informer is, the smaller is its influence (this can be seen as a kind of niching). We may then try an R(c_2) that is in fact an R(c_2, |g - x|), decreasing with |g - x|. The optional formula I use to do that in my flexible PSO is

R(c_2, |g - x|) = U(0, c_2) (1 - |g - x| / (x_max - x_min))^λ    (2)
Experiments suggest that λ should not be too high, because in that case, although the algorithm becomes almost perfect for Tripod, the result for Sphere becomes quite bad. In practice, λ = 2 seems to be a good compromise. With this value, the result for Sphere is also improved, as we can see from Table 4. According to our nomenclature, this PSO is called PSO R2. The result for Rosenbrock may now be slightly worse, but we have seen that we do not need to worry too much about the mean best, if the success rate seems correct. Anyway, we may now also apply some general improvement options.
Tab. 4: Results with PSO R2 (distance decreasing distribution, according to equation 2)

                      Success rate   Mean best
CEC 2005 Sphere       98.6%          0.14 × 10^-6
CEC 2005 Rosenbrock   13.4%          10.48
Tripod                47.6%          0.225
5.2 Applying some general improvement options (initialisations)
The above option was specially chosen in order to improve what seemed to be the worst result, i.e. the one for the Tripod function. Now, we can trigger some other options that are often beneficial, at least for moderately multimodal problems:
• modified Hammersley method for the initialisation of the positions x;
• One-rand method for the initialisation of the velocity of the particle whose initial position is x, i.e. v = U(x_min, x_max) - x. Note that in SPSO 07, the method is the "Half-diff" one, i.e. v = 0.5 (U(x_min, x_max) - U(x_min, x_max)).
This modified algorithm is PSO R2P2V1. The results are given in Table 5, and are clearly better than the ones of SPSO 07. They are still not completely satisfying (cf. Rosenbrock), though. So, we can try yet another option, which can be called "bi-strategy".
Tab. 5: Results when also applying different initialisations, for positions and velocities (PSO R2P2V1)

                      Success rate   Mean best
CEC 2005 Sphere       98.2%          0.15 × 10^-6
CEC 2005 Rosenbrock   18.6%          31132.29
Tripod                63.8%          0.259
5.3 Bi-strategy
The basic idea is very simple: we use two kinds of particles. In practice, during the initialisation phase, we assign one of two possible behaviours, with a probability equal to 0.5. These two behaviours are simply:
• the one of SPSO 07, in particular R(c_2) = U(0, c_2);
• or the one of PSO R2 (i.e. using equation 2).
The resulting algorithm is PSO R3P2V1. As we can see from Table 6, for all the three functions we now obtain results that are clearly better than the ones of SPSO 07. Compared to PSO R2P2V1, success rates are slightly worse for Sphere and Rosenbrock, slightly better for Tripod, so no clear comparison is possible. However, more tests (not detailed here) show that this variant is more robust, as we can guess by looking at the mean best values, so we keep it. Two questions remain, though. Is it still valid for different maximum numbers of fitness evaluations (search effort)? And is it true for more problems, even if they are not really in the same class, in particular if they are highly multimodal? Both answers are affirmative, as tackled in the next sections.
Tab. 6: Results when adding the bi-strategy option (PSO R3P2V1)

                      Success rate   Mean best
CEC 2005 Sphere       96.6%          < 10^-10
CEC 2005 Rosenbrock   18.2%          6.08
Tripod                65.4%          0.286
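The bi-strategy initialisation itself is tiny; a sketch (the enum and function names are ours, not from the Balanced PSO source):

```c
#include <stdlib.h>

enum strategy { SPSO07_STRATEGY, R2_STRATEGY };

/* During initialisation, each particle draws one of the two behaviours,
   with probability 0.5, and keeps it for the whole run. */
void assign_strategies(enum strategy *s, int swarm_size)
{
    for (int i = 0; i < swarm_size; i++)
        s[i] = ((double)rand() / ((double)RAND_MAX + 1.0)) < 0.5
             ? SPSO07_STRATEGY  /* R(c2) = U(0, c2)              */
             : R2_STRATEGY;     /* R(c2) damped as in equation 2 */
}
```

The velocity update then simply branches on the particle's own flag when computing the R(c_2) term.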
6 Now, let's try
6.1 Success rate vs search effort
Here, on the same three problems, we simply consider different maximum numbers of fitness evaluations (FE_max), and we evaluate the success rate over 500 runs. As we can see from Figure 1, for any FE_max the success rate of our variant is greater than the one of SPSO 07. So, we can safely say that it is really better, at least on this small benchmark. Of course, it is not always so obvious. Giving a long list of results is out of the scope of this paper, which is just about a design method, but we can nevertheless have an idea of the performance on a few more problems.
6.2 Moderately multimodal problems
Table 7 and Figure 2 are about moderately multimodal problems. This is a small selection, to illustrate different cases:
• clear improvement, no matter what the number of fitness evaluations is, but the improvement is small (Schwefel, Pressure vessel). Actually, SPSO 07 is already pretty good on these problems (for example, for Pressure vessel, SOMA needs more than 50000 fitness evaluations to solve it [8]), so our small modifications cannot improve it a lot;
• questionable improvement, i.e. depending on the number of fitness evaluations (Compression spring);
• clear big improvement (Gear train). For this problem, and after 20000 fitness evaluations, not only is the success rate of PSO R3P2V1 92.6%, but it finds the very good solution (19, 16, 43, 49) (or an equivalent permutation) 85 times over 500 runs. The fitness of this solution is 2.7 × 10^-12 (SOMA needs about 200,000 evaluations to find it).
Even when the improvement is not very important, the robustness is increased. For example, for Pressure vessel, with 11000 fitness evaluations, the mean best is 28.23 (standard dev. 133.35) with SPSO 07, while it is 18.78 (standard dev. 56.97) with PSO R3P2V1.
Tab. 7: More moderately multimodal problems. See 9.2 for details

                     Search space      Required accuracy
CEC 2005 Schwefel    [-100, 100]^10    0.00001
Pressure vessel      4 variables       0.00001
 (discrete form)     (objective 7197.72893)
Compression spring   3 variables       0.000001
 (granularity 0.001 for x_3)   (objective 2.625421)
Gear train           4 variables       10^-9
6.3 Highly multimodal problems
Table 8 and Figure 3 are for highly multimodal problems. The good news is that our modified PSO is also better even for some highly multimodal problems. It is not true all the time (see Griewank or Cellular phone), but it was not its aim, anyway.
7 Claims and suspicion
We have seen that it is possible to improve Standard PSO 2007 by modifying the velocity update equation and the initialisation schemes. However, this improvement is not valid across all kinds of problems, and not valid across all criteria (in particular, it may depend on the number of fitness evaluations). Also, the improvement is not always very impressive. Thus, this study incites us to be suspicious when reading an assertion like "My PSO variant is far better than Standard PSO". Such a claim has to be very carefully supported, by a rigorous definition of what "better" means, and by significant results on a good representative benchmark, on a large range of maximum numbers of fitness evaluations. Also, we have to be very careful when using the mean best criterion for comparison, for it may be meaningless. And, of course, the proposed PSO variant should be compared to the current Standard PSO, and not to an old bad version.
(a) Sphere  (b) Rosenbrock  (c) Tripod
Fig. 1: Success probability vs search effort. For any FE_max the variant is better

(a) Schwefel  (b) Pressure vessel  (c) Compression spring  (d) Gear train
Fig. 2: On the Schwefel and Pressure vessel problems, PSO R3P2V1 is slightly better than SPSO 07 for any number of fitness evaluations. On the Compression spring problem, it is true only when the number of fitness evaluations is greater than a given value (about 19000). So, on this problem, either claim "SPSO 07 is better" or "PSO R3P2V1 is better" is wrong

(a) Rastrigin  (b) Griewank  (c) Ackley  (d) Cellular phone
Fig. 3: Success probability for some highly multimodal problems. Although designed for moderately multimodal problems, PSO R3P2V1 is even sometimes good for these problems. But not always
Tab. 8: Highly multimodal problems. See 9.3 for details

                     Search space      Required accuracy
CEC 2005 Rastrigin   [-5, 5]^10        0.01
CEC 2005 Griewank    [-600, 600]^10    0.01 (not rotated)
CEC 2005 Ackley      [-32, 32]^10      0.0001 (not rotated)
Cellular phone       [0, 100]^20       10^-8
8 Home work
The specific improvement modification of SPSO 07 used here was for moderately multimodal problems, in low dimension. Let us call them M-problems. Now, what could be an effective specific modification for another class of problems? Take, for example, the class of highly multimodal problems, but still in low dimension (smaller than 30). Let us call them H-problems.
First, we have to define a small representative benchmark. Hint: include Griewank 10D, from the CEC 2005 benchmark (no need to use the rotated function). Second, we have to understand in which way the difficulty of an H-problem is different from that of an M-problem. Hint: on an H-problem, SPSO 07 is usually less easily trapped into a local minimum, just because the attraction basins are small. On the contrary, if a particle is inside the good attraction basin (the one of the global optimum), it may even leave it prematurely. And third, we have to find what options are needed to cope with the found specific difficulty(ies). Hint: to make sure the current attraction basin is well exploited, a quick local search may be useful. A simple way is to define a local area around the best known position, and to sample its middle (PSO L4)^2. With just this option, an improvement seems possible, as we can see from Figure 4 for the Griewank function. However, it does not work very well for Rastrigin.
All this will probably be the topic of a future paper, but for the moment, you can think about it yourself. Good luck!
^2 Let g = (g_1, g_2, ..., g_D) be the best known position. On each dimension i, let p_i and p'_i be the nearest coordinates of known points, "on the left" and "on the right" of g_i. The local area H is the D-rectangle (hyperparallelepiped) cartesian product ⊗_i [g_i - α(g_i - p_i), g_i + α(p'_i - g_i)], with, in practice, α = 1/3. Then its centre is sampled. Usually, it is not g.

Fig. 4: Griewank, comparison between SPSO 07 and PSO L4. For a highly multimodal problem, a very simple local search may improve the performance.
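Our reading of the local area H defined in footnote 2 can be sketched as follows (the function and variable names are ours, and this is only one plausible transcription of the footnote):

```c
/* Centre of the local area H around the best known position g.
   On each dimension, H is [g - alpha*(g - p_left), g + alpha*(p_right - g)],
   where p_left and p_right are the nearest known coordinates on each side
   of g, and alpha = 1/3. PSO L4 samples this centre, which is usually
   not g itself. */
void local_area_centre(const double *g, const double *p_left, const double *p_right,
                       double *centre, int D)
{
    const double alpha = 1.0 / 3.0;
    for (int i = 0; i < D; i++) {
        double lo = g[i] - alpha * (g[i] - p_left[i]);
        double hi = g[i] + alpha * (p_right[i] - g[i]);
        centre[i] = 0.5 * (lo + hi);
    }
}
```

Note that the centre drifts towards the side where the nearest known point is further away, which is why it usually differs from g.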
9 Appendix
9.1 Formulae for the benchmark
Tab. 9: Benchmark details

Sphere
f(x) = -450 + Σ_{d=1}^{30} (x_d - o_d)^2
The random offset vector O = (o_1, ..., o_30) is defined by its C code below. This is the solution point.

Rosenbrock
f(x) = 390 + Σ_{d=2}^{10} [ 100 (z_{d-1}^2 - z_d)^2 + (z_{d-1} - 1)^2 ], with z_d = x_d - o_d + 1
The random offset vector O = (o_1, ..., o_10) is defined by its C code below. This is the solution point.
There is also a local minimum at (o_1 - 2, o_2, ..., o_10), where the fitness value is 394.

Tripod
f(x) = (1 - sign(x_2))/2 × (|x_1| + |x_2 + 50|)
     + (1 + sign(x_2))/2 × [ (1 - sign(x_1))/2 × (1 + |x_1 + 50| + |x_2 - 50|)
                           + (1 + sign(x_1))/2 × (2 + |x_1 - 50| + |x_2 - 50|) ]
with sign(x) = -1 if x ≤ 0, and 1 otherwise.
The solution point is (0, -50), where f = 0.
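The Tripod formula is easy to transcribe and check numerically (our transcription, not the benchmark's own code):

```c
#include <math.h>

/* sign as defined above: -1 if x <= 0, 1 otherwise */
static double sgn(double x) { return x <= 0.0 ? -1.0 : 1.0; }

/* Tripod: global minimum 0 at (0, -50); local minima near (-50, 50),
   value 1, and (50, 50), value 2. */
double tripod(double x1, double x2)
{
    return (1.0 - sgn(x2)) / 2.0 * (fabs(x1) + fabs(x2 + 50.0))
         + (1.0 + sgn(x2)) / 2.0
           * ((1.0 - sgn(x1)) / 2.0 * (1.0 + fabs(x1 + 50.0) + fabs(x2 - 50.0))
            + (1.0 + sgn(x1)) / 2.0 * (2.0 + fabs(x1 - 50.0) + fabs(x2 - 50.0)));
}
```

Evaluating the three optima confirms why the problem is deceptive: the two local basins, with values 1 and 2, together cover half of the search space.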
Offset for Sphere/Parabola (C source code)
static double offset_0[30] = {
 -3.9311900e+001, 5.8899900e+001, -4.6322400e+001, -7.4651500e+001, -1.6799700e+001,
 -8.0544100e+001, -1.0593500e+001, 2.4969400e+001, 8.9838400e+001, 9.1119000e+000,
 -1.0744300e+001, -2.7855800e+001, -1.2580600e+001, 7.5930000e+000, 7.4812700e+001,
 6.8495900e+001, -5.3429300e+001, 7.8854400e+001, -6.8595700e+001, 6.3743200e+001,
 3.1347000e+001, -3.7501600e+001, 3.3892900e+001, -8.8804500e+001, -7.8771900e+001,
 -6.6494400e+001, 4.4197200e+001, 1.8383600e+001, 2.6521200e+001, 8.4472300e+001};

Offset for Rosenbrock (C source code)
static double offset_2[10] = {
 8.1023200e+001, -4.8395000e+001, 1.9231600e+001, -2.5231000e+000, 7.0433800e+001,
 4.7177400e+001, -7.8358000e+000, -8.6669300e+001, 5.7853200e+001};
9.2 Formulae for the other moderately multimodal problems
9.2.1 Schwefel
The function to minimise is

f(x) = -450 + Σ_{d=1}^{10} ( Σ_{k=1}^{d} (x_k - o_k) )^2

The search space is [-100, 100]^10. The solution point is the offset O = (o_1, ..., o_10), where f = -450.
Offset (C source code)
static double offset_4[30] =
{3.5626700e+001, -8.2912300e+001, -1.0642300e+001, -8.3581500e+001, 8.3155200e+001, 4.7048000e+001,
 -8.9435900e+001, -2.7421900e+001, 7.6144800e+001, -3.9059500e+001};
9.2.2 Pressure vessel
Just in short; for more details, see [9, 10, 11]. There are four variables:

x_1 ∈ [1.125, 12.5], granularity 0.0625
x_2 ∈ [0.625, 12.5], granularity 0.0625
x_3 ∈ ]0, 240]
x_4 ∈ ]0, 240]

and three constraints:

g_1 := 0.0193 x_3 - x_1 ≤ 0
g_2 := 0.00954 x_3 - x_2 ≤ 0
g_3 := 750 × 1728 - π x_3^2 (x_4 + (4/3) x_3) ≤ 0

The function to minimise is

f = 0.6224 x_1 x_3 x_4 + 1.7781 x_2 x_3^2 + x_1^2 (3.1611 x_4 + 19.84 x_3)

The analytical solution is (1.125, 0.625, 58.2901554, 43.6926562), which gives the fitness value 7,197.72893. To take the constraints into account, a penalty method is used.
9.2.3 Compression spring
For more details, see [9, 10, 11]. There are three variables:

x_1 ∈ {1, ..., 70}, granularity 1
x_2 ∈ [0.6, 3]
x_3 ∈ [0.207, 0.5], granularity 0.001

and five constraints:

g_1 := 8 C_f F_max x_2 / (π x_3^3) - S ≤ 0
g_2 := l_f - l_max ≤ 0
g_3 := σ_p - σ_pm ≤ 0
g_4 := σ_p - F_p/K ≤ 0
g_5 := σ_w - (F_max - F_p)/K ≤ 0

with

C_f = 1 + 0.75 x_3/(x_2 - x_3) + 0.615 x_3/x_2
F_max = 1000
S = 189000
l_f = F_max/K + 1.05 (x_1 + 2) x_3
l_max = 14
σ_p = F_p/K
σ_pm = 6
F_p = 300
K = 11.5 × 10^6 x_3^4 / (8 x_1 x_2^3)
σ_w = 1.25

and the function to minimise is

f = π^2 x_2 x_3^2 (x_1 + 2) / 4

The best known solution is (7, 1.386599591, 0.292), which gives the fitness value 2.6254214578. To take the constraints into account, a penalty method is used.
9.2.4 Gear train
For more details, see [9, 11]. The function to minimise is

f(x) = ( 1/6.931 - x_1 x_2 / (x_3 x_4) )^2

The search space is {12, 13, ..., 60}^4. There are several solutions, depending on the required precision. For example, f(19, 16, 43, 49) = 2.7 × 10^-12.
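This value is easy to verify with a direct transcription of the fitness (our code):

```c
/* Gear train fitness: squared difference between the target ratio 1/6.931
   and the gear ratio x1*x2/(x3*x4). */
double gear_train(int x1, int x2, int x3, int x4)
{
    double r = 1.0 / 6.931 - (double)(x1 * x2) / (double)(x3 * x4);
    return r * r;
}
```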
9.3 Formulae for the highly multimodal problems
9.3.1 Rastrigin
The function to minimise is

f = -230 + Σ_{d=1}^{10} [ (x_d - o_d)^2 - 10 cos(2π (x_d - o_d)) ]

The search space is [-5, 5]^10. The solution point is the offset O = (o_1, ..., o_10), where f = -330.
Offset (C source code)
static double offset_3[30] =
{1.9005000e+000, -1.5644000e+000, -9.7880000e-001, -2.2536000e+000, 2.4990000e+000, -3.2853000e+000,
 9.7590000e-001, -3.6661000e+000, 9.8500000e-002, -3.2465000e+000};
9.3.2 Griewank
The function to minimise is

f = -179 + Σ_{d=1}^{10} (x_d - o_d)^2 / 4000 - Π_{d=1}^{10} cos( (x_d - o_d) / √d )

The search space is [-600, 600]^10. The solution point is the offset O = (o_1, ..., o_10), where f = -180.
Offset (C source code)
static double offset_5[30] =
{-2.7626840e+002, -1.1911000e+001, -5.7878840e+002, -2.8764860e+002, -8.4385800e+001, -2.2867530e+002,
 -4.5815160e+002, -2.0221450e+002, -1.0586420e+002, -9.6489800e+001};
9.3.3 Ackley
The function to minimise is

f = -120 + e - 20 e^{-0.2 √((1/D) Σ_{d=1}^{10} (x_d - o_d)^2)} - e^{(1/D) Σ_{d=1}^{10} cos(2π (x_d - o_d))}

with D = 10. The search space is [-32, 32]^10. The solution point is the offset O = (o_1, ..., o_10), where f = -140.
Offset (C source code)
static double offset_6[30] =
{-1.6823000e+001, 1.4976900e+001, 6.1690000e+000, 9.5566000e+000, 1.9541700e+001, -1.7190000e+001,
 -1.8824800e+001, 8.5110000e-001, -1.5116200e+001, 1.0793400e+001};
9.3.4 Cellular phone
This problem arises in a real application, on which I have worked in the telecommunications domain. However, here, all constraints have been removed, except of course the ones given by the search space itself. We have a square flat domain [0, 100]^2, in which we want to put M stations. Each station m_k has two coordinates (m_{k,1}, m_{k,2}). These are the 2M variables of the problem. We consider each "integer" point of the domain, i.e. (i, j), i ∈ {0, 1, ..., 100}, j ∈ {0, 1, ..., 100}. On each integer point, the field induced by the station m_k is given by

f_{i,j,m_k} = 1 / ( (i - m_{k,1})^2 + (j - m_{k,2})^2 + 1 )

and we want to have at least one field that is not too weak. Finally, the function to minimise is

f = 1 / ( Σ_{i=1}^{100} Σ_{j=1}^{100} max_k (f_{i,j,m_k}) )

In this paper, we set M = 10. Therefore the dimension of the problem is 20. The objective value is 0.005530517. This is not the true minimum, but it is enough from an engineering point of view. Of course, in reality we do not know the objective value. We just run the algorithm several times for a given number of fitness evaluations, and keep the best solution. Figure 5 shows a solution found by SPSO 07 after 20000 fitness evaluations. Actually, for this simplified problem, more efficient methods do exist (Delaunay's tessellation, for example), but those cannot be used as soon as we introduce a third dimension and more constraints, so that the field is not spherical anymore.
Fig. 5: Cellular phone problem. A possible (approximate) solution for 10 stations, found by SPSO 07 after 20000 fitness evaluations
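Our transcription of this fitness (the M stations are given as a flat array of 2M coordinates; the loops follow the displayed sums, i and j from 1 to 100):

```c
/* Cellular phone fitness: inverse of the summed best field over the grid.
   m[2k] and m[2k+1] are the coordinates of station k. */
double cellular_phone(const double *m, int M)
{
    double total = 0.0;
    for (int i = 1; i <= 100; i++)
        for (int j = 1; j <= 100; j++) {
            double best = 0.0;
            for (int k = 0; k < M; k++) {
                double di = i - m[2 * k], dj = j - m[2 * k + 1];
                double field = 1.0 / (di * di + dj * dj + 1.0);
                if (field > best) best = field;
            }
            total += best;
        }
    return 1.0 / total;
}
```

A quick sanity check: adding a station can only increase the best field at every grid point, so the fitness can only decrease.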
9.4 A possible simplification
We may wonder whether the two initialisation methods used in 5.2 are really useful or not. Let us try just the bi-strategy option, by keeping the initialisations of SPSO 07. Results are in Table 10. When we compare these results with those given in Table 6, we can see that for the three functions, the results are not as good. However, they are not bad at all. So, for simplicity, it may be perfectly acceptable to use just PSO R3.
Tab. 10: Results with just the bi-strategy option (PSO R3)

                      Success rate   Mean best
CEC 2005 Sphere       %
CEC 2005 Rosenbrock   %
Tripod                60.6%          0.3556
9.5 When the mean best may be meaningless
On the Rosenbrock function, we have quickly seen that the mean best depends heavily on the number of runs (see Table 3), and therefore is not an acceptable performance criterion. Here is a more detailed explanation of this phenomenon. First, we show experimentally that the distribution of the errors for this function is not Gaussian and, more precisely, that the probability of a very bad run (i.e. a very high fitness value) is not negligible. Then, and more generally, assuming that for a given problem this property is true, a simple probabilistic analysis explains why the success rate is a more reliable criterion.
9.5.1 Distribution of the errors for Rosenbrock
We run the algorithm 5000 times, with 5000 fitness evaluations for each run, i.e. just enough to have a non zero success rate. Each time, we save the best value found. We can then estimate the shape of the distribution of these 5000 values, seen as occurrences of a random variable. Contrary to what is sometimes said, this distribution is far from a normal (Gaussian) one. Indeed, the main peak is very acute, and there are some very high values. Even if these are rare, it implies that the mean value is not really representative of the performance of the algorithm. It would be better to consider the value on which the highest peak (the mode) lies. For SPSO 07, it is about 7 (the right value is 0), and the mean is 25101.4 (there are a few very bad runs). As we can see from Figure 6, we have a quite nice model by using the union of a power law (on the left of the main peak) and a Cauchy law (on the right):

frequency(class) = α (class/m)^k                      if class ≤ m
                 = (1/π) γ / ((class - m)^2 + γ^2)    otherwise

with γ = 1.294, m = 7, and k = 6.5. Note that a second power law for the right part of the curve (instead of the Cauchy one) would not be suitable: although it could be better for class values smaller than, say, 15, it would miss the important fact that the probability of high values is far from zero. Actually, even the Cauchy model is overly optimistic, as we can see from the magnified version (classes 40-70) of Figure 6, but at least the probability is not virtually equal to zero, as with the power law model.
For PSO R3P2V1, the mode is about 6, i.e. just slightly better. However, the mean is only 3962.1. It shows that this version is a bit more robust (very bad runs do not exist). For both algorithms, the small peak (around 10, as the right value is 4) corresponds to a local optimum. The small valley (around 3) is also due to the local optimum: sometimes (but very rarely) the swarm is quickly trapped into it. It shows that as soon as there are local optima, the distribution necessarily has some peaks, at least for a small number of fitness evaluations.
9.5.2 Mean best vs success rate as criterion
A run is said to be successful if the final value is smaller than a small ε, and bad if the final value is greater than a big M. For one run, let p_M be the probability of that run being bad. Then the probability, over N runs, that at least one of the runs is bad is

p_{M,N} = 1 - (1 - p_M)^N

This probability increases quickly with the number of runs. Now, let f_i be the final value of run i. The estimate of the mean best value is usually given by

µ_N = (1/N) Σ_{i=1}^{N} f_i

Let us say the success rate is ς. It means we have ςN successful runs. Let us consider another sequence of N runs, exactly the same, except that k runs are replaced by bad ones, and let m be the maximum of the corresponding f_i in the first sequence of N runs. The probability of this event is

p_{M,N,k} = p_M^k (1 - p_M)^{N-k}

For the new success rate ς', we have

ς ≥ ς' ≥ ς - k/N

For the new estimate µ'_N of the mean best, we have

µ'_N > µ_N + k(M - m)/N

We immediately see that there is a problem when a big value M is possible with a non negligible probability: when the number of runs N increases, the success rate may slightly decrease, but the mean then dramatically increases. Let us suppose that, for a given problem and a given algorithm, the distribution of the errors follows a Cauchy law. Then we have
(a) Global shape  (b) Zoom on classes 40 to 70
Fig. 6: Rosenbrock. Distribution of the best value over 5000 runs. On the zoom, we can see that the Cauchy model, although optimistic, gives a better idea of the distribution than the power law model for class values greater than 40
p_M = 0.5 - (1/π) arctan(M/γ)

With the parameters of the model of Figure 6, we have for example p_5000 = 8.3 × 10^-5. Over N = 30 runs, the probability of having at least one bad run (fitness value greater than M = 5000) is low, just 2.5 × 10^-3. Let us say we find an estimate of the mean to be m. Over N = 1000 runs, the probability is 0.08, which is quite high: it may easily happen. In such a case, even if for all the other runs the best value is about m, the new estimate is about (999m + 5000)/1000, which may be very different from m. In passing, and if we look at Table 3, this simplified explanation shows that for Rosenbrock, a Cauchy law based model is indeed optimistic.
In other words, if the number of runs is too small, you may never have a bad one, and therefore wrongly estimate the mean best, even when it exists. Note that in certain cases the mean may not even exist at all (for example, in the case of a Cauchy law), and therefore any estimate of a mean best is wrong. That is why it is important to estimate the mean for different N values (but of course with the same number of fitness evaluations). If it does not seem stable, forget this criterion, and just consider the success rate or, as seen above, the mode. As there are a lot of papers in which the probable existence of the mean is not checked, it is worth insisting on it: if there is no mean, giving an estimate of it is not technically correct. Worse, comparing two algorithms based on such an estimate is simply wrong.
References
[1] PSC, Particle Swarm Central, http://www.particleswarm.info.
[2] CEC, Congress on Evolutionary Computation Benchmarks, http://www3.ntu.edu.sg/home/epnsugan/, 2005.
[3] L. Gacôgne, "Steady state evolutionary algorithm with an operator family", in EISCI, (Košice, Slovakia), pp. 373-379, 2002.
[4] M. Clerc, "Math Stuff about PSO", http://clerc.maurice.free.fr/pso/.
[5] G. Marsaglia and A. Zaman, "The KISS generator", tech. rep., Dept. of Statistics, University of Florida, 1993.
[6] T.-T. Wong, W.-S. Luk, and P.-A. Heng, "Sampling with Hammersley and Halton points", Journal of Graphics Tools, vol. 2(2), pp. 9-24, 1997.
[7] M. Clerc, "The mythical balance, or when PSO does not exploit", Tech. Rep. MC2008-10-31, 2008.
[8] I. Zelinka, "SOMA - Self-Organizing Migrating Algorithm", in New Optimization Techniques in Engineering, pp. 168-217. Heidelberg, Germany: Springer, 2004.
[9] E. Sandgren, "Nonlinear integer and discrete programming in mechanical design optimization", 1990. ISSN 0305-2154.
[10] M. Clerc, Particle Swarm Optimization. ISTE (International Scientific and Technical Encyclopedia), 2006.
[11] G. C. Onwubolu and B. V. Babu, New Optimization Techniques in Engineering. Berlin, Germany: Springer, 2004.