• Aucun résultat trouvé

Benchmarking Multivariate Solvers of SciPy on the Noiseless Testbed

N/A
N/A
Protected

Academic year: 2021

Partager "Benchmarking Multivariate Solvers of SciPy on the Noiseless Testbed"

Copied!
10
0
0

Texte intégral

(1)

HAL Id: hal-02160099

https://hal.inria.fr/hal-02160099

Submitted on 19 Jun 2019

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Benchmarking Multivariate Solvers of SciPy on the Noiseless Testbed

Konstantinos Varelas, Marie-Ange Dahito

To cite this version:

Konstantinos Varelas, Marie-Ange Dahito. Benchmarking Multivariate Solvers of SciPy on the Noise-

less Testbed. GECCO 2019 - The Genetic and Evolutionary Computation Conference, Jul 2019,

Prague, Czech Republic. �10.1145/3319619.3326891�. �hal-02160099�

(2)

Benchmarking Multivariate Solvers of SciPy on the Noiseless Testbed

Konstantinos Varelas

1,2

and Marie-Ange Dahito

1,3

1Inria, CMAP, École Polytechnique, IP Paris

2Thales LAS France SAS

3PSA Group

[email protected]

ABSTRACT

In this article we benchmark eight multivariate local solvers as well as the global Differential Evolution algorithm from the Python SciPy library on the BBOB noiseless testbed. We experiment with different parameter settings and termination conditions of the solvers. More focus is given to the L-BFGS-B and Nelder-Mead algorithms. For the first we investigate the effect of the maximum number of variable metric corrections used for the Hessian approximation and show that larger values than the default are of advantage. For the second we investigate the effect of adaptation of parameters, which is proved crucial for the performance of the method with increasing dimensionality.

CCS CONCEPTS

•Computing methodologies→Continuous space search;

KEYWORDS

Benchmarking, Black-box optimization, local solvers, SciPy, Differ- ential Evolution

ACM Reference format:

Konstantinos Varelas1,2and Marie-Ange Dahito1,3. 2019. Benchmarking Multivariate Solvers of SciPy on the Noiseless Testbed. InProceedings of Genetic and Evolutionary Computation Conference Companion, Prague, Czech Republic, July 13–17, 2019 (GECCO ’19 Companion),9 pages.

https://doi.org/10.1145/3319619.3326891

1 INTRODUCTION

A quantified performance comparison of optimization solvers is an important task for algorithm selection. Methods based on funda- mentally different approaches, e.g. gradient-based methods, model- building methods, stochastic search algorithms etc. can be beneficial in particular groups of problems. Thus, assessing their performance on a well-understood set of problems such as the BBOB Noiseless testbed [8] is particularly helpful in order to expose their advan- tages and drawbacks and better understand them. Moreover, it is

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

GECCO ’19 Companion, July 13–17, 2019, Prague, Czech Republic

© 2019 Association for Computing Machinery.

ACM ISBN 978-1-4503-6748-6/19/07. . . $15.00 https://doi.org/10.1145/3319619.3326891

useful to compare how particular implementations of such methods are evolved and improved over time.

In this article, a variety of solvers either in a black-box setting or in a gradient based setting with approximation of the gradients is benchmarked. In particular, multivariate optimization solvers from the latest version of the Python SciPy1library are compared, under default or modified parameter settings. A similar study for a previ- ous version of SciPy, that benchmarked six solvers of the library under default parameters has been presented in [1], where the Basin Hopping [21] restart strategy was used within each independent restart. In this study we follow a policy where independent restarts are applied when the corresponding termination criteria are met, until a given budget of function evaluations is exhausted. Based on a preliminary experimentation, we choose proper parameter settings and termination conditions for some algorithms such that their performance is not deteriorated.

The contribution in comparison to [1] is threefold: for the com- mon benchmarked solvers, we compare the different parameter settings and restart policies. Furthermore, complete data sets for all dimensions are included (in [1] the results were restricted to dimen- sions 2, 5 and 20) and three additional solvers are benchmarked.

2 BENCHMARKED ALGORITHMS AND THEIR PARAMETER SETTING

In order to investigate such effects and identify proper settings prior to the performance comparison of all solvers, experimentation was performed separately up to some extent, concerning in most, but not all, cases the termination tolerances in search and objective space. The following algorithms were benchmarked, where a star indicates those included in [1] as described above: Nelder-Mead, Powell, BFGS, L-BFGS-B, Conjugate Gradient, Truncated New- ton, Differential Evolution, COBYLA and SLSQP.

In the case of the quasi Newton L-BFGS-B algorithm [12] for high dimensional optimization, reducing the tolerance in objective val- ues (ftolparameter with default value 10−8) can be of advantage, in particular for ill-conditioned functions and for the Attractive Sector function, as presented in Figures 1 and 2. This performance improvement becomes more significant with increasing dimension- ality. Thus, in our experimentation it was set to the float machine precision2for very high accuracy. More importantly, the maximum number of variable metric corrections for the Hessian approxima- tion has to be set carefully. The default value is 10 and experiments showed performance improvement for increasing values up to 2×D,

1Version: 1.2.1

2Equal to2.220446049250313×10−16

(3)

GECCO ’19 Companion, July 13–17, 2019, Prague, Czech Republic Konstantinos Varelas et. al.

Dbeing the problem dimension, as illustrated in Figure 3. Thus it was set to this value in our comparison. Furthermore, the effect of decreasing the step length for the finite difference approximation of the gradient was investigated to some extent: decreasing the default value of this parameter (10−8) can improve the performance on particular functions, such as the Ellipsoid, while it shows worse success ratio on others. A more detailed study was presented in [2]

and in the following comparison it is set to its default value.

The Nelder-Mead [14] simplex method is tested both in its default setting and with adaptation of parameters to the dimensionality of the problem [4], controlled by theadaptiveflag.

For the modified Powell’s conjugate direction algorithm [16]

[18], the parameterftolwas set to 10−15. As for the termination tolerance in the search space, different values ofxtolwere tested in the set{10−2,10−3,10−5,10−6}. Values larger than the default (10−4) typically can make the solver faster only for the easiest targets, while smaller values can show an improved success rate for high budgets. In the following the default value was chosen.

The truncated Newton algorithm [15] [13] requires an estimation of the optimalf value. Since it always lies in(︀−1000,1000⌋︀[9] and in accordance to the black-box setting where no prior information is available for the function, we set this value to−1000.

The SLSQP method that uses Sequential Least-Squares Program- ming [11] has been tested for different values offtol(10−6, 10−9, 10−12and 10−15). Same as L-BFGS-B and Powell’s algorithm, it was sensitive to this parameter, that was set to 10−15in the performance comparison.

The original BFGS [15] method, the conjugate gradient algorithm by Polak and Ribiere [15], the global optimization Differential Evo- lution [20] method as well as the Constrained Optimization BY Linear Approximation (COBYLA) algorithm [17] are benchmarked in their default setting.

In cases where the solver supported constraint handling, no constraints were applied. Finally, the maximum iterations were set to values large enough (wherever applicable), in order to avoid termination before convergence.

3 EXPERIMENTAL PROCEDURE

All solvers were run on thebbobtestbed with restarts for a max- imum budget of 105×Dfunction evaluations (at minimum, for solvers that did not support a termination callback such as COBYLA).

For all runs, the initial point was chosen uniformly at random in (︀−4,4⌋︀Dand with the function value evaluated at this point. In the special case of Differential Evolution where no initial point is given, the domain bounds were set as(︀−5,5⌋︀D.

4 CPU TIMING

The Python code was run on several multicore machines (not ex- clusively) with different number of cores. The time per function evaluation, measured in 10−5seconds, for different dimensions along with the corresponding processor type is presented in Ta- ble 1.

Algorithm Processor Type 2-D 3-D 5-D 10-D 20-D 40-D

Nelder-Mead

Intel(R) Xeon(R) CPU E5-2667 v3

@ 3.20GHz 2.2 2.3 2.4 2.9 3.5 5.0

Adaptive Nelder-Mead

Intel(R) Xeon(R) CPU E5-2667 v3

@ 3.20GHz 2.1 2.1 2.2 2.6 3.3 4.8

Powell

Intel Core Haswell, no TSX

@ 2.29GHz 2.1 2.5 2.4 2.5 2.9 4.0

BFGS

Intel(R) Xeon(R) CPU E5-2667 v3

@ 3.20GHz 2.7 2.7 2.6 2.7 3.3 5.2

L-BFGS-B

Intel(R) Xeon(R) CPU X5650

@ 2.67GHz

2.9 2.3 2.1 2.3 3.1 5.4

Conjugate Gradient

Intel(R) Xeon(R) CPU E5-2667 v3

@ 3.20GHz

2.2 1.9 1.4 1.3 1.9 3.8

Truncated Newton

Intel(R) Xeon(R) CPU E5-2683 v4

@ 2.10GHz

1.4 1.4 2.5 2.0 2.8 5.0

Differential Evolution

Intel(R) Xeon(R) CPU X5650

@ 2.67GHz

8.4 8.4 8.5 9.3 11.0 14.0

COBYLA

Intel Core Haswell, no TSX

@ 2.29GHz

0.51 0.53 0.65 0.96 2.1 8.2

SLSQP

Intel Core Haswell, no TSX

@ 2.29GHz

2.0 1.9 1.8 2.1 1.9 2.2 Table 1: CPU timing per function evaluation

5 RESULTS

Results from experiments according to [10] and [5] on the bench- mark functions given in [3, 8] are presented in Figures 4, 5, 6, 7, and 8. The experiments were performed with COCO [7], version 2.3, the plots were produced with version 2.3. The solvers benchmarked in [1] are denoted by a prefix “B-” in the corresponding name and the data were obtained by the data archive that COCO provides.

Theaverage runtime (aRT), used in the figures, depends on a given target function value,ft=fopt+∆f, and is computed over all relevant trials as the number of function evaluations executed during each trial while the best function value did not reachft, summed over all trials and divided by the number of trials that actually reachedft[6, 19].Statistical significanceis tested with the rank-sum test for a given target∆ftusing, for each trial, either the number of needed function evaluations to reach∆ft(inverted and multiplied by−1), or, if the target was not reached, the best∆f- value achieved, measured only up to the smallest number of overall function evaluations for any unsuccessful trial under consideration.

6 OBSERVATIONS AND CONCLUSION

Aggregated results over all 24 functions of the suite show the effec- tiveness of SLSQP. In dimension 5, it has the highest success rate for a budget range(︀18D,800D⌋︀while in dimension 20, it dominates all solvers up to∼1400Dfunction evaluations, after which it is outperformed by L-BFGS-B.

(4)

0 2 4 6 log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs 1e-10

1e-11 1e-12 1e-13 1e-14 1e-15 best 2009

bbob f2, 20-D 51 targets: 100..1e-08 15 instances

v2.3.13

2 Ellipsoid separable

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs 1e-10

1e-11 1e-12 1e-13 1e-14 1e-15 best 2009

bbob f11, 20-D 51 targets: 100..1e-08 15 instances

v2.3.13

11 Discus

Figure 1: Bootstrapped empirical cumulative distribution of the number of objective function evaluations divided by dimen- sion (FEvals/DIM) for51targets with target precision in10(︀−8..2⌋︀of the ill-conditioned separable Ellipsoid and Discus functions in 20-D for L-BFGS-B. The graphs correspond to different values off-tolerance for termination.

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs 1e-10

1e-11 1e-12 1e-15 1e-14 1e-13 best 2009

bbob f6, 5-D 51 targets: 100..1e-08 15 instances

v2.3.13

6 Attractive sector

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs 1e-10

1e-11 1e-12 1e-13 1e-14 1e-15 best 2009

bbob f6, 20-D 51 targets: 100..1e-08 15 instances

v2.3.13

6 Attractive sector

Figure 2: Bootstrapped empirical cumulative distribution of the number of objective function evaluations divided by dimen- sion (FEvals/DIM) for51targets with target precision in10(︀−8..2⌋︀of the Attractive Sector function in 5-D and 20-D for L-BFGS-B.

The graphs correspond to different values off-tolerance for termination.

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs m10

m35 m15 m50 m45 m25 m40 m20 m30 best 2009

bbob f10, 10-D 51 targets: 100..1e-08 15 instances

v2.3.13

10 Ellipsoid

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs m15

m10 m20 m25 m45 m35 m50 m30 m40 best 2009

bbob f10, 20-D 51 targets: 100..1e-08 15 instances

v2.3.13

10 Ellipsoid

Figure 3: Bootstrapped empirical cumulative distribution of the number of objective function evaluations divided by dimen- sion (FEvals/DIM) for51targets with target precision in10(︀−8..2⌋︀of the ill-conditioned separable Ellipsoid function in 10-D and 20-D for L-BFGS-B. The graphs correspond to different values of maximum number of variable metric corrections for the Hessian approximation.

It is interesting to see the performance difference of SLSQP and B-SLSQP in unimodal functions such as the separable Ellipsoid function: in 5D, the runtimes are almost equal for the easiest targets and then SLSQP is faster by an increasing factor, until termination criteria start to become effective. Performance differences between the early and recent implementation of SciPy, which are not due to the different parameter setting or restart policy of [1], are also

observed for BFGS and Nelder-Mead, showing an improvement of the library implementation.

Comparing BFGS and L-BFGS-B, the latter can show better per- formance for some functions inalldimensions, as it is the case for the Sphere, Linear Slope, original and rotated Rosenbrock and Bent Cigar functions. Overall, the picture is more diverse: BFGS has same or higher success rate in the budget ranges(︀25D,125D⌋︀

(5)

GECCO ’19 Companion, July 13–17, 2019, Prague, Czech Republic Konstantinos Varelas et. al.

2 3 5 10 20 40

1 3 5

15 instances target Df: 1e-8 v2.3.3

1 Sphere

B-BFGS B-CG B-L-BFGS-B B-Nelder-Mead B-Powell B-SLSQP BFGS CG COBYLA DE L-BFGS-B Nelder-Mead Powell SLSQP TNC adapt-Nelder-M

2 3 5 10 20 40

1 3 5

15 instances target Df: 1e-8 v2.3.3

2 Ellipsoid separable

2 3 5 10 20 40

1 3 5

15 instances target Df: 1e-8 v2.3.3

3 Rastrigin separable

2 3 5 10 20 40

1 3 5

15 instances target Df: 1e-8 v2.3.3

4 Skew Rastrigin-Bueche separ

2 3 5 10 20 40

0 1 2 3 4 5

15 instances target Df: 1e-8 v2.3.3

5 Linear slope

2 3 5 10 20 40

1 3 5

15 instances target Df: 1e-8 v2.3.3

6 Attractive sector

2 3 5 10 20 40

1 3 5

15 instances target Df: 1e-8 v2.3.3

7 Step-ellipsoid

2 3 5 10 20 40

0 1 2 3 4 5

15 instances target Df: 1e-8 v2.3.3

8 Rosenbrock original

2 3 5 10 20 40

0 1 2 3 4 5

15 instances target Df: 1e-8 v2.3.3

9 Rosenbrock rotated

2 3 5 10 20 40

1 3 5 7

15 instances target Df: 1e-8 v2.3.3

10 Ellipsoid

2 3 5 10 20 40

1 3 5

15 instances target Df: 1e-8 v2.3.3

11 Discus

2 3 5 10 20 40

1 3 5 7

15 instances target Df: 1e-8 v2.3.3

12 Bent cigar

2 3 5 10 20 40

1 3 5 7

15 instances target Df: 1e-8 v2.3.3

13 Sharp ridge

2 3 5 10 20 40

1 3 5

15 instances target Df: 1e-8 v2.3.3

14 Sum of different powers

2 3 5 10 20 40

1 3 5

15 instances target Df: 1e-8 v2.3.3

15 Rastrigin

2 3 5 10 20 40

1 3 5

15 instances target Df: 1e-8 v2.3.3

16 Weierstrass

2 3 5 10 20 40

0 1 2 3 4 5

15 instances target Df: 1e-8 v2.3.3

17 Schaffer F7, condition 10

2 3 5 10 20 40

0 1 2 3 4 5

15 instances target Df: 1e-8 v2.3.3

18 Schaffer F7, condition 1000

2 3 5 10 20 40

1 3 5 7

15 instances target Df: 1e-8 v2.3.3

19 Griewank-Rosenbrock F8F2

2 3 5 10 20 40

1 3 5

15 instances target Df: 1e-8 v2.3.3

20 Schwefel x*sin(x)

2 3 5 10 20 40

1 3 5

15 instances target Df: 1e-8 v2.3.3

21 Gallagher 101 peaks

2 3 5 10 20 40

1 3 5

15 instances target Df: 1e-8 v2.3.3

22 Gallagher 21 peaks

2 3 5 10 20 40

1 3 5

15 instances target Df: 1e-8 v2.3.3

23 Katsuuras

2 3 5 10 20 40

1 3 5 7

15 instances target Df: 1e-8 v2.3.3

24 Lunacek bi-Rastrigin

B-BFGS B-CG B-L-BFGS-B B-Nelder-Mead B-Powell B-SLSQP BFGS CG COBYLA DE L-BFGS-B Nelder-Mead Powell SLSQP TNC adapt-Nelder-M

Figure 4: Average running time (aRTin number off-evaluations aslog10value), divided by dimension for target function value10−8versus dimension. Slanted grid lines indicate quadratic scaling with the dimension. Different symbols correspond to different algorithms given in the legend off1andf24. Light symbols give the maximum number of function evaluations from the longest trial divided by dimension. Black stars indicate a statistically better result compared to all other algorithms withp<0.01and Bonferroni correction number of dimensions (six). Legend:○: B-BFGS,♢: B-CG,⋆: B-L-BFGS-B,▽: B-Nelder- Mead,9: B-Powell,△: B-SLSQP,D: BFGS,(: CG,+: COBYLA,◇: DE,◁: L-BFGS-B,▽: Nelder-Mead,⋆: Powell,◻: SLSQP,♢: TNC,

△: adapt-Nelder-Mead

and(︀50D,400D⌋︀for dimensions 5 and 20 respectively, while the runtimes always differ less than by a factor of 4.

For Nelder and Mead’s method, adaptation of parameters is cru- cial. Without this option, the algorithm is deteriorated as the di- mension increases. In dimension 20, the smallest target values for

(6)

0 2 4 6 log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs TNCDE

Nelder-Me B-Nelder- adapt-Nel B-SLSQP COBYLA Powell B-Powell CG BFGS B-CG B-L-BFGS- B-BFGS L-BFGS-B best 2009 SLSQP bbob f1, 5-D

51 targets: 100..1e-08 15 instances

v2.3.3

1 Sphere

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs COBYLACG

B-CG TNC DE B-L-BFGS- B-SLSQP B-Nelder- Nelder-Me adapt-Nel L-BFGS-B B-BFGS B-Powell BFGS SLSQP Powell best 2009 bbob f2, 5-D

51 targets: 100..1e-08 15 instances

v2.3.3

2 Ellipsoid separable

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs adapt-NelBFGS

TNC Nelder-Me L-BFGS-B CG SLSQP COBYLA B-Nelder- B-SLSQP B-CG B-BFGS B-L-BFGS- DE Powell B-Powell best 2009 bbob f3, 5-D

51 targets: 100..1e-08 15 instances

v2.3.3

3 Rastrigin separable

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs TNCBFGS

Nelder-Me L-BFGS-B B-CG B-BFGS adapt-Nel SLSQP COBYLA B-L-BFGS- B-Nelder- B-SLSQP CG DE Powell B-Powell best 2009 bbob f4, 5-D

51 targets: 100..1e-08 15 instances

v2.3.3

4 Skew Rastrigin-Bueche separ

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs DE

B-Nelder- Nelder-Me adapt-Nel B-SLSQP TNC Powell B-Powell B-L-BFGS- COBYLA CG BFGS B-CG B-BFGS SLSQP L-BFGS-B best 2009 bbob f5, 5-D

51 targets: 100..1e-08 15 instances

v2.3.3

5 Linear slope

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs COBYLA

TNC Nelder-Me B-Nelder- B-Powell adapt-Nel B-SLSQP DE B-L-BFGS- B-CG CG B-BFGS BFGS SLSQP Powell L-BFGS-B best 2009 bbob f6, 5-D

51 targets: 100..1e-08 15 instances

v2.3.3

6 Attractive sector

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs BFGS

TNC SLSQP L-BFGS-B CG Nelder-Me B-L-BFGS- B-CG B-SLSQP B-BFGS COBYLA adapt-Nel Powell B-Nelder- DE B-Powell best 2009 bbob f7, 5-D

51 targets: 100..1e-08 15 instances

v2.3.3

7 Step-ellipsoid

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs COBYLA

B-Powell TNC B-L-BFGS- B-CG B-SLSQP B-Nelder- DE Powell Nelder-Me CG B-BFGS adapt-Nel BFGS SLSQP L-BFGS-B best 2009 bbob f8, 5-D

51 targets: 100..1e-08 15 instances

v2.3.3

8 Rosenbrock original

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs COBYLA

B-Powell TNC B-Nelder- B-CG DE Powell B-L-BFGS- Nelder-Me CG B-SLSQP adapt-Nel BFGS SLSQP B-BFGS L-BFGS-B best 2009 bbob f9, 5-D

51 targets: 100..1e-08 15 instances

v2.3.3

9 Rosenbrock rotated

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs COBYLA

TNC B-CG CG B-Powell SLSQP DE B-L-BFGS- B-SLSQP B-BFGS BFGS L-BFGS-B B-Nelder- Nelder-Me adapt-Nel Powell best 2009 bbob f10, 5-D

51 targets: 100..1e-08 15 instances

v2.3.3

10 Ellipsoid

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs COBYLA

B-Powell TNC DE B-L-BFGS- B-CG SLSQP CG B-SLSQP BFGS L-BFGS-B B-BFGS B-Nelder- Nelder-Me adapt-Nel Powell best 2009 bbob f11, 5-D

51 targets: 100..1e-08 15 instances

v2.3.3

11 Discus

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs COBYLA

B-Powell B-CG TNC SLSQP B-Nelder- DE B-BFGS B-SLSQP adapt-Nel Nelder-Me CG B-L-BFGS- BFGS L-BFGS-B Powell best 2009 bbob f12, 5-D

51 targets: 100..1e-08 15 instances

v2.3.3

12 Bent cigar

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs COBYLASLSQP

B-Powell B-SLSQP CG TNC BFGS B-CG B-L-BFGS- B-BFGS DE L-BFGS-B adapt-Nel Nelder-Me B-Nelder- Powell best 2009 bbob f13, 5-D

51 targets: 100..1e-08 15 instances

v2.3.3

13 Sharp ridge

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs B-PowellCOBYLA

TNC CG B-SLSQP SLSQP DE B-CG B-L-BFGS- BFGS L-BFGS-B B-BFGS Powell B-Nelder- Nelder-Me adapt-Nel best 2009 bbob f14, 5-D

51 targets: 100..1e-08 15 instances

v2.3.3

14 Sum of different powers

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs CGNelder-Me

adapt-Nel TNC BFGS Powell COBYLA B-Powell B-Nelder- B-SLSQP SLSQP L-BFGS-B DE B-L-BFGS- B-CG B-BFGS best 2009 bbob f15, 5-D

51 targets: 100..1e-08 15 instances

v2.3.3

15 Rastrigin

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs SLSQPTNC

CG B-CG BFGS L-BFGS-B B-BFGS B-L-BFGS- B-SLSQP Powell COBYLA adapt-Nel B-Nelder- B-Powell Nelder-Me DE best 2009 bbob f16, 5-D

51 targets: 100..1e-08 15 instances

v2.3.3

16 Weierstrass

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs Nelder-Meadapt-Nel

TNC B-Nelder- SLSQP BFGS L-BFGS-B Powell CG B-BFGS COBYLA B-CG B-SLSQP B-L-BFGS- B-Powell DE best 2009 bbob f17, 5-D

51 targets: 100..1e-08 15 instances

v2.3.3

17 Schaffer F7, condition 10

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs CGL-BFGS-B

adapt-Nel SLSQP TNC Nelder-Me BFGS B-CG COBYLA B-BFGS B-Nelder- Powell B-L-BFGS- B-SLSQP B-Powell DE best 2009 bbob f18, 5-D

51 targets: 100..1e-08 15 instances

v2.3.3

18 Schaffer F7, condition 1000

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs SLSQPPowell

adapt-Nel TNC B-CG B-Nelder- CG Nelder-Me COBYLA BFGS B-Powell L-BFGS-B B-BFGS B-SLSQP B-L-BFGS- DE best 2009 bbob f19, 5-D

51 targets: 100..1e-08 15 instances

v2.3.3

19 Griewank-Rosenbrock F8F2

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs COBYLAB-Powell

adapt-Nel B-Nelder- Nelder-Me CG TNC Powell B-CG B-SLSQP B-L-BFGS- SLSQP L-BFGS-B B-BFGS DE BFGS best 2009 bbob f20, 5-D

51 targets: 100..1e-08 15 instances

v2.3.3

20 Schwefel x*sin(x)

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs B-CG

B-Nelder- B-SLSQP B-Powell TNC B-L-BFGS- B-BFGS CG DE Powell COBYLA L-BFGS-B BFGS Nelder-Me adapt-Nel SLSQP best 2009 bbob f21, 5-D

51 targets: 100..1e-08 15 instances

v2.3.3

21 Gallagher 101 peaks

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs TNC

B-CG B-Powell B-Nelder- B-SLSQP COBYLA CG DE B-BFGS B-L-BFGS- BFGS SLSQP Nelder-Me adapt-Nel L-BFGS-B Powell best 2009 bbob f22, 5-D

51 targets: 100..1e-08 15 instances

v2.3.3

22 Gallagher 21 peaks

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs TNC

B-BFGS CG B-CG BFGS SLSQP COBYLA DE Powell L-BFGS-B B-Powell B-L-BFGS- B-Nelder- Nelder-Me adapt-Nel B-SLSQP best 2009 bbob f23, 5-D

51 targets: 100..1e-08 15 instances

v2.3.3

23 Katsuuras

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs CG

TNC BFGS adapt-Nel Powell COBYLA L-BFGS-B SLSQP Nelder-Me DE B-Nelder- B-BFGS B-CG B-Powell B-SLSQP B-L-BFGS- best 2009 bbob f24, 5-D

51 targets: 100..1e-08 15 instances

v2.3.3

24 Lunacek bi-Rastrigin

Figure 5: Empirical cumulative distribution of simulated (bootstrapped) runtimes, measured in number of objective function evaluations, divided by dimension (FEvals/DIM) for the51targets10(︀−8..2⌋︀in dimension 5.

the Sphere function are not reached while in the aggregated ECDF the method is dominated by all other solvers and for all budgets.

It is interesting that COBYLA, that is based simply on linear interpolation, often achieves better performance for the fraction of easiest targets than all other solvers e.g. for the Sharp Ridge and Sum of Different Powers functions in 20D, even outperforming the virtual best solver of BBOB 2009 for small budgets. More remarkable is the performance on the multimodal Gallagher and Katsuura functions in 20D, where it is one of the most effective methods.

Finally, Differential Evolution shows the best performance among the other (local) solvers for the group of multimodal functions with adequate global structure, where also the Basin Hopping policy is of advantage. Even though the effectiveness of DE weakens with increasing dimensionality, it maintains the highest success rate in this function group.

(7)

GECCO ’19 Companion, July 13–17, 2019, Prague, Czech Republic Konstantinos Varelas et. al.

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs Nelder-MeB-Nelder-

TNC adapt-Nel COBYLA DE Powell B-Powell B-SLSQP B-CG CG BFGS B-L-BFGS- B-BFGS L-BFGS-B best 2009 SLSQP bbob f1, 20-D

51 targets: 100..1e-08 15 instances

v2.3.3

1 Sphere

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs Nelder-MeCOBYLA

B-Nelder- CG B-L-BFGS- DE B-CG TNC B-SLSQP adapt-Nel L-BFGS-B B-BFGS BFGS SLSQP B-Powell Powell best 2009 bbob f2, 20-D

51 targets: 100..1e-08 15 instances

v2.3.3

2 Ellipsoid separable

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs adapt-NelTNC

SLSQP Nelder-Me CG BFGS L-BFGS-B B-Nelder- COBYLA B-CG B-SLSQP B-L-BFGS- B-BFGS Powell B-Powell DE best 2009 bbob f3, 20-D

51 targets: 100..1e-08 15 instances

v2.3.3

3 Rastrigin separable

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs adapt-NelTNC

SLSQP Nelder-Me L-BFGS-B CG BFGS B-Nelder- COBYLA B-CG B-L-BFGS- B-SLSQP B-BFGS DE Powell B-Powell best 2009 bbob f4, 20-D

51 targets: 100..1e-08 15 instances

v2.3.3

4 Skew Rastrigin-Bueche separ

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs DE

B-Nelder- Nelder-Me adapt-Nel B-SLSQP TNC Powell B-Powell CG COBYLA B-L-BFGS- BFGS B-CG B-BFGS L-BFGS-B SLSQP best 2009 bbob f5, 20-D

51 targets: 100..1e-08 15 instances

v2.3.3

5 Linear slope

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs Nelder-Me

B-Nelder- COBYLA B-Powell Powell TNC B-SLSQP adapt-Nel B-L-BFGS- DE B-BFGS CG BFGS B-CG SLSQP L-BFGS-B best 2009 bbob f6, 20-D

51 targets: 100..1e-08 15 instances

v2.3.3

6 Attractive sector

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs adapt-Nel

TNC Nelder-Me CG BFGS SLSQP L-BFGS-B B-SLSQP B-BFGS B-CG B-L-BFGS- B-Nelder- Powell COBYLA B-Powell DE best 2009 bbob f7, 20-D

51 targets: 100..1e-08 15 instances

v2.3.3

7 Step-ellipsoid

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs Nelder-Me

B-Nelder- B-Powell COBYLA TNC DE B-L-BFGS- B-CG B-SLSQP adapt-Nel B-BFGS Powell CG BFGS SLSQP L-BFGS-B best 2009 bbob f8, 20-D

51 targets: 100..1e-08 15 instances

v2.3.3

8 Rosenbrock original

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs Nelder-Me

B-Nelder- TNC COBYLA B-Powell adapt-Nel DE B-CG B-SLSQP B-L-BFGS- Powell CG B-BFGS BFGS SLSQP L-BFGS-B best 2009 bbob f9, 20-D

51 targets: 100..1e-08 15 instances

v2.3.3

9 Rosenbrock rotated

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs Nelder-Me

COBYLA TNC B-Nelder- B-Powell DE CG B-L-BFGS- B-CG B-SLSQP adapt-Nel SLSQP B-BFGS L-BFGS-B BFGS Powell best 2009 bbob f10, 20-D

51 targets: 100..1e-08 15 instances

v2.3.3

10 Ellipsoid

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs COBYLA

Nelder-Me B-Powell B-Nelder- TNC DE B-L-BFGS- B-CG CG L-BFGS-B B-SLSQP SLSQP BFGS B-BFGS adapt-Nel Powell best 2009 bbob f11, 20-D

51 targets: 100..1e-08 15 instances

v2.3.3

11 Discus

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs B-CG

B-BFGS Nelder-Me COBYLA B-Nelder- adapt-Nel TNC CG B-Powell DE B-SLSQP B-L-BFGS- BFGS SLSQP L-BFGS-B Powell best 2009 bbob f12, 20-D

51 targets: 100..1e-08 15 instances

v2.3.3

12 Bent cigar

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs COBYLAB-Nelder-

Nelder-Me B-Powell SLSQP B-SLSQP DE B-BFGS CG B-CG BFGS TNC B-L-BFGS- L-BFGS-B adapt-Nel Powell best 2009 bbob f13, 20-D

51 targets: 100..1e-08 15 instances

v2.3.3

13 Sharp ridge

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs Nelder-MeB-Powell

COBYLA TNC B-Nelder- B-L-BFGS- DE SLSQP B-CG B-SLSQP CG BFGS L-BFGS-B B-BFGS adapt-Nel Powell best 2009 bbob f14, 20-D

51 targets: 100..1e-08 15 instances

v2.3.3

14 Sum of different powers

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs adapt-NelTNC

Powell Nelder-Me L-BFGS-B COBYLA BFGS CG SLSQP B-Nelder- B-Powell B-CG B-L-BFGS- DE B-BFGS B-SLSQP best 2009 bbob f15, 20-D

51 targets: 100..1e-08 15 instances

v2.3.3

15 Rastrigin

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs CGSLSQP

BFGS TNC L-BFGS-B B-CG B-BFGS B-L-BFGS- adapt-Nel DE Nelder-Me B-SLSQP Powell B-Nelder- B-Powell COBYLA best 2009 bbob f16, 20-D

51 targets: 100..1e-08 15 instances

v2.3.3

16 Weierstrass

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs COBYLAadapt-Nel

CG TNC Powell Nelder-Me B-Nelder- BFGS SLSQP L-BFGS-B B-Powell B-CG B-BFGS B-L-BFGS- B-SLSQP DE best 2009 bbob f17, 20-D

51 targets: 100..1e-08 15 instances

v2.3.3

17 Schaffer F7, condition 10

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs CGPowell

COBYLA Nelder-Me adapt-Nel SLSQP BFGS TNC B-CG L-BFGS-B B-Nelder- B-Powell B-SLSQP B-BFGS B-L-BFGS- DE best 2009 bbob f18, 20-D

51 targets: 100..1e-08 15 instances

v2.3.3

18 Schaffer F7, condition 1000

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs SLSQPadapt-Nel

Powell B-Nelder- CG Nelder-Me B-CG TNC BFGS B-L-BFGS- COBYLA L-BFGS-B B-Powell B-BFGS B-SLSQP DE best 2009 bbob f19, 20-D

51 targets: 100..1e-08 15 instances

v2.3.3

19 Griewank-Rosenbrock F8F2

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs B-CGNelder-Me

B-SLSQP B-Nelder- CG TNC COBYLA B-L-BFGS- L-BFGS-B BFGS B-BFGS SLSQP Powell adapt-Nel B-Powell DE best 2009 bbob f20, 20-D

51 targets: 100..1e-08 15 instances

v2.3.3

20 Schwefel x*sin(x)

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs TNC

B-Powell B-SLSQP B-CG B-BFGS B-Nelder- Nelder-Me DE B-L-BFGS- CG BFGS L-BFGS-B SLSQP adapt-Nel Powell COBYLA best 2009 bbob f21, 20-D

51 targets: 100..1e-08 15 instances

v2.3.3

21 Gallagher 101 peaks

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs B-Nelder-

Nelder-Me TNC B-Powell B-SLSQP B-L-BFGS- DE B-CG B-BFGS CG BFGS L-BFGS-B SLSQP adapt-Nel Powell COBYLA best 2009 bbob f22, 20-D

51 targets: 100..1e-08 15 instances

v2.3.3

22 Gallagher 21 peaks

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs TNC

B-CG B-BFGS CG BFGS B-L-BFGS- L-BFGS-B Powell DE B-Powell SLSQP COBYLA B-Nelder- Nelder-Me B-SLSQP adapt-Nel best 2009 bbob f23, 20-D

51 targets: 100..1e-08 15 instances

v2.3.3

23 Katsuuras

0 2 4 6

log10(# f-evals / dimension) 0.0

0.2 0.4 0.6 0.8 1.0

Fraction of function,target pairs adapt-Nel

TNC Powell Nelder-Me L-BFGS-B COBYLA CG BFGS SLSQP B-Nelder- DE B-CG B-Powell B-BFGS B-SLSQP B-L-BFGS- best 2009 bbob f24, 20-D

51 targets: 100..1e-08 15 instances

v2.3.3

24 Lunacek bi-Rastrigin

Figure 6: Empirical cumulative distribution of simulated (bootstrapped) runtimes, measured in number of objective function evaluations, divided by dimension (FEvals/DIM) for the51targets10(︀−8..2⌋︀in dimension 20.

7 ACKNOWLEDGEMENTS

The PhD thesis of Konstantinos Varelas is funded by the French MoD DGA/MRIS and Thales Land & Air Systems. The PhD thesis of Marie-Ange Dahito is funded by PSA Group.

REFERENCES

[1] P. Baudiš. 2014. COCOpf: An algorithm portfolio framework.arXiv preprint arXiv:1405.3487(2014).

[2] A. Blelly, M. Felipe-Gomes, A. Auger, and D. Brockhoff. 2018. Stopping criteria, initialization, and implementations of BFGS and their effect on the BBOB test suite. InProceedings of the Genetic and Evolutionary Computation Conference

Companion. ACM, 1513–1517.

[3] S. Finck, N. Hansen, R. Ros, and A. Auger. 2009. Real-Parameter Black-Box Optimization Benchmarking 2009: Presentation of the Noiseless Functions. Technical Report 2009/20. Research Center PPE. http://coco.lri.fr/downloads/download15.

03/bbobdocfunctions.pdf Updated February 2010.

[4] F. Gao and L. Han. 2012. Implementing the Nelder-Mead simplex algorithm with adaptive parameters.Comp. Opt. and Appl.51 (2012), 259–277.

[5] N. Hansen, A Auger, D. Brockhoff, D. Tušar, and T. Tušar. 2016. COCO: Perfor- mance Assessment.ArXiv e-printsarXiv:1605.03560 (2016).

[6] N. Hansen, A. Auger, S. Finck, and R. Ros. 2012. Real-Parameter Black-Box Optimization Benchmarking 2012: Experimental Setup. Technical Report. INRIA.

http://coco.gforge.inria.fr/bbob2012-downloads

[7] N. Hansen, A. Auger, O. Mersmann, T. Tušar, and D. Brockhoff. 2016. COCO: A Platform for Comparing Continuous Optimizers in a Black-Box Setting.ArXiv

Références

Documents relatifs