• Aucun résultat trouvé

4 Results and Discussion

The wind farm site is located in Canada and has 29 months of energy data available.

The MOPSO is used to train SVM over the available 17 months of training data (at 6-hourly time resolution) using a “Round-Robin” cross-validation strategy. This gives an opportunity to train on 16 months and test the results on a ‘hold-out’ 1 month test set. By repeating the process for all the 17 months, gives a full 17 months

78 K. Gill et al.

of test data to compare against the observed. Since there is 29 months of data available for this site, a full 1 year of data is used in validation (completely unseen data). The results that follow are the predictions on the validation set. The results are shown on the normalized (between 1 and 1) dataset. The MOPSO-SVM results are shown with and without PCA pre-processing.

The trade-off curve for MOPSO SVM optimization for the two objectives is shown in Fig.4. The trade-off between BinRMS vs. RMSE is shown for the SVM using original input compared against SVM using Principal Components (PCs) as inputs. It can be noticed that there is little difference between the two approaches.

The PCA-SVM produced a better objective function result for BinRMS, where as simple SVM provided a better objective function result for RMSE.

Figure 5 shows the monthly mean wind power for the 12 months compared against the observed data. The results are shown for SVM prediction with and without the pre-processing using PCA. As stated above, there is a very little dif-ference between the two approaches and a good fit has been found. It can be noticed that predictions are in reasonable agreement with the observed data. The results in Fig.6show histogram of observed vs.the predicted wind power data at the 6-hourly time resolution (the prediction time step). The results are shown for SVM prediction with and without the pre-processing using PCA. It can be noticed that the distribu-tions are well-maintained using MOPSO methodology and a reasonable agreement between observed and predicted power is evident from Fig.6. A number of good-ness-of-fit measures are evaluated in Table1, which are monthly root mean square error (RMSE), monthly coefficient of determination (COD), instantaneous RMSE, instantaneous COD, and BinRMSE (histogram bin RMSE). The results in Table1 are presented for SVM prediction with and without the pre-processing using PCA.

Both monthly and instantaneous wind power are of significant interest, and thus are

0.4

Fig. 4 Trade-off curve between the two objectives BinRMS vs. RMSE for SVM and PCA-SVM 6 Multiobjective Evolutionary Optimization and Machine Learning 79

included in the current analysis. It can be noticed that not only monthly but also instantaneous power predictions are in close agreement with the observed.

5 Conclusions

Machine learning methods have gained in popularity in solving prediction problems in the area of geosciences. Due to their ability to model complex processes, the machine learning algorithms are preferred over computationally rigorous process-based

–0.7 –0.6 –0.5 –0.4 –0.3 –0.2 –0.1 0 0.1 0.2 0.3

–0.7 –0.6 –0.5 –0.4 –0.3 –0.2 –0.1 0 0.1 0.2 0.3

Observed

Predicted

SVM PCA-SVM

Fig. 5 Scatter plot for the mean monthly wind energy data for SVM and PCA-SVM

–1 – 0.8 – 0.6 – 0.4 – 0.2 0 0.2 0.4 0.6 0.8 1

0 50 100 150 200 250 300 350 400

Power (kw)

Observed PCA SVM SVM

Fig. 6 Histogram for SVM and PCA-SVM along with observed

80 K. Gill et al.

models which are usually over-parameterized and require immense amounts of data to calibrate. Support Vector Machines are well-suited for the problems exhibiting high degrees of spatial and temporal variability, issues of nonlinearity, conflicting scales, and hierarchical uncertainty.

In the current chapter, a multiobjective evolutionary computing method MOPSO is used to optimize the three parameters of SVM for wind energy predictions. The approach has been tested on data from a wind farm using NCEP’s re-analysis grid data. The prediction strategy employs SVM which is parameterized for the two objective functions. The approach is also tested by pre-processing the input data using PCA. A number of graphical and tabular results in the form of goodness-of-fit measures are presented for wind energy predictions. The results also show a trade-off curve for the two objectives employed in the MOPSO. The trade-trade-off curve is helpful in identifying the appropriate parameter set for SVM in order to achieve the desired accuracy for the two objective problem. The SVM predictions at the farm level produced excellent agreement with the observed data for the validation set. Overall, the results have been encouraging and it is recommended to use MOPSO-SVM approach for other operational projects in the area of renewable energy predictions and forecasting. While further modifications and advancements are underway, the current procedure is sound enough to be applied in operational settings.

References

1. V. Cherkassky, F. Mulier,Learning from Data - Concepts, Theory, and Methods(Wiley, New York, 1998), p. 441

2. B. Scholkopf, K. Sung, J.C. Chris, C. Burges, F. Girosi, T. Poggio, V. Vapnik, Comparing support vector machines with Gaussian kernels to radial basis function classifiers. IEEE Trans.

Signal Process. 45(11), 2758–2765 (1997)

3. V. Vapnik,The Nature of Statistical Learning Theory(Springer, New York, 1995), p. 188 4. V. Vapnik,Statistical Learning Theory(Wiley, Hoboken, NJ, 1998), p. 736

5. N. Cristianini, J. Shaw-Taylor (eds),An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods(Cambridge University Press, Cambridge, 2000), p. 189 6. A. Tikhonov, V. Arsenin,Solution of Ill-posed Problems(W.H. Winston & Sons, Washington,

DC, 1977), p. 258

7. M.K. Gill, T. Asefa, Y. Kaheil, M. McKee, Effect of missing data on performance of learning algorithms for hydrologic predictions: Implications to an imputation technique. Water Resour.

Res. 43, W07416 (2007). doi:10.1029/2006WR005298.

6 Multiobjective Evolutionary Optimization and Machine Learning 81

8. M.K. Gill, Y.H. Kaheil, A. Khalil, M. McKee, L. Bastidas, Multiobjective particle swarm optimization for parameter estimation in hydrology. Water Resour. Res. 42, W07417 (2006).

doi:10.1029/2005WR004528 (2006)

9. R.C. Eberhart, J. Kennedy, A new optimizer using particle swarm theory, inProceedings of the Sixth International Symposium on Micro Machine and Human Science, 1995, MHS’95.

doi:10.1109/MHS.1995.494215 (IEEE Press, Piscataway, NJ (1995), pp. 39–43

10. J. Kennedy, R.C. Eberhart, Particle swarm optimization, inProceedings of IEEE International Conference on Neural Networks, IV, vol. 4. doi:10.1109/ICNN.1995.488968 (IEEE Press, Piscataway, NJ (1995), pp. 1942–1948

11. J. Kennedy, R.C. Eberhart, Y. Shi,Swarm Intelligence(Morgan Kaufmann, San Francisco, CA, 2001)

12. R. C. Eberhart, R. W. Dobbins, P. Simpson,Computational Intelligence PC Tools(Elsevier, New York, 1996)

13. H. Kita, Y. Yabumoto, N. Mori, Y. Nishikawa, Multi-objective optimization by means of the thermodynamical genetic algorithm, inParallel Problem Solving From Nature—PPSN IV, eds. by H.-M. Voigt, W. Ebeling, I. Rechenberg, H.-P. Schwefel (Springer-Verlag Berlin, Germany, 1996), Sept. Lecture Notes in Computer Science, pp. 504–512

14. Kalnay et al. The NCEP/NCAR 40-year reanalysis project. Bull. Am. Meteor. Soc. 77, 437–470 (1996)

82 K. Gill et al.

Chapter 7