• Aucun résultat trouvé

6. RECOMMANDATIONS DE CONCEPTION

6.3 I NSTRUMENTATION D ’ EXPLOITATION

6.3.6 Détecteurs de fuite

bilita o uso de ensembles compostos por árvores de decisão em aplicações de credit-scoring ;

5.2

Trabalhos futuros

Pela possibilidade de variação dos métodos adotados em cada uma das fases da arquitetura proposta, é possível configurá-la de uma forma muito diversificada. Por isso, alguns trabalhos que poderiam ser desenvolvidos a partir de agora seriam:

• Estudar o uso da arquitetura proposta, adotando na fase de poda o mé- todo Ensemble Pruning via Individual Contribution Ordering (EPIC) (Lu

et al., 2010). O EPIC utiliza como critérios de seleção dos classificadores

que formarão o subensemble algumas de suas características individuais, além de medidas de diversidade entre eles. O EPIC superou em desempe- nho o método Orientation-Ordering em experimentos realizados sobre 26 bases de dados UCI, considerando problemas de aprendizagem en- volvendo classificação binária. Embora, dentre estas bases, não estejam relacionadas as utilizadas nos experimentos deste estudo – Australian e

German, é possível que, ao se aplicar este método na arquitetura proposta

neste trabalho, a mesma apresente ainda um melhor desempenho do que com o OO;

• Investigar o desempenho da arquitetura na solução de outros tipos de aplicações, inclusive aquelas que envolvam problemas com múltiplas clas- ses, adotando outros modelos de classificadores base, como por exemplo: redes neurais artificiais ou máquinas de vetor de suporte. É possível que, neste caso, seja interessante ajustar o método de combinação na fase de teste da arquitetura, saindo de uma abordagem no nível abstrato, como o Majority Vote, para outros métodos que possuam abordagens de rank ou medição;

• Avaliar, como alternativa para tratar a questão de redundância de atri- butos, a possibilidade de combinação de outros métodos de seleção de características, aleatórios ou não, ou ainda métodos de extração de carac- terísticas durante a fase de treinamento.

Referências

Abrahams, C. R. e Zhang, M. (2008). Fair lending compliance: intelligence and

implications for credit risk management, volume 13. Wiley. com.

Bache, K. e Lichman, M. (2013). UCI machine learning repository.

Breiman, L. (1996). Bagging predictors. Machine Learning , 24(2), 123–140. Breve, F., Ponti-Junior, M., e Mascarenhas, N. (2007). Multilayer perceptron classifier combination for identification of materials on noisy soil science multispectral images. In Brazilian Symposium on Computer Graphics and

Image Processing , pp. 239–244.

Dai, Q. (2013). A competitive ensemble pruning approach based on cross- validation technique. Knowledge-Based Systems, 37(0), 394 – 414.

Dietterich, T. (2000). Ensemble methods in machine learning. In Multiple

Classifier Systems, volume 1857 of Lecture Notes in Computer Science, pp. 1–15.

Springer Berlin Heidelberg.

Dietterich, T. G. e Bakiri, G. (1995). Solving multiclass learning problems via error-correcting output codes. CoRR, cs.AI/9501101.

Freund, Y. e Schapire, R. E. (1997). A decision-theoretic generalization of on- line learning and an application to boosting. Journal of Computer and System

Sciences, 55(1), 119 – 139.

Fumera, G. e Roli, F. (2005). A theoretical and experimental analysis of li- near combiners for multiple classifier systems. IEEE Transactions on Pattern

Analysis and Machine Intelligence, 27(6), 942–956.

Hand, D. J. e Henley, W. E. (1997). Statistical classification methods in consu- mer credit scoring: a review. Journal of the Royal Statistical Society: Series A

(Statistics in Society), 160(3), 523–541.

Ho, T. K. (1998). The random subspace method for constructing decision forests. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 20(8), 832–844.

5.2. TRABALHOS FUTUROS Ho, T. K. (2001). Data complexity analysis for classifier combination. In

Multiple Classifier Systems, pp. 53–67. Springer.

Ko, A. H., Sabourin, R., Britto, A. S., e Jr. (2008). From dynamic classifier selection to dynamic ensemble selection. Pattern Recognition, 41(5), 1718 – 1731.

Kuncheva, L. I. (2004). Combining Pattern Classifiers: methods and algorithms. Wiley-Interscience.

Lu, Z., Wu, X., Zhu, X., e Bongard, J. (2010). Ensemble pruning via individual contribution ordering. In ACM SIGKDD International Conference on Knowledge

Discovery and Data Mining , pp. 871–880.

Margineantu, D. D. e Dietterich, T. G. (1997). Pruning adaptive boosting. In

International Conference on Machine Learning (ICML), volume 97, pp. 211–218.

Marqués, A., García, V., e Sánchez, J. (2012a). Exploring the behaviour of base classifiers in credit scoring ensembles. Expert Systems with Applications,

39(11), 10244 – 10250.

Marqués, A., García, V., e Sánchez, J. (2012b). Two-level classifier ensembles for credit risk assessment. Expert Systems with Applications, 39(12), 10916 – 10922.

Martınez-Munoz, G. e Suárez, A. (2004). Aggregation ordering in bagging. In

IASTED International Conference on Artificial Intelligence and Applications,

pp. 258–263.

Martinez-Muñoz, G., Hernandez-Lobato, D., e Suarez, A. (2009). An analysis of ensemble pruning techniques based on ordered aggregation. IEEE Transacti-

ons on Pattern Analysis and Machine Intelligence, 31(2), 245–259.

Martínez-Muñoz, G. e Suárez, A. (2006). Pruning in ordered bagging ensembles. In International Conference on Machine Learning , pp. 609–616.

Nanni, L. e Lumini, A. (2009). An experimental comparison of ensemble of classifiers for bankruptcy prediction and credit scoring. Expert Systems with

5.2. TRABALHOS FUTUROS Ponti, M. P. (2011). Combining classifiers: from the creation of ensembles to the decision fusion. In Conference on Graphics, Patterns and Images Tutorials, pp. 1–10.

Rodriguez, J., Kuncheva, L., e Alonso, C. (2006). Rotation forest: A new classifier ensemble method. Pattern Analysis and Machine Intelligence, IEEE Transacti-

ons on, 28(10), 1619–1630.

Rosenberg, E. e Gleit, A. (1994). Quantitative methods in credit management: a survey. Operations research, 42(4), 589–613.

Santana, L. E. A., de Oliveira, D. F., Canuto, A. M., e de Souto, M. C. P. (2007). A comparative analysis of feature selection methods for ensembles with different combination methods. In International Joint Conference on Neural Networks, pp. 643–648.

Schapire, R. E. (1990). The strength of weak learnability. Machine learning ,

5(2), 197–227.

Selfridge, O. G. (1958). Pandemonium: a paradigm for learning in mecha- nisation of thought processes. In Symposium Held at the National Physical

Laboratory, pp. 513–526.

Soares, R., Santana, A., Canuto, A., e De Souto, M. C. P. (2006). Using accuracy and diversity to select classifiers to build ensembles. In International Joint

Conference on Neural Networks, pp. 1310–1316.

Thomas, L. C., Edelman, D. B., e Crook, J. N. (2002). Credit scoring and its

applications. SIAM Mathematical Modeling and Computation.

Tsoumakas, G., Partalas, I., e Vlahavas, I. (2009). An ensemble pruning primer. In O. Okun e G. Valentini, editors, Applications of Supervised and Unsupervised

Ensemble Methods, volume 245 of Studies in Computational Intelligence, pp.

1–13. Springer Berlin Heidelberg.

Tumer, K. e Ghosh, J. (1996). Error correlation and error reduction in ensemble classifiers. Connection science, 8(3-4), 385–404.

Wang, G., Ma, J., Huang, L., e Xu, K. (2012). Two credit scoring models based on dual strategy ensemble trees. Knowledge-Based Systems, 26(0), 61–68.

5.2. TRABALHOS FUTUROS Wolpert, D. e Macready, W. (1997). No free lunch theorems for optimization.

IEEE Transactions on Evolutionary Computation, 1(1), 67–82.

Wo´zniak, M., Graña, M., e Corchado, E. (2014). A survey of multiple classifier systems as hybrid systems. Information Fusion, 16(0), 3 – 17. Special Issue on Information Fusion in Hybrid Intelligent Fusion Systems.

Xu, L., Krzyzak, A., e Suen, C. (1992). Methods of combining multiple classifiers and their applications to handwriting recognition. , IEEE Transactions on

Systems, Man and Cybernetics, 22(3), 418–435.

Zhou, Z.-H. e Tang, W. (2003). Selective ensemble of decision trees. In G. Wang, Q. Liu, Y. Yao, e A. Skowron, editors, Rough Sets, Fuzzy Sets, Data Mining, and

Granular Computing , volume 2639 of Lecture Notes in Computer Science, pp.

476–483. Springer Berlin Heidelberg.

Zhou, Z.-H., Wu, J., e Tang, W. (2002). Ensembling neural networks: Many could be better than all. Artificial Intelligence, 137(1–2), 239 – 263.