Search for content and authors
 

Multiclass corporate failure prediction by agregation classification trees.

Esteban A. Cortes 1Matias Gamez Martinez Noelia Garcia Rubio 

1. Castilla-La Mancha University., Plaza de la Universidad, 1, Albacete 02-071, Spain

Abstract

The goal of this study is to compare the behaviour of three ensemble methods (AdaBoost, Bagging and Random Forest) in a financial application. Since the sixties several classification techniques have been used for corporate failure prediction. In last decades, the interest in this task has widely increased owing to the existence of data bases with available information.

Classification trees are a powerful alternative to the more traditional statistical models. This model has the advantage of being able to detect non-linear relationships and showing a good performance in presence of qualitative information as it happens in corporate failure prediction problems. As a result, they are widely used as base classifiers for ensemble methods.

AdaBoost constructs its base classifiers in sequence, updating a distribution over the training examples to create each base classifier. Bagging combines the individual classifiers built in bootstrap replicates of the training set. Random Forest is a combination of tree predictors such that each tree depends on the values of a random vector sampled independently and with the same distribution for all trees in the forest.

In this paper, we compare the prediction accuracy of these techniques for the corporate failure prediction task.

Key Words: Bankruptcy Prediction, AdaBoost, Bagging, Random Forest.

 

Legal notice
  • Legal notice:
 

Presentation: Invited oral at XVI KONFERENCJA NAUKOWA SEKCJI KLASYFIKACJI I ANALIZY DANYCH PTS, Sesja plenarna, by Esteban A. Cortes
See On-line Journal of XVI KONFERENCJA NAUKOWA SEKCJI KLASYFIKACJI I ANALIZY DANYCH PTS

Submitted: 2007-04-15 21:28
Revised:   2010-03-05 15:17