Title
Machine Learning for Bankruptcy Prediction in the American Stock Market: Dataset and Benchmarks
Abstract
Predicting corporate bankruptcy is one of the fundamental tasks in credit risk assessment. In particular, since the 2007/2008 financial crisis, it has become a priority for most financial institutions, practitioners, and academics. Recent advances in machine learning (ML) have enabled the development of several models for bankruptcy prediction. The most challenging aspect of this task is dealing with the class imbalance caused by the rarity of bankruptcy events in the real economy. Furthermore, a fair comparison in the literature is difficult to make because bankruptcy datasets are not publicly available and because studies often restrict their datasets to specific economic sectors, markets, and/or time periods. In this work, we investigated the design and application of different ML models for two tasks related to default events: (a) estimating survival probabilities over time; (b) predicting default from time series of accounting data of different lengths. The entire dataset used for the experiments has been made available to the scientific community for further research and benchmarking purposes. The dataset covers 8262 public companies listed on the American stock market between 1999 and 2018. Finally, in light of the results obtained, we critically discuss the most interesting metrics as proposed benchmarks for future studies.
Year
2022
DOI
10.3390/fi14080244
Venue
Future Internet
Keywords
bankruptcy prediction, deep learning, multi-head, LSTM, machine learning, stock market
DocType
Journal
Volume
14
Issue
8
ISSN
1999-5903
Citations
0
PageRank
0.34
References
0
Authors
6
Name                 Order  Citations  PageRank
Gianfranco Lombardo  1      0          0.34
Mattia Pellegrino    2      0          0.68
George Adosoglou     3      0          1.01
Stefano Cagnoni      4      1096       155.20
Panos M. Pardalos    5      141        19.60
Agostino Poggi       6      0          0.68