Can Machine Learning Models Capture Correlations in Corporate Distresses?

Benjamin Christoffersen, Rastin Matin, Pia Mølgaard

Research output: Working paperResearch


A number of papers document that recent machine learning models outperform traditional corporate distress models in terms of accurately ranking firms by their riskiness. However, it remains unanswered whether advanced machine learning models can capture correlations in distresses sufficiently well to be used for joint modelling, which traditional distress models often struggle with. We implement a regularly top-performing machine learning model and find that prediction accuracy of individual distress probabilities improves while there is almost no difference in the predicted aggregate distress rate relative to traditional distress models. Thus, our findings suggest that complex machine learning models do not eliminate the excess clustering in distresses. Instead, we propose a frailty model, which allows for correlations in distresses, augmented with regression splines. This model demonstrates competitive performance in terms of ranking firms by their riskiness, while providing accurate aggregate risk measures.
Original languageEnglish
Place of PublicationKøbenhavn
PublisherDanmarks Nationalbank
Number of pages34
Publication statusPublished - 26 Oct 2019
SeriesDanmarks Nationalbank. Working Papers


  • Corporate default prediction
  • Discrete hazard models
  • Frailty models
  • Gradient boosting

Cite this