Demonstrating non-inferiority of easy interpretable methods for insolvency prediction

2015 | journal article. A publication with affiliation to the University of Göttingen.

Jump to: Cite & Linked | Documents & Media | Details | Version history

Cite this publication

​Demonstrating non-inferiority of easy interpretable methods for insolvency prediction​
Obermann, L. & Waack, S.​ (2015) 
Expert Systems with Applications42(23) pp. 9117​-9128​.​ DOI: https://doi.org/10.1016/j.eswa.2015.08.009 

Documents & Media

License

GRO License GRO License

Details

Authors
Obermann, Lennart; Waack, Stephan
Abstract
Insolvency prediction and credit rating are challenging tasks used to evaluate commercial enterprises based on qualitative and quantitative attributes. One way to approach this task is machine learning whereby hypotheses are trained on sample data. The advantages are the automatization of the process obviating the need of human knowledge and thus, its high level objectivity. Nevertheless, this approach does not claim to be perfect as it does not completely replace human knowledge. Hence, these hypotheses are intended to be used as decision support for financial experts and thus, offer an advantage over black box hypotheses. We demonstrate how easily interpretable white box hypotheses (Decision Trees (DTs) and Disjunctive Normal Forms (DNFs)) are not inferior to more difficult interpretable gray box hypotheses (Random Forests (RFs) and Artificial Neural Networks (ANNs)) or even non-interpretable black box hypotheses (Support Vector Machines (SVMs)). We calculate DTs by means of Quinlans famous C4.5 algorithm. For DNFs, we developed an algorithm we call threshold heuristic. In our case study, a database with financial statements of 5152 enterprises is used to evaluate the performance in insolvency prediction for all classifiers. A common problem in insolvency prediction is extremely imbalanced data because naturally there are very few insolvent enterprises. Therefore we apply an asymmetric bagging method which increases the performance with extremely imbalanced data sets. In our case study, interpretable hypotheses perform better than other hypotheses. DTs have a better alpha-error, whereas DNFs outperform DTs with respect to the beta-error. We compared both hypothesis classes and exemplary hypotheses. Both are interpretable in different ways leaving the choice to preference. This leads to the conclusion that interpretable threshold based methods are appropriate for classification problems in finance. In this domain, they are not inferior to more sophisticated methods like SVMs. (C) 2015 Elsevier Ltd. All rights reserved.
Issue Date
2015
Status
published
Publisher
Pergamon-elsevier Science Ltd
Journal
Expert Systems with Applications 
ISSN
1873-6793; 0957-4174

Reference

Citations


Social Media