Data mining feature selection for credit scoring models

2005 | journal article. A publication with affiliation to the University of Göttingen.

Jump to: Cite & Linked | Documents & Media | Details | Version history

Cite this publication

​Data mining feature selection for credit scoring models​
Liu, Y. & Schumann, M. ​ (2005) 
Journal of the Operational Research Society56(9) pp. 1099​-1108​.​ DOI: https://doi.org/10.1057/palgrave.jors.2601976 

Documents & Media

License

GRO License GRO License

Details

Authors
Liu, Y.; Schumann, M. 
Abstract
The features used may have an important effect on the performance of credit scoring models. The process of choosing the best set of features for credit scoring models is usually unsystematic and dominated by somewhat arbitrary trial. This paper presents an empirical study of four machine learning feature selection methods. These methods provide an automatic data mining technique for reducing the feature space. The study illustrates how four feature selection methods -'ReliefF','Correlation-based', 'Consistency-based' and 'Wrapper' algorithms help to improve three aspects of the performance of scoring models: model simplicity, model speed and model accuracy. The experiments are conducted on real data sets using four classification algorithms -'model tree (M5)', 'neural network (multi-layer perceptron with back-propagation)', 'logistic regression', and 'k-nearest-neighbours'.
Issue Date
2005
Status
published
Publisher
Palgrave Macmillan Ltd
Journal
Journal of the Operational Research Society 
ISSN
0160-5682

Reference

Citations


Social Media