A Hybrid Machine Learning Approach for Improving Mortality Risk Prediction on Imbalanced Data
2019 | conference paper. A publication with affiliation to the University of Göttingen.
Jump to: Cite & Linked | Documents & Media | Details | Version history
Cite this publication
A Hybrid Machine Learning Approach for Improving Mortality Risk Prediction on Imbalanced Data
Tashkandi, A. & Wiese, L. (2019)
Proceedings of the 21st International Conference on Information Integration and Web-based Applications & Service pp. 83-92. iiWAS2019: The 21st International Conference on Information Integration and Web-based Applications & Services, München. DOI: https://doi.org/10.1145/3366030.3366040
Documents & Media
Details
- Authors
- Tashkandi, Araek; Wiese, Lena
- Abstract
- The efficiency of Machine Learning (ML) models has widely been acknowledged in the healthcare area. However, the quality of the underlying medical data is a major challenge when applying ML in medical decision making. In particular, the imbalanced class distribution problem causes the ML model to be biased towards the majority class. Furthermore, the accuracy will be biased, too, which produces the Accuracy Paradox. In this paper, we identify an optimal ML model for predicting mortality risk for Intensive Care Units (ICU) patients. We comprehensively assess an approach that leverages the efficiency of ML ensemble learning (in particular, Gradient Boosting Decision Tree) and clustering-based data sampling to handle the imbalanced data problem that this model faces. We comprehensively compare different competitors (in terms of ML models as well as clustering methods) on a big real-world ICU dataset achieving a maximum area under the curve value of 0.956.
- Issue Date
- 2019
- Conference
- iiWAS2019: The 21st International Conference on Information Integration and Web-based Applications & Services
- Conference Place
- München
- Event start
- 2019-12
- Event end
- 2019-12
- Language
- English