Ensemble learning for poor prognosis predictions: a case study on SARS-CoV2.

Wu, Honghan, Zhang, Huayu, Karwath, Andreas, Ibrahim, Zina, Shi, Ting, Zhang, Xin, Wang, Kun, Sun, Jiaxing, Dhaliwal, Kevin, Bean, Daniel, Cardoso, Victor Roth, Li, Kezhi, Teo, James T, Banerjee, Amitava, Gao-Smith, Fang, Whitehouse, Tony, Veenith, Tonny, Gkoutos, Georgios V, Wu, Xiaodong, Dobson, Richard and Guthrie, Bruce (2020) Ensemble learning for poor prognosis predictions: a case study on SARS-CoV2. Journal of the American Medical Informatics Association : JAMIA. ISSN 1527-974X.


Download (1MB) | Preview
Official URL: https://academic.oup.com/jamia/advance-article/doi...



Risk prediction models are widely used to inform evidence-based clinical decision making. However, few models developed from single cohorts can perform consistently well at population level where diverse prognoses exist (such as the SARS-CoV2 pandemic). This study aims at tackling this challenge by synergising prediction models from the literature using ensemble learning.


In this study we selected and reimplemented seven prediction models for COVID-19, which were derived from diverse cohorts and used different implementation techniques. A novel ensemble learning framework was proposed to synergise them for realising personalised predictions for individual patients. Four diverse international cohorts (2 from the UK and 2 from China; total N=5,394) were used to validate all eight models on discrimination, calibration and clinical usefulness.


Results showed that individual prediction models could perform well on some cohorts while poorly on others. Conversely, the ensemble model achieved the best performances consistently on all metrics quantifying discrimination, calibration and clinical usefulness. Performance disparities were observed in cohorts from the two countries: all models achieved better performances on the China cohorts.


When individual models were learned from complementary cohorts, the synergised model will have the potential to achieve synergised performances. Results indicate that blood parameters and physiological measurements might have better predictive powers when collected early, which remains to be confirmed by further studies.


Combining a diverse set of individual prediction models, ensemble method can synergise a robust and well-performing model by choosing the most competent ones for individual patients.

Item Type: Article
Subjects: WB Practice of medicine
WC Communicabable diseases
Divisions: Clinical Support > Infectious Diseases
Related URLs:
Depositing User: Mr Philip O'Reilly
Date Deposited: 26 Nov 2020 16:09
Last Modified: 26 Nov 2020 16:09
URI: http://www.repository.uhblibrary.co.uk/id/eprint/3721

Actions (login required)

View Item View Item