Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Nov 23, 2020
Date Accepted: Mar 25, 2021
Date Submitted to PubMed: Apr 19, 2021

The final, peer-reviewed published version of this preprint can be found here:

Using Unsupervised Machine Learning to Identify Age- and Sex-Independent Severity Subgroups Among Patients with COVID-19: Observational Longitudinal Study

Benito-León J, del Castillo MD, Estirado A, Ghosh R, Dubey S, Serrano JI

Using Unsupervised Machine Learning to Identify Age- and Sex-Independent Severity Subgroups Among Patients with COVID-19: Observational Longitudinal Study

J Med Internet Res 2021;23(5):e25988

DOI: 10.2196/25988

PMID: 33872186

PMCID: 8163491

Using Unsupervised Machine Learning to Identify Age- and Sex-Independent Severity Subgroups among COVID-19 Patients in the Emergency Department

  • Julián Benito-León; 
  • Mª Dolores del Castillo; 
  • Alberto Estirado; 
  • Ritwik Ghosh; 
  • Souvik Dubey; 
  • J. Ignacio Serrano

ABSTRACT

Background:

Early detection and intervention are the key factors for improving outcomes in COVID-19.

Objective:

To detect severity subgroups among COVID-19 patients, based only on clinical data and standard laboratory tests obtained during the assessment at the emergency department.

Methods:

We applied unsupervised machine learning to a dataset of 853 COVID-19 patients from HM hospitals in Spain.

Results:

From a total of 850 variables, four tests, the serum levels of aspartate transaminase (AST), lactate dehydrogenase (LDH) and C-reactive protein (CRP), and the number of neutrophils, were enough to segregate the entire patient pool into three separate clusters. Further, the percentage of monocytes and lymphocytes and the levels of alanine transaminase (ALT) distinguished the cluster 3 from the other two clusters. The cluster 1 was characterized by the higher mortality rate and higher levels of AST, ALT, LDH, CRP and number of neutrophils, and low percentage of monocytes and lymphocytes. The cluster 2 included patients with a moderate mortality rate and medium levels of the previous laboratory determinations. The cluster 3 was characterized by the lower mortality rate and lower levels of AST, ALT, LDH, CRP and number of neutrophils, and higher percentage of monocytes and lymphocytes. Age, sex, comorbidities, and vital signs did not allow us to separate the three clusters. An online cluster assignment tool can be found at https://g-nec.car.upm-csic.es/COVID19-severity-group-assessment/.

Conclusions:

A few standard laboratory tests, deemed to be available in all emergency departments, have shown far discriminative power for characterization of severity subgroups among COVID-19 patients.


 Citation

Please cite as:

Benito-León J, del Castillo MD, Estirado A, Ghosh R, Dubey S, Serrano JI

Using Unsupervised Machine Learning to Identify Age- and Sex-Independent Severity Subgroups Among Patients with COVID-19: Observational Longitudinal Study

J Med Internet Res 2021;23(5):e25988

DOI: 10.2196/25988

PMID: 33872186

PMCID: 8163491

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

Advertisement