Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Dec 18, 2020
Date Accepted: Mar 23, 2021
Date Submitted to PubMed: Apr 12, 2021

The final, peer-reviewed published version of this preprint can be found here:

Machine Learning–Based Prediction of Growth in Confirmed COVID-19 Infection Cases in 114 Countries Using Metrics of Nonpharmaceutical Interventions and Cultural Dimensions: Model Development and Validation

Yeung AY, Roewer-Despres F, Rosella L, Rudzicz F

Machine Learning–Based Prediction of Growth in Confirmed COVID-19 Infection Cases in 114 Countries Using Metrics of Nonpharmaceutical Interventions and Cultural Dimensions: Model Development and Validation

J Med Internet Res 2021;23(4):e26628

DOI: 10.2196/26628

PMID: 33844636

PMCID: 8074952

Comparison of Multiple Machine Learning-based Predictions of Growth in COVID-19 Confirmed Infection Cases in Countries using Non-Pharmaceutical Interventions and Cultural Dimensions Data: Development and Validation

  • Arnold YS Yeung; 
  • Francois Roewer-Despres; 
  • Laura Rosella; 
  • Frank Rudzicz

ABSTRACT

Background:

National governments have implemented non-pharmaceutical interventions to control and mitigate against the COVID-19 pandemic. A deep understanding of these interventions is required.

Objective:

We investigate the prediction of future daily national Confirmed Infection Growths – the percentage change in total cumulative cases across 14 days – using metrics representative of non-pharmaceutical interventions and cultural dimensions of each country.

Methods:

We combine the OxCGRT dataset, Hofstede’s cultural dimensions, and COVID-19 daily reported infection case numbers to train and evaluate five non-time series machine learning models in predicting Confirmed Infection Growth. We use three validation methods – in-distribution, out-of-distribution, and country-based cross-validation – for evaluation, each applicable to a different use case of the models.

Results:

Our results demonstrate high R^2 values between the labels and predictions for the in-distribution, out-of-distribution, and country-based cross-validation methods (0.959, 0.513, and 0.574 respectively) using random forest and AdaBoost regression. While these models may be used to predict the Confirmed Infection Growth, the differing accuracies obtained from the three tasks suggest a strong influence of the use case.

Conclusions:

This work provides new considerations in using machine learning techniques with non-pharmaceutical interventions and cultural dimensions data for predicting the national growth of confirmed infections of COVID-19.


 Citation

Please cite as:

Yeung AY, Roewer-Despres F, Rosella L, Rudzicz F

Machine Learning–Based Prediction of Growth in Confirmed COVID-19 Infection Cases in 114 Countries Using Metrics of Nonpharmaceutical Interventions and Cultural Dimensions: Model Development and Validation

J Med Internet Res 2021;23(4):e26628

DOI: 10.2196/26628

PMID: 33844636

PMCID: 8074952

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

Advertisement