Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Public Health and Surveillance

Date Submitted: Jul 24, 2020
Date Accepted: Aug 4, 2020

The final, peer-reviewed published version of this preprint can be found here:

Correction: A Snapshot of SARS-CoV-2 Genome Availability up to April 2020 and its Implications: Data Analysis

Mavian C, Marini S, Prosperi M, Salemi M

Correction: A Snapshot of SARS-CoV-2 Genome Availability up to April 2020 and its Implications: Data Analysis

JMIR Public Health Surveill 2020;6(3):e22853

DOI: 10.2196/22853

PMID: 32776889

PMCID: 7445602

Correction: A Snapshot of SARS-CoV-2 Genome Availability up to April 2020 and its Implications: Data Analysis

  • Carla Mavian; 
  • Simone Marini; 
  • Mattia Prosperi; 
  • Marco Salemi

ABSTRACT

Background:

The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic has been growing exponentially, affecting over 4 million people and causing enormous distress to economies and societies worldwide. A plethora of analyses based on viral sequences has already been published both in scientific journals and through non–peer-reviewed channels to investigate the genetic heterogeneity and spatiotemporal dissemination of SARS-CoV-2. However, a systematic investigation of phylogenetic information and sampling bias in the available data is lacking. Although the number of available genome sequences of SARS-CoV-2 is growing daily and the sequences show increasing phylogenetic information, country-specific data still present severe limitations and should be interpreted with caution.

Objective:

The objective of this study was to determine the quality of the currently available SARS-CoV-2 full genome data in terms of sampling bias as well as phylogenetic and temporal signals to inform and guide the scientific community.

Methods:

We used maximum likelihood–based methods to assess the presence of sufficient information for robust phylogenetic and phylogeographic studies in several SARS-CoV-2 sequence alignments assembled from GISAID (Global Initiative on Sharing All Influenza Data) data released between March and April 2020.

Results:

Although the number of high-quality full genomes is growing daily, and sequence data released in April 2020 contain sufficient phylogenetic information to allow reliable inference of phylogenetic relationships, country-specific SARS-CoV-2 data sets still present severe limitations.

Conclusions:

At the present time, studies assessing within-country spread or transmission clusters should be considered preliminary or hypothesis-generating at best. Hence, current reports should be interpreted with caution, and concerted efforts should continue to increase the number and quality of sequences required for robust tracing of the epidemic.


 Citation

Please cite as:

Mavian C, Marini S, Prosperi M, Salemi M

Correction: A Snapshot of SARS-CoV-2 Genome Availability up to April 2020 and its Implications: Data Analysis

JMIR Public Health Surveill 2020;6(3):e22853

DOI: 10.2196/22853

PMID: 32776889

PMCID: 7445602

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

Advertisement