Skip to main content
  • Research article
  • Open access
  • Published:

Discriminant models for the prediction of postponed viral shedding time and disease progression in COVID-19

Abstract

Background

COVID-19 infection can cause life-threatening respiratory disease. This study aimed to fully characterize the clinical features associated with postponed viral shedding time and disease progression, then develop and validate two prognostic discriminant models.

Methods

This study included 125 hospitalized patients with COVID-19, for whom 44 parameters were recorded, including age, gender, underlying comorbidities, epidemiological features, laboratory indexes, imaging characteristics and therapeutic regimen, et al. Fisher's exact test and Mann–Whitney test were used for feature selection. All models were developed with fourfold cross-validation, and the final performances of each model were compared by the Area Under Receiving Operating Curve (AUROC). After optimizing the parameters via L2 regularization, prognostic discriminant models were built to predict postponed viral shedding time and disease progression of COVID-19 infection. The test set was then used to detect the predictive values via assessing models’ sensitivity and specificity.

Results

Sixty-nine patients had a postponed viral shedding time (> 14 days), and 28 of 125 patients progressed into severe cases. Six and eleven demographic, clinical features and therapeutic regimen were significantly associated with postponed viral shedding time and disease progressing, respectively (p < 0.05). The optimal discriminant models are: y1 (postponed viral shedding time) = − 0.244 + 0.2829x1 (the interval from the onset of symptoms to antiviral treatment) + 0.2306x4 (age) + 0.234x28 (Urea) − 0.2847x34 (Dual-antiviral therapy) + 0.3084x38 (Treatment with antibiotics) + 0.3025x21 (Treatment with Methylprednisolone); y2 (disease progression) = − 0.348–0.099x2 (interval from Jan 1st,2020 to individualized onset of symptoms) + 0.0945x4 (age) + 0.1176x5 (imaging characteristics) + 0.0398x8 (short-term exposure to Wuhan) − 0.1646x19 (lymphocyte counts) + 0.0914x20 (Neutrophil counts) + 0.1254x21 (Neutrphil/lymphocyte ratio) + 0.1397x22 (C-Reactive Protein) + 0.0814x23 (Procalcitonin) + 0.1294x24 (Lactic dehydrogenase) + 0.1099x29 (Creatine kinase).The output ≥ 0 predicted postponed viral shedding time or disease progressing to severe/critical state. These two models yielded the maximum AUROC and faired best in terms of prognostic performance (sensitivity of78.6%, 75%, and specificity of 66.7%, 88.9% for prediction of postponed viral shedding time and disease severity, respectively).

Conclusion

The two discriminant models could effectively predict the postponed viral shedding time and disease severity and could be used as early-warning tools for COVID-19.

Peer Review reports

Background

Summary: this study fully characterizes the clinical features associated with postponed viral shedding time and disease progression, then develop and validate two prognostic models with satisfactory discriminant performance.

The prevalence of coronavirus disease 2019 (COVID-19) has put a huge burden to medical resources [1]. Although patients with COVID-19 infection mostly manifested as non-severe cases, it can also cause life-threatening conditions before or during hospitalization, such as severe pneumonia, adult respiratory distress syndrome or multiple organ failure, which are all related to worse outcomes [2]. Compare to the other epidemiological disease, such as the previous outbreaks of SARS-CoV and MERS-CoV, COVID-19 progresses and spreads more rapidly, with peculiar epidemiological traits. High viral loads of SARS-CoV-2 were observed in the upper respiratory specimens of patients with little or no symptoms, this indicated that inapparent-transmission plays a major but underestimated role in sustaining the outbreak of COVID-19 [3].

Since the first case emerged in Liaoning province in Jan 22th, 2020, there is an urgent need to construct a simple, efficient and accurate “early-warning prediction model” for disease progression at early stage once the patients were admitted to the hospital. This will facilitate the medical staff to make critical time-sensitive decisions regarding patients and treatments. Traditional evaluation scoring tools, such as CURB-65, qSOFA, and NEWS, could be adopted to assess disease severity, but not for the early assessment of COVID-19 severity [4]. Thus, to promptly predict and identify patients with postponed viral shedding time and disease severity is required but challenging. To date, dozens of prediction models of COVID-19 have been established to respond quickly to this healthcare crisis [5, 6]. Unfortunately, the quality of some of the identified models is uniformly poor and none can be recommended for clinical use, as demonstrated by one systemic review [7]. This study aimed to fully characterize the demographic, epidemiological, clinical features and therapeutic regimens and to detect their association with postponed viral shedding time and disease progression among patients with COVID-19 in Liaoning province, China. Furthermore, another purpose of this study was to specifically design and validate two prognostic discriminant models incorporating the associated features. These new mathematical models can serve as early-warning prediction tools to estimate the postponed viral shedding time and to identify the risk of progressing to severe stage in advance among patients with COVID-19 infection.

Methods

Study design

This retrospective multi-center cohort study included consecutive patients, they were laboratory-confirmed with COVID-19 infection and enrolled from Jan 22th to Mar 22th 2020 in eight designated hospitals throughout Liaoning province. All the enrolled patients were diagnosed with COVID-19 according to the WHO interim guidance [8]. Laboratory confirmation of COVID-19 was achieved by the nucleic acid test using real-time reverse transcriptase-polymerase chain reaction (RT-PCR) assay at Liaoning municipal Center for Disease Prevention and Control (CDC). Samples were collected using a nose swab and/or throat swab from each suspected patient. This study was approved by the human research ethics committee of the first hospital of CMU (committee’s reference number: AF-OG-20-1.1-02), Shenjing hospital of CMU (committee’s reference number: 2020041002), Liaoning province peoples’ hospital, Shenyang Sixth People's Hospital, Jinzhou Infectious Disease Hospital, Tieling infectious Disease Hospital, Fuxin Infectious Disease Hospital, Central Hospital of Huludao City. The other committee’s reference number were not available. All the written informed consent was waived. Permissions were obtained to access the data, as this was included as a part of the formal ethics approval. The data used in this study was anonymized before its use.

Data collection

The date of disease onset (defined as the day when any symptom was noticed by the patients) and hospital admission date, the first day for nucleic acid detected to be positive or negative were all recorded. All the patients were hospitalized, and the clinical outcomes were monitored for at least 8 weeks. All the clinical data on epidemiology (recent exposure history), symptoms, signs, underlying comorbidities, laboratory results (on admission), imaging findings (on admission) and clinical progression were recorded and retrospectively double-extracted from electronic medical records, with two independent reviewers extracted the data and evaluated the eligibility of the original data. Long-term exposure to Wuhan was defined by Wuhan residence or study, work for at least one-month in Wuhan. Short-term exposure to Wuhan was defined by meeting, transfer or travel history to Wuhan temporarily. Considering that one of the isolation release and discharge criteria for hospitalized patients is a sputum/oral swab testing negative twice with 24 h interval [5], virus detection was repeated twice every 24 h. It was deemed as viral clearance when virus was detected to be negative for two consecutive times.

Regarding antiviral treatment, 14 patients with confirmed COVID-19 were treated with lopinavir–ritonavir (400 mg/100 mg), 11 with arbidol (200 mg t.i.d.). Dual-antiviral therapy of nebulized Interferon-α (IFN-α) (5 mU b.i.d.) with lopinavir–ritonavir or arbidol were used in 42 and 8 patients. Triple-antiviral therapy of IFN-α, lopinavir–ritonavir and arbidol were used in 20 patients. Other therapies such as antibiotics and corticosteroids were used in 38 and 22 patients.

Definition of disease progression and postponed virus shedding time

Disease progression were recorded for at least 8 weeks after admission. Severity of COVID-19 was defined according to the American Thoracic Society (ATS) guidelines for community-acquired pneumonia (CAP) [9]. Severe/critical cases of COVID-19 should meet one major criterion (septic shock with need for vasopressors or respiratory failure requiring mechanical ventilation) or at least three minor criteria: (a) respiratory distress with respiratory frequency ≥ 30/min; (b) oxygenation index (partial pressure of oxygen/inspired oxygen fraction, PaO2/FiO2) ≤ 250 mmHg; (c) multilobe infiltrates, confusion/disorientation; (d) uremia (blood urea nitrogen ≥ 20 mg/dL); (e) leukopenia (white blood cell count < 400 cells/μL); (f) thrombocytopenia (platelet count < 100,000/μL); (g) hypothermia (body temperature < 36 ºC);(h) hypotension requiring aggressive fluid resuscitation. Non-severe patients were defined as a confirmed case with fever, respiratory symptoms, with or without radiographic evidence of pneumonia.

The median viral shedding time was 14 days (IQR, 11–19), those ≥ 14 days were deemed as postponed virus shedding.

Variables

Discriminant variables for disease severity and postponed viral shedding time were determined according to the assessment of the existing medical records (Additional file 1: Table S1) listed as follows: the interval from onset of symptoms to antiviral treatment (x1),interval from Jan 1st,2020 (the day for the first case emerged in Liaoning) to individualized onset of symptoms (x2), gender (x3), age (x4), imaging characteristics (x5);long-term exposure to Wuhan (x6), local transmitted history (x7), short-term exposure to Wuhan (x8); respiratory symptoms (x9), digestive symptoms (x10), general malaise (x11), fever (x12); comorbidities with chronic respiratory disease (x13), hypertension (x14), diabetes mellitus (x15), surgery history (x16), other comorbidities (x17), White blood cell (WBC) (x18), lymphocyte counts (x19), neutrophil counts (x20), neutrophil/lymphocyte ratio (N/L) (x21), C-Reactive Protein (CRP) (x22), Procalcitonin (PCT) (x23), Lactic dehydrogenase (LDH) (x24), Aspartate amino transferase (AST) (x25), Alanine aminotransferase (ALT) (x26), Creatinine (Cr) (x27), Urea nitrogen (Urea) (x28), Creatine kinase (CK) (x29), oxygenation index(x30),disease severity (x31), treatment with Lobinavi/ritonavir alone (x32),treatment with Arbidol alone (x33), combined treatment of nebulized IFN-α with lopinavir–ritonavir (x34), combined treatment of nebulized IFN-α, with Arbidol (x35), combined treatment of nebulized IFN-α, lopinavir–ritonavir and Arbidol (x36), treatment with Oseltamivir phosphate alone (x37), Treatment with antibiotics (x38), treatment with ribavirin (x39), treatment with Chinese traditional medicine (Xuebijing) (x40), treatment with Methylprednisolone (x41), treatment with γ-globulin (x42),antiviral treatment course (x43). The outcomes were postponed viral shedding time (y1) and disease progression (y2).

Discriminate factors were also quantitatively assigned, some variables such as the imaging characteristics were assigned from 0, and the order was based on their influence on the progression of disease.

Establish the optimal discriminant models for disease progression and postponed viral shedding time

Setting disease progression and postponed viral shedding time as the goal for discriminant models, logistic regression, linear discriminant analysis, K-nearest neighbor, support vector machine (SVM) and decision tree were constructed through Python 3.6 software (Numpy and Sklearn package). The dataset was split 4:1 by stratified random sampling and four-fifths was used as a training group to establish models. After comparing effectiveness among the models by analyzing total accuracy in both training and testing data set, the most optimal discriminant models incorporating multiple related factors were established to reflect the probability of disease progressing to severe stage or postponed viral shedding time. The models were constructed using the output as an outcome, while the output ≥ 0 indicated disease progressing to severe/critical stage or postponed viral shedding time.

Verification of the discriminant models

The precision of the prediction models was further evaluated and validated. Due to the resampling methods, bootstrapping or cross-validation were more powerful than splitting the sample for internal validation. We applied cross-validation on the basis of random stratification. By comparing the area under the receiver-operating characteristic curve (AUROC) value, sensitivity, specificity, accuracy and recall rate, et al., the optimal primary screening model was chosen. Multicollinearity was calculated to assess the feasibility of the optimal model. Receiver operating characteristic (ROC) curves and confusion matrix were constructed to describe the screening effectiveness of the optimal model. By analyzing total accuracy, the optimal discriminant model was chosen.

Statistical analysis

Categorical variables were summarized as frequencies and percentages. Continuous variables were described using median and interquartile ranges (IQR) values. Data were then compared between groups divided by disease severity (non-severe vs. severe group) or viral shedding time (with cutoff value 14 days) using F test for continuous variable, or by Mann–Whitney test or χ2 test for categorical data. Features significantly different (p < 0.05) in both algorithms were selected into the models. In order to eliminate the overfitting effect and regularize the models, z-score standardization [x* = (x − mean)/standard deviation] was conducted on all continuous variables in the data set. Thus, each corresponding feature was converted into a normal distribution with mean value of 0 and variance of 1, to eliminate the dimensional influence.

Results

Demographic and clinical characteristics of patients with COVID-19 infection

A total of 125 hospitalized patients diagnosed as COVID-19 infection were included in this study. Disease progression was recorded during the 8-weeks follow-up after admission. Ten patients were categorized into mild cases, 97 developed into moderate cases and 28 patients developed into severe cases (including 3 critically ill) during hospitalization. Symptoms were shown in Table 1. Among 125 patients enrolled, 75 (62%) were imported cases (with an exposure history to Wuhan), which further divided into 37 (29.6%) short-term exposure (meeting, transfer or travel et al.) and 38 (30.4%) long-term exposure (residence or study, work in Wuhan) history to Wuhan. The rest 50 (40%) cases had no history of Wuhan exposure, thus were categorized into local transmitted cases who had contact with symptomatic cases. The median age for all patients were 44 years (IQR, 34–57), for non-severe patients was 41 years old (IQR, 34–55), and for severe patients was 50 years old (IQR, 38–63). About half (55.2%) of patients were male. Among all patients, up to 61 patients had at least one underlying comorbidities, the most common of which were chronic diseases, such as hypertension, et al. The median interval from the onset of symptoms to hospital admission was 4 days (IQR, 2–7). The median viral shedding time was 14 days (IQR, 11–19).

Table 1 Demographic and baseline characteristics of patients with different severities and virus shedding of COVID-19 infection

Clinical and laboratory features associated with disease progression or postponed viral shedding time

A total of 44 laboratory and clinical records on admission and during hospitalization were obtained and analyzed, including but not limited to the demographics, symptoms, signs, images, blood routine, immunocytochemistry, enzymatic and liver/renal function. These data were acquired within 24 h on admission. For the postponed viral shedding time discriminant model, six features (including one clinical feature, one demographic feature, one laboratory index and three therapeutic regimens) were selected to be the significant indicators of the postponed viral shedding time. They were listed as follows: the interval from the onset of symptoms to antiviral treatment (days) (x2), age (x4), CK (x29), Combined treatment of nebulized IFN-α with lopinavir/ritonavir (x34), Treatment with antibiotics (x38), Treatment with Methylprednisolone (x41) (Table 2).

Table 2 Treatment approaches among patients with different virus shedding of COVID-19 infection

For the disease progression discriminant model, eleven features (including one demographic, two epidemiological features and one imaging, seven laboratory indexes) were significantly associated with the progression of severe disease. These features include the interval from Jan 1st, 2020 (the day for the first case emerged in Liaoning) to individualized onset of symptoms (x2), age (x4),imaging characteristics (x5), epidemiological history of short-term exposure to Wuhan (x8), and immune features [Lymphocyte counts (x19), leukocyte counts (x20), N/L ratio (x21), CRP (x22), PCT (x23)], Lactic dehydrogenase (x24),creatine kinase (x29).

No multi-collinearity was found in the screened variables, since VIF (Variance Inflation Factor) values of the screened dependent variables were all less than 10. Thus, the model passed the multi-collinearity test (Additional file 1: Tables S2, S3).

Establishment and verification of the optimal discriminant models for disease progression and postponed viral shedding time

All samples were stratified and randomly divided into training and testing datasets, and all models were developed with fourfold cross-validation [10]. During construction of the discriminant model of postponed, the training datasets contained 55 postponed virus clearance cases and 45 non-postponed cases, while the testing dataset was consisted of 14 postponed virus clearance cases and 12 postponed cases. During construction of the discriminant model of disease progression, the training datasets contained 31 severe/critical cases and 69 non-severe cases, while the testing dataset was consisted of 8 severe/critical cases and 18 non-severe cases. Then, the discriminant models were constructed in training set via the selected indicators using several methods, including logistic regression, linear discriminant analysis, decision tree, K-nearest neighbor, and support vector machine method. The outcome was predicted in testing data set.

By analyzing total accuracy in both training and testing data set, the optimal discriminant models for the prediction of disease progression and postponed viral shedding time of COVID-19 infection were established via the logistic regression with the selected eleven and six features used as independent variables. In order to eliminate the influence of model overfitting on the prediction results and to improve the generalization ability of the model, L2 regularization \(\left[ {\mathop {\min }\limits_{\beta ,C} \frac{1}{2}\beta^{T} \beta + C\sum\nolimits_{i = 1}^{n} {\log (\exp ( - y_{i} (X_{i} \beta + c) + 1))} } \right]\) was used to constrain the objective function in the logistic regression optimization process. Regarding the term of L2 regularization, β represents the coefficient in front of a variable. C represents the regularization parameter in cost function, and it controls the strength of L2 regularization. Xi represents the value of the attribute variable and c represents the constant term.

"Viral shedding postponed or not" was used as a dependent variable (y1) for logistic regression, which yielded the equation:

$$Ln\left(\frac{p}{1-p}\right)=-0.244+0.2829{x}_{1}+0.2306{x}_{4}+0.234{x}_{28}-0.2847{x}_{34}+0.3084{x}_{38}+0.3025{x}_{41}.$$

Within the formula, p represents the probability of the patient being judged positive (viral shedding postponed).

"Severe or not" was used as a dependent variable (y2) for logistic regression. This yielded the equation:

$$Ln\left(\frac{p}{1-p}\right)=-0.348-0.099{x}_{2}+0.0945{x}_{4}+0.1176{x}_{5}+0.0398{x}_{8}-0.1646{x}_{19}+0.0914{x}_{20}+0.1254{x}_{21}+0.1397{x}_{22}+0.0814{x}_{23}+0.1294{x}_{24}+0.1099{x}_{29}.$$

Within the formula, p represents the probability of the patient being judged positive (severe disease). Notably, all data needs to be equally standardized [\(x^{*} = \frac{x - \mu }{\sigma }\)] before being plugged into the equation, and the corresponding mean and standard deviation were determined and shown in Additional file 1: Tables S4, S5.

Performance of the discriminant models

When applying the discriminant model of postponed viral shedding time onto the validation of the training set (Tables 3, 4; Additional file 1: Table S6), sensitivity was 0.727, specificity was 0.733, positive predictive value was 0.769, and negative predictive value was 0.688. In the test set, sensitivity was 0.786, specificity was 0.667, with positive predictive value 0.733 and negative predictive value 0.727. Recall rate was 78.6%. Accuracy was 0.732. The AUROC of the combinations of 7 clinical features was 0.73 in the training dataset (Fig. 1a) and 0.73 in the testing dataset (Fig. 1b).

Table 3 The summary of primary discriminant screening models for virus shedding of Covid-19 infection
Table 4 The summary of primary discriminant screening models for different severities of Covid-19 infection
Fig. 1
figure 1

AUROC curve of the postponed viral shedding discriminant model for training set (a) and testing set (b). AUROC area under receiving operating curve

According to the confusion matrix of discriminant model for disease progression (Tables 3, 4; Additional file 1: Table S7), during the validation of the training set, sensitivity was 0.774, specificity was 0.884, positive predictive value was 0.75, and negative predictive value was 0.897. In the test set, sensitivity was 0.75, specificity was 0.889, with positive predictive value 0.75and negative predictive value 0.889. The recall rate was 75%, and accuracy was 0.846. The AUROC was also constructed to evaluate the effectiveness of the discriminant models The AUROC of the combinations of 11 demographic, clinical and imaging/laboratory features was 0.829 (Fig. 2a) in the training dataset and 0.819 (Fig. 2b) in the testing dataset.

Fig. 2
figure 2

AUROC curve of the disease progression discriminant model for training set (a) and testing set (b). AUROC area under receiving operating curve

Discussion

This retrospective study tentatively developed two discriminant models consisting of several clinical and epidemiological features that could be quickly obtained on admission. To date, a certain number of models predict the risk of severe COVID-19 has been developed [5, 11]. However, there is geographic discrepancy of severity and mortality rate in patients with COVID-19 infection, and most of the previous prediction models were established to predict survival risk or progression to severe or critical state in the south part of China [5, 11]. Few of them were designed to predict the postponed viral shedding time. Besides, since there is no consensus about the therapeutic regimen of COVID-19 [12], the models established previously mainly consisted of variables that extracted from the clinical and laboratory parameters, with few of them incorporated epidemiological features or therapeutic regimen [11]. To the best of our knowledge, for the first time, the impact of epidemiological features and therapeutic regimen on the disease progression and postponed viral shedding time were described and integrated into the predicted polynomial equations among patients confirmed as COVID-19 infection.

Overall, these two discriminant models in the present study was demonstrated to have satisfactory sensitivity (> 72.00%) and specificity (> 73.00%), and they can be used as early warning tools to robustly and effectively predict the postponed viral shedding time and the severe/critical progression in patients with COVID-19 infection upon admission. A medical staff can easily predict in advance using these two discriminant models and conduct a timely and optimal medical intervention at an early stage.

SARS-CoV-2 in the respiratory tract, especially sputum, has been observed to be associated with a prolonged viral shedding and high viral load, when compared with the stool specimens [13]. In this study, the median viral shedding time was 14 days (IQR, 11–19). In order to effectively control the transmission resources on the imported cities or regions, it is essential to identify factors that associated with the COVID-19 PCR negative conversion time and to establish a prediction model that could individually estimate the risk of postponed viral shedding time among patients with COVID-19 upon admission to hospitals or shelters. In this study, we initially selected 43 variables probably associated with disease progression and postponed viral shedding time respectively according to the published literatures [4,5,6,7], then detected the risk factors by F-test and χ2 test analysis. As a result, 6 variables were identified as discriminatory factors and were devised to discriminant models for prediction of postponed viral shedding time of COVID-19 infection. Both older age and delayed antiviral treatment could give rise to the postponed viral shedding time. Consistently, these results were in accordance with another previous study indicated that the time from symptom onset to viral clearance slightly increased with age [14]. Besides, the association between delayed initiation of antiviral treatment and the prolonged virus shedding for influenza A (H7N9) and SARS-CoV-2 was also observed in previous studies, indicating that timely initiation of antiviral treatments necessary for viral clearance [15,16,17]. In addition, dual-antiviral therapy of nebulized IFN-α with lopinavir/ritonavir (x33) was negatively associated with the viral shedding time, whereas treatment with antibiotics (x37) and methylprednisolone (x40) were related to postponed viral shedding time. This was consistent with previous studies which demonstrated that SARS-CoV-2 was more susceptible to IFNs when compared to SARS-CoV, as the inhalation of Interferon-α (IFN-α) 2b could reduce the infection rate significantly and it can be used for prophylaxis of SARS-CoV-2 infection [18,19,20]. In addition, although Lopinavir/ritonavir (Kaletra) presented controversial therapeutic effects as compared to the standard care in vivo [22,23,24,25], it was found to have anti-SARS-CoV efficacy in vitro [21], thus Lopinavir/ritonavir has been recommended by the National Health Commission of China for the treatment of COVID-19 in the early period of 2020 [26]. To the best of our knowledge, for the first time this study observed that combined treatment of IFN-α inhalation and lopinavir/ritonavir was related to the shortened viral shedding time of COVID-19. Nevertheless, there is no additional benefit on virus clearance from an extra use of Arbidol, when combined with IFN-α and lopinavir/ritonavir. Anyway, statistical analysis can only stress association but cannot explain causality. A recent study also observed that early initiation of dual-antiviral treatment with lopinavir/ritonavir + IFN-α combination therapy could help shorten the duration of SARS-CoV-2 shedding when compared with triple antiviral treatment (opinavir/ritonavir + IFN-α + arbidol) [17]. This conclusion may provide a rationale for clinicians to optimize and to early initiate the antiviral treatments. In a previous study, the administration of corticosteroids has been observed to bring benefits for patients infected by COVID-19, since will prevent the use of mechanical ventilation and reduce the mortality of them [27]. Conversely, in this study, treatment with antibiotics (x37) and methylprednisolone (x40) could give rise to the postponed viral shedding time. Some previous studies have also observed that high-dose of corticosteroids was associated with increased mortality and longer viral shedding in patients with influenza A (H7N9) viral pneumonia and MERS [28,29,30]. Systemic corticosteroids could increase the risk of opportunistic infections (such as bacterial or fungal) that occur secondary to immunosuppression, and eventually hinders the virus clearance ability [31]. Besides, potential bacterial infections secondary to influenza viral infection that has been commonly seen in this study (32.8%) could also prolong the viral shedding time, as indicated by the evidence that the use of antibiotics was associated with postponed viral shedding time.

Noteworthy, this study for the first time observed that intervals from the first case emerged in Liaoning province (Jan 22st, 2020) to the individualized onset of symptoms (x2) could serve as an important prognostic feature in our early-warning model of disease progression. Indeed, nearly half of the confirmed cases at the early stage (in January) of COVID-19 outbreak were severe cases, whereas in the latter period (after February), the percentage of non-severe cases became dominant (76.9%) in Liaoning province. One explanation is that pathogens tend to reduce their virulence overtime in order to maximize their between-host transmission, which could result in the gradually lowered severity of COVID-19 infection on the imported regions [32, 33]. Besides, the human intervention efforts in China, such as to promptly admit suspected patients to the designated shelter hospitals, have effectively contributed to the decreased number of severe cases of COVID-19. Interestingly, short-term exposure to the epidemic area (Wuhan) during traveling or transfer could also result in a higher likelihood of progressing into severe stage of disease. Those transiently migrant individuals might have been primed by one or more prior coronavirus exposures during traveling or transfer around the epidemic area, thus have experienced the effects of antibody dependent enhancement (ADE) antigenic epitope heterogeneity due to antigenic epitope heterogeneity [34]. ADE hinders the ability to manage inflammation and result in disease progression. Another explanation would be attributed to the SARS-CoV-2 strains of L type, which are evolutionarily more aggressive and contagious. This virus strain of L type with altered virulence could probability be the underlying causal pathogen for patients who acquired infections via short-term exposure to the epidemic area [35].

In our discriminant models, both immune features (lymphocytes, neutrophils, N/L ratio, CRP) and enzymatic index (LDH, CK) obtained on admission were observed as the most significant prognostic factors for disease severity. These are consistent with the well-established within-host model in previous literature, which describes the interactions between SARS-CoV-2, host pulmonary epithelial cells and cytotoxic T lymphocyte cells [36]. Our results illustrate an earlier exhibition of abnormal laboratory features prior to the disease progression [37, 38]. Consistent with previous studies, our study revealed that advanced age in a strong risk factor for more severe COVID-19 infection [39,40,41]. This suggests that more intensive surveillance is necessary in elderly patients.

Anyway, some limitations should be noted in this study. First, the study design was retrospective, and the sample size may be insufficient for characterization of an entire population. However, by including all patients from eight designated hospitals throughout Liaoning province, we considered patients recruited in this study are representative of cases diagnosed with COVID-19 in Liaoning, China. Secondly, not enough severe/critical cases were recruited for the present study. This was possibly because the fatality rate of patients infected by COVID-19 in Liaoning province was lower (1.6%) than the whole national average level (3.2%) [42], and not resembling the previous studies from Wuhan [2, 43]. Thirdly, we only included the initial antiviral treatments as factors for prolonged shedding duration, so as to minimize the bias of different efficacy caused by different treatment courses as much as possible. This discrepancy may have had an unknown influence on the efficacy of the models.

Conclusion

The discriminant models reported here is the first attempt of its kind to develop an early warning tool for both postponed viral shedding time and disease progression in the northeast area of China. We believe that these models can help to judge the disease progression early enough in a great number of patients with COVID-19 infection, and this early judgment can facilitate a timely medical intervention, which will ultimately reduce the mortality of COVID-19.

Availability of data and materials

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

Abbreviations

COVID-19:

Coronavirus disease 2019

References

  1. Li Q, Guan X, Wu P, Wang X, Zhou L, Tong Y, et al. Early transmission dynamics in Wuhan, China, of novel coronavirus-infected pneumonia. N Engl J Med. 2020;382(13):1199–207.

    Article  CAS  Google Scholar 

  2. Chen N, Zhou M, Dong X, Qu J, Gong F, Han Y, et al. Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: a descriptive study. Lancet. 2020;395(10223):507–13.

    Article  CAS  Google Scholar 

  3. Wang Y, Wang Y, Chen Y, Quin Q. Unique epidemiological and clinical features of the emerging 2019 novel coronavirus pneumonia (COVID-19) implicate special control measures. J Med Virol. 2020;92(6):568–76.

    Article  CAS  Google Scholar 

  4. Ito A, Ishida T, Tokumasu H, Washio Y, Yamazaki A, Ito Y, et al. Prognostic factors in hospitalized community-acquired pneumonia: a retrospective study of a prospective observational cohort. BMC Pulm Med. 2017;17(1):78.

    Article  Google Scholar 

  5. Zhou Y, He Y, Yang H, Yu H, Wang T, Chen Z, et al. Development and validation a nomogram for predicting the risk of severe COVID-19: a multi-center study in Sichuan, China. PLoS ONE. 2020;15(5):e0233328.

    Article  CAS  Google Scholar 

  6. Sun L, Song F, Shi N, Liu F, Li S, Li P, et al. Combination of four clinical indicators predicts the severe/critical symptom of patients infected COVID-19. J Clin Virol. 2020;128:104431.

    Article  CAS  Google Scholar 

  7. Wynants L, Van Calster B, Collins GS, Riley RD, Heinze G, Schuit E, et al. Prediction models for diagnosis and prognosis of covid-19 infection: systematic review and critical appraisal. BMJ. 2020;3(369):2204.

    Google Scholar 

  8. World Health Organization. 2020. Clinical management of COVID-19: interim guidance, 27 May 2020. World Health Organization. https://apps.who.int/iris/handle/10665/332196.

  9. Metlay JP, Waterer GW, Long AC, Anzueto A, Brozek J, Crothers K, et al. Diagnosis and treatment of adults with community-acquired pneumonia An Official Clinical Practice Guideline of the American Thoracic Society and Infectious Diseases Society of America. Am J Resp Crit Care. 2019;200(7):E45–67.

    Article  Google Scholar 

  10. Wolff RF, Moons KG, Riley RD, Whiting PF, Westwood MC, Gary SR, for the PROBAST Group†, et al. PROBAST: a tool to assess the risk of bias and applicability of prediction model studies. Ann Intern Med. 2019;170:51–8.

    Article  Google Scholar 

  11. Liang W, Liang H, Ou L, Chen B, Chen A, Li C, China Medical Treatment Expert Group for COVID-19, et al. Development and validation of a clinical risk score to predict the occurrence of critical illness in hospitalized patients with COVID-19. JAMA Intern Med. 2020;180:e202033.

    Article  Google Scholar 

  12. Siemieniuk RA, Bartoszko JJ, Ge L, Zeraatkar D, Izcovich A, Kum E, et al. Drug treatments for covid-19: living systematic review and network meta-analysis. BMJ. 2020;30(370):m2980.

    Article  Google Scholar 

  13. Wölfel R, Corman VM, Guggemos W, Seilmaier M, Zange S, Müller MA, et al. Virological assessment of hospitalized patients with COVID-19. Nature. 2020.

  14. Mancuso P, Venturelli F, Vicentini M, Perilli C, Larosa E, Bisaccia E, et al. Temporal profile and determinants of viral shedding and of viral clearance confirmation on nasopharyngeal swabs from SARS-CoV-2-positive subjects: a population-based prospective cohort study in Reggio Emilia, Italy. BMJ Open. 2020;10(8):e040380.

    Article  Google Scholar 

  15. Wang Y, Guo Q, Yan Z, Zhou D, Cao B. Factors associated with prolonged viral shedding in patients with avian influenza A(H7N9) virus infection. J Infect Dis. 2018;217(11):1708–17.

    Article  CAS  Google Scholar 

  16. Zheng S, Wang Y, Yu F, Wang Y, Yu F, Cui D, et al. Benefit of early initiation of neuraminidase inhibitor treatment to hospitalized patients with avian influenza A (H7N9) virus. Clin Infect Dis. 2018;66(7):1054–60.

    Article  CAS  Google Scholar 

  17. Zuo Y, Liu Y, Zhong Q, Zhang K, Xu Y, Wang Z. Lopinavir/ritonavir and interferon combination therapy may help shorten the duration of viral shedding in patients with COVID-19: a retrospective study in two designated hospitals in Anhui. China J Med Virol. 2020. https://doi.org/10.1002/jmv.26127.

    Article  PubMed  Google Scholar 

  18. Sheahan TP, Sims AC, Leist SR, Schäfer A, Won J, Brown AJ, et al. Comparative therapeutic efficacy of remdesivir and combination lopinavir, ritonavir, and interferon beta against MERS-CoV. Nat Commun. 2020;11:222.

    Article  CAS  Google Scholar 

  19. Shen K, Yang Y. Diagnosis and treatment of 2019 novel coronavirus infection in children: a pressing issue. World J Pediatr. 2020;16:219.

    Article  CAS  Google Scholar 

  20. Lokugamage KG, Schindewolf C, Menachery VD. SARS-CoV-2 sensitive to type I interferon pretreatment. bioRxiv. 2020.

  21. Chu CM, Cheng VCC, Hung IFN, Wong MML, Chan KH, Chan KS, et al. Role of lopinavir/ritonavir in the treatment of SARS: initial virological and clinical findings. Thorax. 2004;59:252–6.

    Article  CAS  Google Scholar 

  22. Cao B, Wang Y, Wen D, Liu W, Wang J, Fan G, et al. A trial of lopinavir–ritonavir in adults hospitalized with severe Covid-19. N Engl J Med. 2020.

  23. Group RC. Lopinavir–ritonavir in patients admitted to hospital with COVID-19 (RECOVERY): a randomised, controlled, open-label, platform trial. Lancet. 2020.

  24. Li Y, Xie Z, Lin W, Cai W, Wen C, Guan Y, et al. An exploratory randomized, controlled study on the efficacy and safety of lopinavir/ritonavir or arbidol treating adult patients hospitalized with mild/moderate COVID-19 (ELACOI). medRxiv. 2020;13:e0195068.

    Google Scholar 

  25. Lim J, Jeon S, Shin HY, Kim MJ, Seong YM, Lee WJ, et al. Case of the index patient who caused tertiary transmission of COVID-19 infection in Korea: the application of lopinavir/ritonavir for the treatment of COVID-19 infected pnuemonia monitored by quantitative RT-PCR. J Korean Med Sci. 2020;35(6):e79.

    Article  CAS  Google Scholar 

  26. National Health Commission of China. Guideline for COVID-19 pneumonia of China; 2020.

  27. Edalatifard M, Akhtari M, Salehi M, Naderi Z, Jamshidi A, Mostafaei S, et al. Intravenous methylprednisolone pulse as a treatment for hospitalised severe COVID-19 patients: results from a randomised controlled clinical trial. Eur Respir J. 2020;56(6):2002808.

    Article  CAS  Google Scholar 

  28. Cao B, Gao H, Zhou B, Deng X, Hu C, Deng C, et al. Adjuvant corticosteroid treatment in adults with influenza A (H7N9) viral pneumonia. Crit Care Med. 2016;44(6):e318–28.

    Article  CAS  Google Scholar 

  29. Hu Y, Lu S, Song Z, Wang W, Hao P, Li J, et al. Association between adverse clinical outcome in human disease caused by novel influenza A H7N9 virus and sustained viral shedding and emergence of antiviral resistance. Lancet. 2013;381:2273–9.

    Article  Google Scholar 

  30. Arabi YM, Mandourah Y, Al-Hameed F, Sindi AA, Almekhlafi GA, Hussein MA, et al. Corticosteroid therapy for critically ill patients with middle east respiratory. Am J RespirCrit Care Med. 2018;197(6):757–67.

    Article  Google Scholar 

  31. Rice TW, Rubinson L, Uyeki TM, Vaughn FL, John BB, Miller RR 3rd, et al. NHLBI ARDS Network: critical illness from 2009 pandemic influenza A virus and bacterial coinfection in the United States. Crit Care Med. 2012;40:1487–98.

    Article  Google Scholar 

  32. Holshue ML, DeBolt C, Lindquist S, Lofy KH, Wiesman J, Bruce H, et al. First case of 2019 novel coronavirus in the United States. N Engl J Med. 2020;382(10):929–36.

    Article  CAS  Google Scholar 

  33. Lei H, Li Y, Xiao S, Lin CH, Norris SL, Wei D, et al. Routes of transmission of influenza A H1N1, SARS-CoV, and norovirus in air cabin: comparative analyses. Indoor Air. 2018;28:394–403.

    Article  CAS  Google Scholar 

  34. Jason A. Tetro, is COVID-19 receiving ADE from other coronaviruses? Microbes Infect. 2020;22(2):72–3.

    Article  Google Scholar 

  35. Guo YR, Cao QD, Hong ZS, Tan YY, Chen S-D, Jin H-J, et al. The origin, transmission and clinical therapies on coronavirus disease 2019 (COVID-19) outbreak—an update on the status. Mil Med Res. 2020;7(1):11.

    CAS  PubMed  PubMed Central  Google Scholar 

  36. Hattaf K, Yousfi N. Dynamics of SARS-CoV-2 infection model with two modes of transmission and immune response. Math Biosci Eng. 2020;17(5):5326–40.

    Article  Google Scholar 

  37. Sun L, Liu G, Song F, Shi N, Shi Y. Combination of four clinical indicators predicts the severe/critical symptom of patients infected COVID-19. J ClinVirol. 2020;128:104431.

    CAS  Google Scholar 

  38. Zhang H, Zhou P, Wei Y, Yue H, Wang Y, Hu M, et al. Histopathologic changes and SARS-CoV-2 immunostaining in the lung of a patient with COVID-19. Ann Intern Med. 2020;172(9):629–32.

    Article  Google Scholar 

  39. Chen R, Liang W, Jiang M, Guan W, Zhan C, Wang T, Medical Treatment Expert Group for COVID-19, et al. Risk factors of fatal outcome in hospitalized subjects with coronavirus disease 2019 from a nationwide analysis in China. Chest. 2020;158(1):97–105. https://doi.org/10.1016/j.chest.2020.04.010.

    Article  CAS  PubMed  Google Scholar 

  40. Wang D, Hu B, Hu C, Zhu F, Liu X, Zhang J, et al. Clinical characteristics of 138 hospitalized patients with 2019 novel coronavirus-infected pneumonia in Wuhan. China JAMA. 2020;323(11):1061–9.

    Article  CAS  Google Scholar 

  41. Ruan Q, Yang K, Wang W, Jiang L, Song J. Clinical predictors of mortality due to COVID-19 based on an analysis of data of 150 patients from Wuhan, China. Intensive Care Med. 2020;368(2):m641–3.

    Google Scholar 

  42. National Health Commission of the People’s Republic of China home page (http://www.nhc.gov.cn).

  43. Wang Z, Yang B, Li Q, Wen L, Zhang R. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet. 2020;395(10223):497–506.

    Article  Google Scholar 

Download references

Acknowledgements

We thank all the efforts from staff of Liaoning provincial health commission and Liaoning COVID-19 treatment team in the management of the confirmed patients. We also appreciate the editor and the reviewers for their constructive suggestions.

Funding

This study was supported by the emergency project of Liaoning provincial science and technology department, No. 2020JH2/10300003. Liaoning provincial science and technology department had no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.

Author information

Authors and Affiliations

Authors

Contributions

WYL: Conceptualization, Formal analysis, Investigation, Methodology, Resources, Supervision, Writing—original draft. YG: Formal analysis, Methodology. HH: Formal analysis, Investigation, Supervision. HZ: Formal analysis, Supervision. JK: Validation, Visualization. DW: Writing original draft, review and editing. WW: Conceptualization, Funding acquisition, Investigation, Validation, Visualization, Writing original draft, review and editing. All authors have read and approved the manuscript and ensure that this is the case.

Corresponding author

Correspondence to Wei Wang.

Ethics declarations

Ethics approval and consent to participate

this study was approved by the human research ethics committee of the first hospital of CMU (committee’s reference number: AF-OG-20-1.1-02), Shenjing hospital of CMU (committee’s reference number: 2020041002), Liaoning province peoples’ hospital, Shenyang Sixth People's Hospital, Jinzhou Infectious Disease Hospital, Tieling infectious Disease Hospital, Fuxin Infectious Disease Hospital, Central Hospital of Huludao City. The other committee’s reference number were not available. All the written informed consent was waived. Permissions were obtained to access the data, as this was included as a part of the formal ethics approval. The data used in this study was anonymized before its use.

Consent for publication

Not applicable.

Competing interests

All authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1:

The variable assignment and multi-collinearity analysis of independent variables in two models.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, WY., Wang, D., Guo, Y. et al. Discriminant models for the prediction of postponed viral shedding time and disease progression in COVID-19. BMC Infect Dis 22, 366 (2022). https://doi.org/10.1186/s12879-022-07338-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12879-022-07338-x

Keywords