A deep learning integrated radiomics model for identification of coronavirus disease 2019 using computed tomography

Zhang, Xiaoguo; Wang, Dawei; Shao, Jiang; Tian, Song; Tan, Weixiong; Ma, Yan; Xu, Qingnan; Ma, Xiaoman; Li, Dasheng; Chai, Jun; Wang, Dingjun; Liu, Wenwen; Lin, Lingbo; Wu, Jiangfen; Xia, Chen; Zhang, Zhongfa

doi:10.1038/s41598-021-83237-6

Download PDF

Article
Open access
Published: 16 February 2021

A deep learning integrated radiomics model for identification of coronavirus disease 2019 using computed tomography

Xiaoguo Zhang¹^na1,
Dawei Wang²^na1,
Jiang Shao ORCID: orcid.org/0000-0001-9153-8006³,
Song Tian²,
Weixiong Tan²,
Yan Ma¹,
Qingnan Xu¹,
Xiaoman Ma¹,
Dasheng Li⁴,
Jun Chai⁵,
Dingjun Wang⁶,
Wenwen Liu³,
Lingbo Lin³,
Jiangfen Wu²,
Chen Xia² &
…
Zhongfa Zhang¹

Scientific Reports volume 11, Article number: 3938 (2021) Cite this article

2736 Accesses
22 Citations
4 Altmetric
Metrics details

Subjects

Abstract

Since its first outbreak, Coronavirus Disease 2019 (COVID-19) has been rapidly spreading worldwide and caused a global pandemic. Rapid and early detection is essential to contain COVID-19. Here, we first developed a deep learning (DL) integrated radiomics model for end-to-end identification of COVID-19 using CT scans and then validated its clinical feasibility. We retrospectively collected CT images of 386 patients (129 with COVID-19 and 257 with other community-acquired pneumonia) from three medical centers to train and externally validate the developed models. A pre-trained DL algorithm was utilized to automatically segment infected lesions (ROIs) on CT images which were used for feature extraction. Five feature selection methods and four machine learning algorithms were utilized to develop radiomics models. Trained with features selected by L1 regularized logistic regression, classifier multi-layer perceptron (MLP) demonstrated the optimal performance with AUC of 0.922 (95% CI 0.856–0.988) and 0.959 (95% CI 0.910–1.000), the same sensitivity of 0.879, and specificity of 0.900 and 0.887 on internal and external testing datasets, which was equivalent to the senior radiologist in a reader study. Additionally, diagnostic time of DL-MLP was more efficient than radiologists (38 s vs 5.15 min). With an adequate performance for identifying COVID-19, DL-MLP may help in screening of suspected cases.

Assessing clinical applicability of COVID-19 detection in chest radiography with deep learning

Article Open access 21 April 2022

João Pedrosa, Guilherme Aresta, … Aurélio Campilho

Deep learning-based model for detecting 2019 novel coronavirus pneumonia on high-resolution computed tomography

Article Open access 05 November 2020

Jun Chen, Lianlian Wu, … Honggang Yu

Development and evaluation of an artificial intelligence system for COVID-19 diagnosis

Article Open access 09 October 2020

Cheng Jin, Weixiang Chen, … Jianjiang Feng

Introduction

Since its first outbreak in Wuhan, China, Coronavirus Disease 2019 (COVID-19) has been extensively spreading all over the world and caused a global pandemic. Real-time reverse-transcriptase-polymerase chain reaction (rRT-PCR) amplification of SARS-CoV-2 serves as the gold standard for COVID-19 diagnosis. However, false-negative results and long turnaround time limit the clinical efficacy of rRT-PCR testing in rapid COVID-19 screening^1,2, especially during disease outbreaks. Given that about 97% of COVID-19 patients presented chest abnormalities^1,3, chest CT examination has been regarded as a prompt and complementary reference to rRT-PCR testing for screening COVID-19 patients^3,4. Yet, an increasing number of chest CT examinations would overload radiologists and subtle chest abnormalities such as ground-glass opacities could be easily missed. Thus, an efficient and reliable CT-based auxiliary tool is urgently needed to help radiologists screen COVID-19 patients.

Over the past few years, different deep learning (DL)-based artificial intelligence (AI) diagnostic systems were developed and deployed in clinical practice to assist radiologists, such as the DL-based pulmonary nodules diagnostic system⁵. Since the outbreak of COVID-19, multiple machine learning (ML) and DL models for detecting lesions, assessing disease severity, and predicting disease prognosis of COVID-19 have been developed^{6,7,8,9,10,11,12,13}. Wang et al. developed a DL model to provide clinical diagnosis before the pathogenic examinations by extracting radiographical features of COVID-19⁸. Yue et al. built a ML model using CT images to estimate the hospital stay of COVID-19 patients¹⁴. Another study developed a radiomics nomogram using features extracted from the lung parenchyma window to predict COVID-19¹³. When reviewing published literature on prediction models for COVID-19 diagnosis¹⁵, we noticed that regions of interest (ROIs) annotation which was time-consuming but indispensable for model development were one of the common challenges for both deep learning and radiomics modeling. Moreover, though radiomics is a widely utilized method in the field of medical imaging¹⁶, lack of automatic ROI annotation is a key hurdle during its clinical application because each case needs to be annotated before being applied to the radiomics models.

In recent years, radiomics is developed rapidly and has attracted broad attention for its potential to identify subtle disease characteristics that failed to be discovered by naked eyes. However, the performance of the radiomics model could be greatly influenced by different feature selection methods and classification algorithms^17,18,19. To achieve the best model, feature selection and classification algorithm need to be well-designed. To our knowledge, no research so far has tried to evaluate the effects of feature selection methods and classification algorithms on the performance of radiomics models for distinguishing COVID-19 and other community acquired pneumonia (CAP) patients. In this study, we solved the time-consuming ROI annotation problem by integrating a DL segmentation algorithm with the radiomics approach, and developed an end-to-end model using CT images to screen COVID-19 patients. Additionally, cross-combinations of five feature selection methods and four machine learning algorithms were used to develop the optimal radiomics model. Furthermore, the clinical feasibility of the model was validated on an external dataset in terms of classification performance and time efficiency.

Materials and methods

Patients

This study was approved by the Institutional Reviewing Board (IRB) of Jinan Infectious Disease Hospital, Beijing Haidian Hospital, and Inner Mongolia Autonomous Region People's Hospital. Informed consent was waived by IRBs since patient information was anonymized to ensure privacy. All methods were carried out in accordance with relevant guidelines and regulations. For model development, a total of 293 patients (371 CT scans, some patients underwent several CT examinations) were retrospectively collected from Jinan Infectious Disease Hospital and Beijing Haidian Hospital between Jan 25 and Feb 15, 2020, including 98 COVID-19 patients, 157 other CAP patients, and 38 etiologically confirmed influenza and mycoplasma pneumonia patients. To further validate model robustness, 93 patients (31 COVID-19 patients and 62 CAP patients, 95 CT scans) were enrolled from Inner Mongolia Autonomous Region People's Hospital between Jan 26 and Feb 17, 2020, and constituted an independent external testing dataset. Of note, rRT-PCR testing for SARS-COV-2 served as the gold standard to diagnose COVID-19 patients in this study. Detailed clinical information of the enrolled patients were summarized in Table 1.

Table 1 Characteristics of enrolled patients and collected CT scans for model development and validation.

Full size table

In addition, patients’ characteristics were summarized, including clinical stages and imaging manifestations. In particular, over 65% of the included COVID-19 patients were clinically classified as the moderate type, followed by 27.1% mild type, 2.3% severe type, and 0.8% critical type (Appendix Table S1). In terms of imaging manifestations on chest CT scans, multifocal small patchy shadows, ground glass opacity (GGO), and consolidation were the main lesions found in both COVID-19 and CAP cases. As can be seen in Appendix Table S2, GGO was more common and consolidation was less common in COVID-19 patients than among CAP cases, which could be attributed to the relatively larger proportion of mild or moderate clinical type patient. Other reported imaging manifestations, including infiltrate and pleural effusion, were rare among the included patients of this study.

DL segmentation algorithms

The DL segmentation algorithm was a built-in feature on InferScholar platform by Infervision (https://www.infervision.com/, Beijing, CHINA) and applied to automatically delineate ROIs in this study. The segmentation algorithm was trained with 507 sets of CT scans from suspected COVID-19 patients in Wuhan area. Coarse annotation strategy was utilized in which major lesions with multifocal small patchy shadowing, ground-glass opacities, and consolidations were selectively annotated on CT images by experienced radiologists (Fig. 1a). During algorithm training, CT images of different sizes were first resized to 512 × 512 using bilinear interpolation method as previously described²⁰ and the CT values of images were rescaled at window center of -600 and window width at 1500 so that the pneumonia lesions could be presented and easily distinguished (Fig. 1b). Annotated lesions on each slide were merged into a 3D ROI after segmentation (Fig. 1c). Training and testing of the DL segmentation algorithm were performed by using Mxnet (version 1.6.0) and CUDA (version 10.0).

To briefly summarize the structure of the DL segmentation algorithm, U-Net was the main architecture of the algorithm in which Xception^21,22 served as the backbone (sFig. 1). The annotation performance was evaluated by the Dice index. Dice Loss equation for loss function was as followed:

$$Dice Loss= 1-\frac{2*Pred*Anno}{Pred+Anno}$$

where Pred denotes lesion pixels predicted by the DL segmentation algorithm and Anno represents the reference lesion pixels annotated by senior radiologists.

Feature extraction

In this study, we used Python (version 3.8.1) to call the pyRadiomics package (version 2.2.0) for radiomics feature extraction. A total of 1454 features were extracted from the DL algorithm segmented ROIs and can be subdivided into 7 classes, including first-order (FOS), shape, Gray Level Cooccurence Matrix (GLCM), Gray Level Run Length Matrix (GLRLM), Gray Level Size Zone Matrix (GLSZM), Neighbouring Gray Tone Difference Matrix (NGTDM), and Gray Level Dependence Matrix (GLDM) features. Detailed information on feature extraction methods and parameters²³, and the number of extracted features for each feature class was summarized in Appendix Table S3.

Feature selection

In order to select discriminating features, five methods were applied and compared in this study, including L1 regularized least absolute shrinkage and selection operator (L1-LASSO), L1 regularized logistic regression (L1-LR), L1 regularized ridge regression (L2-Ridge), eXtreme gradient boosting (XGBoost), and Z-test^24,25. Five-fold cross-validation method was utilized. All methods were implemented by calling the scikit-learn (version 0.20.2) package and the optimal one with the highest accuracy was chosen as the final dimensionality reduction method.

ML model training and testing

For unbiased estimation of diagnostic accuracy, data from two hospitals (Jinan Infectious Disease Hospital and Beijing Haidian Hospital) was divided into training and internal testing sets at a ratio of 2:1; data from the third hospital was utilized as an external testing set. With the selected features, four independent ML models were trained on the training set, including support vector machine (SVM), multi-layer perceptron (MLP), logistic regression (LR), and XGBoost. These methods were all implemented by calling the scikit-learn (version 0.20.2) package. To select the best model and the optimal hyper-parameters for each model, five-fold cross-validation was performed on the training set, in which 80% of the data was randomly selected to train models and the remaining 20% data (tuning set) validated the trained models. Training and validation process repeated five times until each cross section was part of the tuning set once. In model testing stage, ensemble models from five-fold-cross validation were used to discriminated COVID-19 and CAP patients while the model performance was evaluated on internal and external testing datasets.

Reader study

To further evaluate the clinical feasibility of these proposed models, two radiologists (one senior radiologist with 15 years’ experience and one junior radiologist with 5 years’ experience) participated in the reader study on both the internal and external testing datasets. The senior radiologist and junior radiologist both had taken part in the fight against COVID-19 in the front line. They diagnosed cases independently only based on the CT imaging information in the reader study. Their diagnostic performance was compared with the proposed end-to-end models. Of note, the diagnostic efficiency was evaluated in terms of diagnostic time-consumption.

Model evaluation and statistical analysis

Diagnostic performance was evaluated by classification sensitivity, specificity, precision, accuracy, F1 score, G-Mean, and area under ROC curve (AUC) and PR curve (AP). PR curve, a measure complementary to the ROC curve²⁶, was utilized as well just in case of the possible asymmetrical data problems. Categorical variables were expressed in terms of frequency and statistically analyzed by Chi-square test. P < 0.05 was considered statistically significant. Continuous variables were represented by the means ± SD. A two-sided 95% confidence interval for AUC or AP was constructed following the approach of Hanley and McNeil (1982)²⁷. Cohen’s Kappa coefficient was calculated to measure the agreement between ground-truth results and model predictions. All statistical analyses were performed with the R statistical package (The R Foundation for Statistical Computing, Vienna, Austria).

Results

Performance of feature selection methods and ML models

The pre-trained DL segmentation algorithm achieved a Dice index of 0.69 and also displayed an adequate performance on the CT scans in this study. Much more lesions were annotated by DL algorithms comparing the coarse annotation method. Examples of coarse annotated and AI labeled ROIs were shown in Fig. 2. Of the five selection methods, L1-LR which selected 108 radiomics features enabled three ML models to achieve the highest AUC on validation set and was thus selected as the optimal method (sFig. 2, Fig. 1d). Pearson Correlation Coefficient (PCC) among the 108 selected features were calculated; features with PPC < 0.8 and 0.5 constituted another two feature sets, respectively (Appendix Tables S4 and S5). Feature redundancy was examined by training models with these three features sets and it turned out that 108 features guaranteed the optimal model performance (sFig. S, Figs. 5a, and 6a). All selected features were listed in Appendix Table S6 while features with the top 20 absolute coefficients were shown in Fig. 3 as the representatives.

After training, MLP, SVM, LR, and XGBoost obtained a mean AUC of 0.995, 0.964, 0.995, and 0.995 on the training set; the higher the AUC on training set, the better the model fit. Meanwhile, the mean AUC of 0.873 (95% confidence interval (CI) 0.812–0.934), 0.872 (95% CI 0.846–0.898), 0.858 (95% CI 0.807–0.909), and 0.815 (95% CI 0.772–0.858) were obtained on validation set, respectively (Fig. 4, sFig. 4). L1-LR + classifier MLP (DL-MLP) demonstrated the optimal performance during the training.

Performance evaluation of the end-to-end models

ML models integrated with DL segmentation algorithm constituted the end-to-end models. We then evaluated the performance of these models on testing datasets. DL-MLP outperformed other models with an AUC of 0.922 (95% CI 0.856–0.988), an F1 score of 0.841, and a kappa coefficient of 0.761 on the internal testing dataset; the AP reached 0.851 (95% CI 0.762–0.939) (Fig. 5a,b). In contrast, the AUC of DL-SVM, DL-LR, and DL-XGBoost were 0.927 (95% CI 0.864–0.991), 0.918 (95% CI 0.851–0.986), and 0.882 (95% CI 0.802–0.961), respectively. Detailed diagnostic performance metrics of these models were listed in Table 2. In addition, subgroup analysis was performed between COVID-19 and etiologically confirmed influenza pneumonia or mycoplasma pneumonia and DL-MLP again demonstrated an adequate classification performance with AUC of 0.891 (95% CI 0.805–0.977) and 0.933 (95% CI 0.865–1.000) (Fig. 5c).

Table 2 Detailed diagnostic metrics of end-to-end models and radiologists on internal and external testing datasets.

Full size table

Furthermore, DL-MLP achieved better performance on the external testing dataset with an AUC of 0.959 (95% CI 0.910–1.000), an F1 score of 0.841, and a kappa coefficient of 0.750; its AP reached 0.937 (95% CI 0.877–0.997). Detailed diagnostic performance metrics of other models were summarized in Table 2 and Fig. 6. Notably, it just took the end-to-end model 38 s to diagnose each input CT scan, indicating its high efficiency in practice.

Performance evaluation of the participated radiologists in a reader study

In comparison to the junior radiologist, senior radiologist achieved an overall better performance with the diagnostic accuracy, precision, sensitivity, and specificity of 0.90, 0.83, 0.88, and 0.91 on the internal testing dataset and 0.926, 0.964, 0.818, and 0.984 on external testing dataset (Table 2). The radiologists’ diagnostic performance was dotted in ROC and PR curves according to their sensitivity, specificity, and precision (Figs. 5a and 6a). The kappa coefficient of senior radiologist reached 0.781 and 0.832 on internal and external testing datasets (Figs. 5b and 6b). In addition, junior and senior radiologists spent an average time of 5.29 min and 5 min to diagnose a set of CT images.

Discussion

Early and timely detection of COVID-19 patients is of great importance in containing the pandemic. The practice has proved that the CT examination serves as a complementary approach to rRT-PCR for COVID-19 screening in some emergent scenarios^28,29,30. By integrating DL segmentation algorithm with radiomics, we developed an end-to-end model using CT images from multiple medical centers to screen COVID-19 patients. Automatically delineated ROIs by DL segmentation algorithm greatly enhanced the application potentials of radiomics models in clinical practice. Trained with selected radiomics features, DL-MLP model demonstrated comparable diagnostic performance to a senior radiologist with 15 years’ experience on internal and external testing datasets.

To date, many DL and radiomics models were developed since the outbreak of COVID-19, focusing on screening, diagnosis, and prognosis of COVID-19¹⁵. However, due to limited medical labor resources and diffused lesion distribution across multiple sections, ROI annotations remained challenging in many of the current studies^8,9,11. In our study, we utilized a DL segmentation algorithm that was trained with 507 sets of coarse annotated suspected COVID-19 CT scans. Lesions were selectively annotated on certain CT sections where they predominantly presented. This strategy reduced the annotation workload when medical resources were scarce and eventually achieved adequate results. The DL segmentation algorithm enabled direct application of radiomics models in clinical practice by saving the need for manual annotation, which is of great value to be extended to other disease scenarios when the radiomics approach was utilized.

Of note, five feature selection methods and four machine learning algorithms were utilized so as to discover the optimal radimocis model for identifying COVID-19 patients. A total of 20 models were tested and compared on both internal and external testing datasets in terms of AUC. Optimal feature selection methods were firstly screened by comparing the corresponding model performance on validation sets. Three of the four machine learning models achieved the best AUC when trained with L1-LR selected features. Redundancy of L1-LR selected features was further tested by modeling without features with strong correlations (PCC ≥ 0.8; PCC ≥ 0.5). All L1-LR selected features were finally utilized because of the robust performance on internal and external testing datasets. Machine learning models were trained with L1-LR selected features. Based on the performance on internal and external testing datasets in terms of AUC, AP, and other diagnostic performance metrics, the optimal model MLP was further analyzed in subgroups and compared with radiologists.

Current diagnostic performance for COVID-19 varied from model to model due to different development datasets and techniques. Detection sensitivity ranged from 0.83 to 1 while the AUC ranged from 0.81 to 0.996^15,31,32. A recent study ensembled transfer learning with deep convolutional neural networks (15 architectures) to detect COVID-19 on CT images and achieved the best performance with sensitivity of 0.854, accuracy of 0.85, and precision of 0.857³³. Another DL-based multi-view fusion model was developed using CT images with the maximum lung regions in axial, coronal and sagittal views and achieved AUC, accuracy, sensitivity and specificity of 0.819, 0.760, 0.811 and 0.615 on testing set, respectively³². In comparison, our study shared similar data size and achieved a better diagnostic performance as evidenced by the AUC, accuracy, sensitivity and specificity of 0.959, 0.884, 0.879 and 0.887 on the external testing dataset. Similarly, the multi-view fusion model solved annotation problem by using certain whole CT images, however, that may also result in insufficient features to properly detect COVID-19³². Another deep learning model was trained with a large dataset to identify COVID-19 from other pneumonia³⁴. Like this model, our proposed DL-MLP could also distinguish COVID-19 from etiologically confirmed influenza and mycoplasma pneumonia and achieved better performance in terms of AUC.

Notably, there were also developed radiomics models to distinguish COVID-19, predict hospital stay, disease severity, and prognosis of COVID-19 patients^10,12,13,14. An earlier radiomics study that utilized both lesion and normal region patches cropped from COVID-19 CT scans achieved a higher classification accuracy of 99.68% with GLSZM features³⁵. However, this study ignored the within-patient correlation between the two classes of image patches. Meanwhile, radiomics nomogram for predicting COVID-19 was also developed by combining radiomics scores and significantly associated CT characteristics¹³ and obtained a comparable performance to ours. Yet, note that in addition to internal and external testing sets, the proposed DL-MLP model was further validated by comparing with experienced radiologists on external testing dataset, which substantiated the model’s greater application potentials in clinical scenarios.

The diagnostic performance of two radiologists served as the benchmark to evaluate the diagnostic efficacy of models in this study. Unlike studies with imbalanced classifications of data whose diagnostic threshold was determined by G-Mean³⁶, our model output the normalized predicted probabilities of each class and achieved an adequate performance on identifying COVID-19 with a diagnostic threshold of 0.5 (sFig. 5). Notably, diagnostic performance of the participating radiologists on identification of COVID-19 was generally comparable to radiologists in other studies with similar sensitivity, specificity and accuracy^11,37. In consistent with previous DL studies^11,37,38, DL-MLP demonstrated comparable diagnostic performance to the experienced senior radiologist on both internal and external testing datasets in terms of detection sensitivity, specificity and accuracy. Adequate performance on the external testing dataset further increased the reliability of the end-to-end DL-MLP model. In addition, diagnostic efficiency is another important parameter to evaluate model feasibility. Comparable reading time of the radiologists was found in the current and previous study (5.15 min vs. 6.5 min)^11,38; in contrast, the model made a diagnosis in about 38 s which was much more efficient.

There are still limitations in this study that can be improved in future research. More radiologists for reader study, the utilization of AI-assisted reading mode, and detailed subgroup analyses could further validate the model’s feasibility in clinical practice. In addition, integrating clinical information other than CT images could potentially improve diagnostic performance.

In conclusion, an end-to-end DL-MLP model was developed by integrating the DL segmentation algorithm with the radiomoics approach to efficiently screen COVID-19 patients from other CAP patients. DL-MLP achieved an adequate diagnostic performance that was comparable to a senior radiologist on both internal and external testing datasets, demonstrating the algorithm’s great potential to assist radiologists to screen suspected COVID-19 cases in joint with rRT-PCR testing in emergent scenarios or high prevalence areas.

Data availability

The data will be made available to others on reasonable requests to the corresponding author.

References

Li, D. et al. False-negative results of real-time reverse-transcriptase polymerase chain reaction for severe acute respiratory syndrome coronavirus 2: role of deep-learning-based CT diagnosis and insights from two cases. Kor. J. Radiol. 21, 505–508. https://doi.org/10.3348/kjr.2020.0146 (2020).
Article Google Scholar
Chen, Z. et al. A patient with COVID-19 presenting a false-negative reverse transcriptase polymerase chain reaction result. Kor. J. Radiol. 21, 623–624. https://doi.org/10.3348/kjr.2020.0195 (2020).
Article Google Scholar
Ai, T. et al. Correlation of Chest CT and RT-PCR Testing in Coronavirus Disease 2019 (COVID-19) in China: A Report of 1014 Cases. Radiology, 200642, doi:https://doi.org/10.1148/radiol.2020200642 (2020).
Long, C. et al. Diagnosis of the Coronavirus disease (COVID-19): rRT-PCR or CT?. Eur. J. Radiol. 126, 108961. https://doi.org/10.1016/j.ejrad.2020.108961 (2020).
Article PubMed PubMed Central Google Scholar
Liu, K. et al. Evaluating a fully automated pulmonary nodule detection approach and its impact on radiologist performance. Radiol. Artif. Intell. Vol. 1 (2019).
Zheng, C. et al. Deep learning-based detection for COVID-19 from chest CT using weak label. medRxiv https://doi.org/10.1101/2020.03.12.20027185 (2020).
Song, Y. et al. Deep learning enables accurate diagnosis of novel coronavirus (COVID-19) with CT images. medRxiv. https://doi.org/10.1101/2020.02.23.20026930 (2020).
Wang, S. et al. A deep learning algorithm using CT images to screen for Corona Virus Disease (COVID-19). medRxiv. https://doi.org/10.1101/2020.02.14.20023028 (2020).
Wang, B. et al. AI-assisted CT imaging analysis for COVID-19 screening: Building and deploying a medical AI system. Appl. Soft Comput. 106897. https://doi.org/10.1016/j.asoc.2020.106897 (2020).
Cai, W. et al. CT quantification and machine-learning models for assessment of disease severity and prognosis of COVID-19 patients. Acad. Radiol. https://doi.org/10.1016/j.acra.2020.09.004 (2020).
Article PubMed PubMed Central Google Scholar
Jin, C. et al. Development and evaluation of an artificial intelligence system for COVID-19 diagnosis. Nat. Commun. 11, 5088. https://doi.org/10.1038/s41467-020-18685-1 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Wu, Q. et al. Radiomics analysis of computed tomography helps predict poor prognostic outcome in COVID-19. Theranostics 10, 7231–7244. https://doi.org/10.7150/thno.46428 (2020).
Article CAS PubMed PubMed Central Google Scholar
Fang, X., Li, X., Bian, Y., Ji, X. & Lu, J. Radiomics nomogram for the prediction of 2019 novel coronavirus pneumonia caused by SARS-CoV-2. Eur. Radiol. 30, 6888–6901. https://doi.org/10.1007/s00330-020-07032-z (2020).
Article CAS PubMed Google Scholar
Yue, H. et al. Machine learning-based CT radiomics method for predicting hospital stay in patients with pneumonia associated with SARS-CoV-2 infection: a multicenter study. Ann. Transl. Med. 8, 859. https://doi.org/10.21037/atm-20-3026 (2020).
Article CAS PubMed PubMed Central Google Scholar
Wynants, L. et al. Prediction models for diagnosis and prognosis of covid-19 infection: Systematic review and critical appraisal. BMJ 369, m1328. https://doi.org/10.1136/bmj.m1328 (2020).
Article PubMed PubMed Central Google Scholar
Lambin, P. et al. Radiomics: the bridge between medical imaging and personalized medicine. Nat. Rev. Clin. Oncol. 14, 749–762. https://doi.org/10.1038/nrclinonc.2017.141 (2017).
Article PubMed Google Scholar
Zhang, B. et al. Radiomic machine-learning classifiers for prognostic biomarkers of advanced nasopharyngeal carcinoma. Cancer Lett. 403, 21–27. https://doi.org/10.1016/j.canlet.2017.06.004 (2017).
Article CAS PubMed Google Scholar
Wu, W. et al. Exploratory study to identify radiomics classifiers for lung cancer histology. Front. Oncol. 6, 71. https://doi.org/10.3389/fonc.2016.00071 (2016).
Article PubMed PubMed Central Google Scholar
Parmar, C., Grossmann, P., Bussink, J., Lambin, P. & Aerts, H. Machine learning methods for quantitative radiomic biomarkers. Sci. Rep. 5, 13087. https://doi.org/10.1038/srep13087 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Amyar, A., Modzelewski, R., Li, H. & Ruan, S. Multi-task deep learning based CT imaging analysis for COVID-19 pneumonia: Classification and segmentation. Comput. Biol. Med. 126, 104037. https://doi.org/10.1016/j.compbiomed.2020.104037 (2020).
Article CAS PubMed PubMed Central Google Scholar
Brox, O. R. U-Net: convolutional networks for biomedical image segmentation. Med. Image Comput. Comput. Assist. Intervent. (MICCAI) 9351, 234–241 (2015).
Chollet, F. Xception: Deep learning with depthwise separable convolutions. arXiv (2016).
Kocak, B., Durmaz, E. S., Ates, E. & Kilickesmez, O. Radiomics with artificial intelligence: a practical guide for beginners. Diagn. Intervent. Radiol. 25, 485–495. https://doi.org/10.5152/dir.2019.19321 (2019).
Article Google Scholar
Ng, A. Y. Feature selection, L 1 vs. L 2 regularization, and rotational invariance. Proceedings of the 21 st International Conference on Machine Learning (2004).
Tibshirani, R. Regression Shrinkage and Selection via the lasso. J. R. Stat. Soc. Ser. B (methodological)) 58, 267–288 (1996).
O’Reilly, C. & Nielsen, T. Automatic sleep spindle detection: benchmarking with fine temporal resolution using open science tools. Front. Hum. Neurosci. 9, 353. https://doi.org/10.3389/fnhum.2015.00353 (2015).
Article PubMed PubMed Central Google Scholar
Hanley, J. A. & McNeil, B. J. The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 143, 29–36. https://doi.org/10.1148/radiology.143.1.7063747 (1982).
Article CAS PubMed Google Scholar
Wang, Y., Hou, H., Wang, W. & Wang, W. Combination of CT and RT-PCR in the screening or diagnosis of COVID-19. J. Global Health 10, 010347. https://doi.org/10.7189/jogh.10.010347 (2020).
Article Google Scholar
Hao, W. & Li, M. Clinical diagnostic value of CT imaging in COVID-19 with multiple negative RT-PCR testing. Travel Med. Infect. Dis. 34, 101627. https://doi.org/10.1016/j.tmaid.2020.101627 (2020).
Article PubMed PubMed Central Google Scholar
He, J. L. et al. Diagnostic performance between CT and initial real-time RT-PCR for clinically suspected 2019 coronavirus disease (COVID-19) patients outside Wuhan China. Respir. Med. 168, 105980. https://doi.org/10.1016/j.rmed.2020.105980 (2020).
Article PubMed PubMed Central Google Scholar
Harmon, S. A. et al. Artificial intelligence for the detection of COVID-19 pneumonia on chest CT using multinational datasets. Nat. Commun. 11, 4080. https://doi.org/10.1038/s41467-020-17971-2 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Wu, X. et al. Deep learning-based multi-view fusion model for screening 2019 novel coronavirus pneumonia: A multicentre study. Eur. J. Radiol. 128, 109041. https://doi.org/10.1016/j.ejrad.2020.109041 (2020).
Article PubMed PubMed Central Google Scholar
Gifani, P., Shalbaf, A. & Vafaeezadeh, M. Automated detection of COVID-19 using ensemble of transfer learning with deep convolutional neural network based on CT scans. Int. J. Comput. Assist. Radiol. Surg. https://doi.org/10.1007/s11548-020-02286-w (2020).
Article PubMed PubMed Central Google Scholar
Wang, S. et al. A fully automatic deep learning system for COVID-19 diagnostic and prognostic analysis. Eur. Respir. J. 56. https://doi.org/10.1183/13993003.00775-2020 (2020).
Barstugan, M. O., Umut; Ozturk, Saban. Coronavirus (COVID-19) classification using CT Images by machine learning methods arXiv (2020).
Song, B., Zhang, G., Zhu, W. & Liang, Z. ROC operating point selection for classification of imbalanced data with application to computer-aided polyp detection in CT colonography. Int. J. Comput. Assist. Radiol. Surg. 9, 79–89. https://doi.org/10.1007/s11548-013-0913-8 (2014).
Article PubMed PubMed Central Google Scholar
Javor, D. et al. Deep learning analysis provides accurate COVID-19 diagnosis on chest computed tomography. Eur. J. Radiol. 133, 109402. https://doi.org/10.1016/j.ejrad.2020.109402 (2020).
Article CAS PubMed PubMed Central Google Scholar
Chen, J. et al. Deep learning-based model for detecting 2019 novel coronavirus pneumonia on high-resolution computed tomography. Sci. Rep. 10, 19196. https://doi.org/10.1038/s41598-020-76282-0 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The authors would like to thank the COVID-19 Research Grant no. 202001003 from Jinan Science and Technology Bureau, Jinan, China.

Author information

These authors contributed equally: Xiaoguo Zhang and Dawei Wang.

Authors and Affiliations

Department of Respiratory Medicine, Jinan Infectious Disease Hospital, Shandong University, 22029# Jing-Shi Road, Jinan, 250021, Shandong, People’s Republic of China
Xiaoguo Zhang, Yan Ma, Qingnan Xu, Xiaoman Ma & Zhongfa Zhang
Institute of Advanced Research, Infervision Medical Technology Co., Ltd., 18F, Building E. Yuanyang International Center, Chaoyang District, Beijing, 100025, People’s Republic of China
Dawei Wang, Song Tian, Weixiong Tan, Jiangfen Wu & Chen Xia
Department of Radiology, Jinan Infectious Disease Hospital, Shandong University, 22029# Jing-Shi Road, Jinan, 250021, People’s Republic of China
Jiang Shao, Wenwen Liu & Lingbo Lin
Department of Radiology, Beijing Haidian Section of Peking University Third Hospital (Beijing Haidian Hospital), 29# Zhongguancun Road, Haidian District, Bejing, 100080, People’s Republic of China
Dasheng Li
Department of Radiology, Inner Mongolia Autonomous Region People’s Hospital, 20# Zhaowuda Road, Hohhot, 010017, People’s Republic of China
Jun Chai
Department of Radiology, Affiliated Jinhua Hospital, Zhejiang University School of Medicine, 365# Renmin East Road, Wucheng District, Jinhua, 321000, People’s Republic of China
Dingjun Wang

Authors

Xiaoguo Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Dawei Wang
View author publications
You can also search for this author in PubMed Google Scholar
Jiang Shao
View author publications
You can also search for this author in PubMed Google Scholar
Song Tian
View author publications
You can also search for this author in PubMed Google Scholar
Weixiong Tan
View author publications
You can also search for this author in PubMed Google Scholar
Yan Ma
View author publications
You can also search for this author in PubMed Google Scholar
Qingnan Xu
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoman Ma
View author publications
You can also search for this author in PubMed Google Scholar
Dasheng Li
View author publications
You can also search for this author in PubMed Google Scholar
Jun Chai
View author publications
You can also search for this author in PubMed Google Scholar
Dingjun Wang
View author publications
You can also search for this author in PubMed Google Scholar
Wenwen Liu
View author publications
You can also search for this author in PubMed Google Scholar
Lingbo Lin
View author publications
You can also search for this author in PubMed Google Scholar
Jiangfen Wu
View author publications
You can also search for this author in PubMed Google Scholar
Chen Xia
View author publications
You can also search for this author in PubMed Google Scholar
Zhongfa Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

X.Z., D.W., and Z.Z. designed the study and prepared the main manuscript text. Y.M., Q.X., X.M., D.L., J.C., and L.L. collected patient data and provided clinical expertise. S.T., W.T., X.Z., and D.W. were responsible for modeling and testing. J.S. and W.L. participated in the reader study. DJ.W. was responsible for identifying CT imaging manifestations. D.W., J.W., and C.X. further polished the manuscript. All authors reviewed the manuscript.

Corresponding author

Correspondence to Zhongfa Zhang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1.

Supplementary Information 2.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zhang, X., Wang, D., Shao, J. et al. A deep learning integrated radiomics model for identification of coronavirus disease 2019 using computed tomography. Sci Rep 11, 3938 (2021). https://doi.org/10.1038/s41598-021-83237-6

Download citation

Received: 17 July 2020
Accepted: 31 January 2021
Published: 16 February 2021
DOI: https://doi.org/10.1038/s41598-021-83237-6

This article is cited by

Evaluation of the models generated from clinical features and deep learning-based segmentations: Can thoracic CT on admission help us to predict hospitalized COVID-19 patients who will require intensive care?
- Mutlu Gülbay
- Aliye Baştuğ
- Hürrem Bodur
BMC Medical Imaging (2022)
Application of Machine Learning and Deep Learning Techniques for COVID-19 Screening Using Radiological Imaging: A Comprehensive Review
- Asifuzzaman Lasker
- Sk Md Obaidullah
- Kaushik Roy
SN Computer Science (2022)
Facilitating standardized COVID-19 suspicion prediction based on computed tomography radiomics in a multi-demographic setting
- Yeshaswini Nagaraj
- Gonda de Jonge
- Peter M. A. van Ooijen
European Radiology (2022)
Machine learning-based CT radiomics model distinguishes COVID-19 from non-COVID-19 pneumonia
- Hui Juan Chen
- Li Mao
- Feng Chen
BMC Infectious Diseases (2021)
Radiomics-based machine learning differentiates “ground-glass” opacities due to COVID-19 from acute non-COVID-19 lung disease
- Andrea Delli Pizzi
- Antonio Maria Chiarelli
- Massimo Caulo
Scientific Reports (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.