Deep learning-based improved snapshot ensemble technique for COVID-19 chest X-ray classification

P, Samson Anosh Babu; Annavarapu, Chandra Sekhara Rao

doi:10.1007/s10489-021-02199-4

Deep learning-based improved snapshot ensemble technique for COVID-19 chest X-ray classification

Published: 23 March 2021

Volume 51, pages 3104–3120, (2021)
Cite this article

Download PDF

Applied Intelligence Aims and scope Submit manuscript

Deep learning-based improved snapshot ensemble technique for COVID-19 chest X-ray classification

Download PDF

Samson Anosh Babu P¹ &
Chandra Sekhara Rao Annavarapu¹

3618 Accesses
25 Citations
1 Altmetric
Explore all metrics

Abstract

COVID-19 has proven to be a deadly virus, and unfortunately, it triggered a worldwide pandemic. Its detection for further treatment poses a severe threat to researchers, scientists, health professionals, and administrators worldwide. One of the daunting tasks during the pandemic for doctors in radiology is the use of chest X-ray or CT images for COVID-19 diagnosis. Time is required to inspect each report manually. While a CT scan is the better standard, an X-ray is still useful because it is cheaper, faster, and more widely used. To diagnose COVID-19, this paper proposes to use a deep learning-based improved Snapshot Ensemble technique for efficient COVID-19 chest X-ray classification. In addition, the proposed method takes advantage of the transfer learning technique using the ResNet-50 model, which is a pre-trained model. The proposed model uses the publicly accessible COVID-19 chest X-ray dataset consisting of 2905 images, which include COVID-19, viral pneumonia, and normal chest X-ray images. For performance evaluation, the model applied the metrics such as AU-ROC, AU-PR, and Jaccard Index. Furthermore, it also obtained a multi-class micro-average of 97% specificity, 95% f₁-score, and 95% classification accuracy. The obtained results demonstrate that the performance of the proposed method outperformed those of several existing methods. This method appears to be a suitable and efficient approach for COVID-19 chest X-ray classification.

DeepCOVNet Model for COVID-19 Detection Using Chest X-Ray Images

Article 10 April 2023

Vandana Bhattacharjee, Ankita Priya, … Shamama Anwar

Automated COVID-19 Detection from Chest X-Ray Images: A High-Resolution Network (HRNet) Approach

Article 25 May 2021

Sifat Ahmed, Tonmoy Hossain, … Faisal Muhammad Shah

A Novel Deep Learning Model for COVID-19 Detection from Combined Heterogeneous X-ray and CT Chest Images

1 Introduction

The world will remember 2020 as a catastrophic year for humanity. Pneumonia of unknown etiology, which was identified in Wuhan, China in December 2019 [26] with its earliest death reported on 10^th January 2020, has become a pandemic [51] and is rapidly gulping the entire world under its net. The World Health Organization (WHO) named it COVID-19 (Corona Virus Disease-2019), and the virus is also known as SARS-CoV-2 (severe acute respiratory syndrome coronavirus-2) [37]. According to the Johns Hopkins Bloomberg School of Public Health, globally confirmed coronavirus cases reached 20,306,856, with 741,723 deaths recorded as of 12^th August 2020 [28]. Owing to the coronavirus flare-up, the Research and Development wings of various research communities are effectively participating in identifying an effective compelling symptomatic system and vaccination for its treatment [59]. Because the cure is under discovery, it is essential to take sufficient precautionary measures and maximize testing. Owing to the scarcity of test kits for this confirmatory test, the search for alternatives is ongoing. In general, coronavirus side effects range from the usual cold to fever, cough, brevity of breath, intense respiratory issues and multi-organ failure, and death [51]. These are challenging tasks for master clinicians at each medical clinic owing to the limited number of radiologists. Therefore, simple, accurate, and quick models can help conquer this issue and provide convenient help to patients.

Furthermore, the rapid rise of the COVID-19 epidemic has increased the need for expertise and expanded enthusiasm for creating automated discovery systems that depend on artificial intelligence (AI) methods. AI approaches can help acquire accurate results and are useful at eliminating hindrances such as lack of accessible real-time reverse transcription-polymerase chain reaction (RT- PCR) test kits and waiting time of test outcomes [6]. According to Ref. [51], acute respiratory problems are the primary symptoms of COVID-19 that can be detected using chest X-ray (CXR) images. Chest computed tomography (CT) scans can recognize the infection when the symptoms are mild [53]. The use of this information can overcome the constraints of different tools such as the absence of diagnostic kits. While CT scan is a better standard, X-ray is still valuable because it is less expensive, faster, and more widely used [45, 47]. The advantage of using X-ray images is the accessibility of imaging systems at most health centers and laboratories, even in rural areas. In the absence of common side effects, such as fever, the use of X-ray images of the chest has a relatively decent capacity to recognize the illness [63].

In previous studies, several machine learning algorithms have been used to automatically classify digitized chest images [11, 31, 35]. Although the application of machine learning techniques for automatic diagnosis is useful in the clinical field, if there are enough annotated images, deep learning approaches are superior compared to classical machine learning methods [3, 54]. Deep learning allows developing end-to-end models to accomplish guaranteed results using input information without the need for manual feature extraction [30, 32]. Various deep learning approaches have been effectively applied to numerous issues, including skin cancer classification [9, 19], breast cancer identification [7, 10], brain disease classification [56], pneumonia detection using X-ray images of the chest [44], and lung segmentation [14, 52].

Various ensemble learning methods have been proposed to improve the performance of deep learning neural networks. This improvement can be achieved by combining the predictions from multiple models [20]. Ensemble learning combines the predictions from multiple neural network models to reduce predictions’ variance and generalization error. Recently, many ensemble approaches have shown their efficient performance in many fields including ‘classification of rockburst intensity’ [62], ‘motor imagery classification’ [33], ‘cervical histopathology image classification’ [61], and ‘detection of misleading information on COVID-19’ [12]. By observing the advantages of deep learning applications shown in the clinical field, this study proposes a deep learning-based improved Snapshot Ensemble technique for the efficient classification of COVID-19 CXR images.

1.1 Contributions

Below is the list of technical contributions of this study.

This study proposes a deep learning-based improved Snapshot Ensemble technique for COVID-19 CXR classification.
A popular Convolutional Neural Network (CNN) architecture (ResNet50), which is a pre-trained network, is applied by the transfer learning approach.
Data augmentation is implemented to deal with a relatively small number of samples, which prevents the model from over-fitting, to provide efficient performance.
Snapshot Ensemble technique is implemented, which allows using an ensemble of multiple neural networks at no additional training cost.
An improved Snapshot Ensemble algorithm is proposed to enhance model training and accuracy.
The obtained results are evaluated using popular metrics such as AU- ROC, AU- PR, precision, recall, specificity, accuracy, f₁-score, and Jaccard Index.
The proposed model is compared with baseline methods to show the efficiency of COVID-19 CXR classification.

1.2 Roadmap

The rest of this paper is arranged as follows. Section 2 describes the related work. Section 3 discusses the preliminaries of the proposed method including data augmentation, convolution neural network, and Snapshot Ensemble technique. Section 4 describes the proposed improved Snapshot Ensemble technique. Moreover, dataset details and experimental results are reported in Section 5. Section 6 presents the discussion of experimental results. Finally, Section 7 presents the conclusions of this study.

2 Related work

Although this study is not constrained to clinical or biotechnology fields, it includes specialists from different fields (e.g., from AI and data science) to prevent and control the pandemic by providing their specialized perspectives and potential solutions. In previous studies, several methods have been proposed to detect, cure, and predict COVID-19. Different analysis approaches offer models to predict the pandemic’s evolution in specific geographical areas, countries, or create a global model. The models allow us to predict virus behavior, which is used to make future response plans. Hernandez-Matamoros et al. in [24] have analyzed the spread of COVID-19 using an Auto-Regressive Integrated Moving Average (ARIMA) model to show the spread of the pandemic through 6 geographic regions (continents). The model created a relationship between the countries and predicted the spread of the virus, behavior, and geographic region cases. Koushlendra Kumar Singh et al. in [50] have reported a Kalman filter-based short-term prediction model for forecasting COVID-19 using the popular machine learning techniques such as Random Forest and Pearson Correlation. However, the proposed approach is not used to predict the geographic region cases; instead, it is used to classify CXR images.

Among other diagnosis methods, medical images are essential [2, 13, 17, 18, 60]. Recently, CNN has become one of the most mainstream and successful methodologies that uses numerous radiology images to detect COVID-19. CNN’s initial advantage is its capacity to automatically learn features from domain-specific images, unlike the classical machine learning methods. The mainstream system for preparing CNN architecture is to transfer learned knowledge from a pre-trained system that satisfied one undertaking into another assignment [39]. The transfer of knowledge via the fine-tuning mechanism showed outstanding performance in X-ray image classification [3, 15, 49].

Hemdan et al. [23] have utilized a deep learning model called COVIDX-Net to analyze COVID-19 chest X-ray images containing 7 CNN models. Wang et al. [58] have proposed a deep learning CNN model for COVID-19 recognition (COVID-Net) using chest X-rays, which obtained a 92.4% accuracy when classifying COVID-19, non-COVID pneumonia, and normal classes. Apostolopoulos et al. [5] have built a deep transfer learning model using 224 positive COVID-19 images. This model achieved a 98.75% accuracy for binary class and a 93.48% progress rate for multi-class data. Ali Narin et al. [36] have achieved a 98% COVID-19 identification accuracy using CXR images combined with the ResNet50 model.

Recently, Sethy and Behera [48] have proposed a CNN-based model that relies on different ImageNet pre-prepared models to extract high-level features. Those features were fed into SVM as a machine learning classifier to distinguish COVID-19 CXR images. The abovementioned study shows that the combination of the ResNet50 model and SVM-classifier produced useful results. The abovementioned study suggested that transfer learning can separate critical biomarkers that are identified with the COVID-19 disease. Harsh Panwar et al. [40] have developed nCOVnet (i.e., a fast screening method for the detection of COVID-19 by analyzing X-rays), which is a deep neural network method. Rodolfo M. Pereira et al. have developed an RYDLS-20 model [41] using a resampling method. The model used a CXR image database and obtained a 0.65 f₁-score.

Turker Tuncer et al. [57] have applied an automated Residual Exemplar Local Binary Pattern and iterative ReliefF-based method for COVID-19 lung X-ray image classification. A modified deep CNN model has also been proposed by Mohammad Rahimzadeh and Abolfazl Attar [43] for detecting COVID-19 and pneumonia in CXR images by concatenating Xception and ResNet50V2 methods. Asmaa Abbas et al. [1] have proposed another deep CNN model called DeTraC (Decompose, Transfer, and Compose) to classify COVID-19 CXR images and deal with random images by investigating class boundaries. Tulin Ozturk et al. have proposed an automatic COVID-19 detection method (DarkCovidNet) for CXR images using the DarkNet model with a transfer learning approach. Recently, Perumal et al. [42] have presented a transfer learning model to accelerate the prediction process and assist medical professionals in identifying COVID-19 using CT scan and CXR images. However, the tests used to identify COVID-19 are not sufficiently fast. The proposed approach overcomes the limitation of a long testing period using an automated deep learning-based technique. The proposed approach allows for obtaining results in less time, especially during the initial stages of virus development.

3 Preliminaries

This section describes details of the methods used for distinguishing COVID-19 from CXR images.

3.1 Data augmentation

The data imbalance problem makes the model more or less biased towards certain classes [4]. The proposed method uses the data augmentation approach to solve class imbalance, which artificially adds images to fewer categories to equal those of the largest class. The proposed approach randomly chose and copied the images belonging to the class with fewer samples to create duplicate images while resampling. However, because deep neural networks perform better with a large amount of data, data augmentation helps create images that depict its class’s features at every possible angle. Data augmentation ensures that the trained model can predict a class with higher precision at any angle the image is obtained. Different techniques used for data augmentation are as follows:

Randomly rotate images in the range (0 to 180^∘)
Randomly zoom image
Randomly shift images horizontally
Randomly shift images vertically
Randomly flip the images horizontally
Randomly flip the images vertically

3.2 CNN

In general, deep learning approaches uncover the dataset’s highlights, such as images and videos that are hidden in the original data. Among these deep learning techniques, CNNs are widely used for medical image classification [21]. CNNs are feed-forward Artificial Neural Networks (ANN) [25] with alternating convolutional and sub-sampling layers. Profound 2D-CNN has many hidden layers and parameters. It can learn intricate patterns, given that it is trained on a gigantic size of visual database with ground-truth labels. Further, it is a modern architecture that processes high volumes of information with higher accuracy and relatively low computational expense compared to other classification algorithms owing to the efficiency in handling extensive data. Using different filters to identify specific features in images, CNN uses a unique way of image classification. Furthermore, the deep learning model’s relevant filters grasp the more in-depth features and convert them into predetermined features using pooling layers.

3.3 Transfer learning

The transfer learning approach is faster and simple to apply without the requirement for an enormous annotated dataset for training. Accordingly, numerous analysts tend to apply this strategy, particularly in medical imaging. This approach can be accomplished using the following important situations:

“Shallow- tuning” which adapts only the last classification layer to adapt to the new task and freezes the parameters of remaining layers without training;
“Deep- tuning” aims to retrain all of the parameters of the pre-trained model from the end-to-end approach;
“Fine- tuning” intends to continuously train more layers by tuning the learning parameters to achieve a considerable performance boost.

Transfer knowledge via the fine-tuning approach demonstrated exceptional performance in X-ray image classification [49].

3.4 Cyclic learning rate scheduling

To improve results and make the model converge at a global minimum instead of a local minimum, the learning rate should be increased periodically instead of exponentially to determine the optimal learning rate. Cyclic learning rate scheduling makes this possible by cyclically changing the learning rate, which helps the model escape several global minima. In addition, this eliminates the necessity to find an optimal maximum learning rate manually. The utilized cyclic learning rate approach is shown in Fig. 1.

3.5 Snapshot ensemble technique

Ensemble models of neural networks are known to be substantially robust and accurate than individual networks. However, training multiple deep networks for model averaging is computationally expensive. Therefore, ‘Snapshot Ensembling’ has been proposed to ensemble multiple neural networks at no additional training cost with consistent lower error rates [27]. By adopting cyclic learning rate scheduling, Snapshot Ensembling has confirmed its compatibility with diverse network architectures and learning tasks. The proposed approach used this technique to periodically save model parameters during training. When the model converges to local minima during a cycle, these parameters are saved, and the learning rate increases to apply another model. This approach allows us to gather an ensemble of models in a single training cycle. Figure 2 shows the illustration of Stochastic Gradient Descent (SGD) optimization (with a typical learning rate schedule) and the illustration of Snapshot Ensemble [27].

4 Proposed method

This section describes the proposed model creation and improved Snapshot Ensemble technique. Figure 3 visualizes the workflow of the proposed method.

4.1 Model creation

The idea behind the transfer learning approach is to use the CNN model, which has been already trained on the ImageNet [46] data. The model is then applied to the lower layers of the proposed model to capture the generic features. In the proposed approach, the higher layers are fine-tuned to our specific domain and redefine the last layer that outputs three values that correspond to three different classes. The proposed approach uses ResNet50 [22] pre-trained architecture owing to its better results. These results were obtained while experimenting with five epochs on different pre-trained architectures. The proposed approach also uses the ‘Adam’ (adaptive moment estimation) optimizer [29] with weight decay to reduce overfitting and obtain the best validation accuracy upon training the data. Adam is one of the latest algorithms in the family of optimizers for model training. It combines two powerful optimizers: RMSProp (Root Mean Square Propagation) and AdaGrad (Adaptive Gradient). Unlike other optimizers, as training proceeds, it uses a different learning rate for every parameter in the network and then adjusts it along with the parameter.

The proposed model is built in Python using Keras Sequential API. In this API, we have to attach one layer to the model at a time. First, the ResNet50 architecture (a pre-trained architecture, which is used to capture generic features) is added to the model. Then, a dropout layer is added, which is a new regularization method. The dropout layer randomly ignores a few nodes from each training sample and makes the model learn features in a distributed way. In addition, it improves the generalization and reduces overfitting. A dense layer with 128 nodes is added, which is a part of a fully connected layer where different features from ResNet50 are converted to provide an output from 128 nodes. Then, a dropout layer is added, followed by the dense output layer with three nodes, which correspond to 3 different CXR types; the net output provides the probability of each class. These layers utilize commonly used ReLU (Rectified Linear Unit) [16] as an activation function, which adds nonlinearity to the model. Furthermore, the Snapshot Ensemble technique is added to the abovementioned model. Table 1 shows the proposed model summary.

Table 1 Proposed model summary

Full size table

The core of Snapshot Ensembling is an optimization process that visits few local minima before converging to the ultimate solution. It gradually saves snapshots at each local minimum and averages their predictions to quickly obtain the global minimum. Nevertheless, in the regular model, we have to travel for a long time to reach a global minimum. Thus, this ensemble model helps us to reach the global minimum in fewer epochs. To converge to multiple local minima, Snapshot Ensemble follows a Cyclic Cosine Annealing schedule [34] as a cyclic learning rate schedule. This method splits the training process into C cycles; each cycle starts with a large learning rate, annealed to a lower learning rate. The learning rate of α for the iteration t is calculated as follows:

$$ \alpha(t) = \frac{\alpha_{0}}{2}(\cos (\frac{\pi mod(t-1,[T/C])}{[T/C]})+1) $$

(1)

Here, α₀ represents the initial learning rate; T represents the total number of training iterations, and C represents the number of cycles.

4.2 Improved snapshot ensemble technique

Improved Snapshot Ensemble technique proposes to consider the weighted average instead of taking the average of probabilities of all models. To choose the weights for different models, random weight initialization is considered. After checking the improvements, new weights are added to the best weights. If there is no improvement, the number of improvements counter increases. We continue this process until the number of improvements counter reaches a specific limit; here, the limit is named patience. Thus, the final accuracy and final improved weights can be obtained. Algorithm 1 represents the pseudo-code of the improved Snapshot Ensemble algorithm.

4.2.1 Improved snapshot ensemble calculation

The proposed Snapshot Ensemble model uses three models. During model execution, three snapshots of the model have been saved. The snapshots produce three different weight probabilities for each model evaluation. An ensemble probability is calculated by taking the ‘average’ of all three probabilities. Further, the weights of the resultant class with the highest probability are used to obtain the ensemble model accuracy. Example illustrations of ‘Snapshot Ensemble model’ and ‘improved Snapshot Ensemble model’ calculations are shown in Tables 2 and 3, respectively. The details of Tables 2 and 3 are as follows:

C₁,...C_n:represents the number of classes
M₁,...M_m:represents the number of models
P_a:represents the predicted class probabilities using the Snapshot Ensemble model and ${\sum }_{i=1}^{n} P_{a_{i}} = 1 $
P_w:represents the predicted class probabilities using the improved Snapshot Ensemble model and ${\sum }_{i=1}^{n} P_{w_{i}} = 1 $
p₁₁,...p_n1:represents the prediction probabilities of model ‘M₁’ for ‘n’ classes
Similarly, p_1m,...p_nm:represents the prediction probabilities of model ‘M_m’ for ‘n’ classes
p₁₁,...p_1m:represents the prediction probabilities of class ‘C₁’ for ‘m’ number of models
Similarly, p_n1,...p_nm:represents the prediction probabilities of class ‘C_n’ for ‘m’ number of models
w₁,...w_m: represents the random weights initialized to ‘m’ number of models for ‘n’ number of classes

Table 2 Illustration of the Snapshot Ensemble model prediction

Full size table

Table 3 Illustration of the improved Snapshot Ensemble model prediction

Full size table

5 Experimental evaluation

This section describes the evaluation metrics, dataset details, and experimental procedure, along with obtained results and comparisons. Test executions are performed using Python and Keras Sequential API.

5.1 Evaluation metrics

After the implementation of the proposed approach, the proposed model performance is evaluated using popular evaluation metrics such as Area Under the Curve (AUC) - Receiver Operating Characteristic (ROC), Area Under Precision-Recall curve (AU-PR), Specificity (or) True Negative Rate (TNR), Precision (or) Positive Predictive Value (PPV), Recall (or) Sensitivity (or) True Positive Rate (TPR), f₁-score (or) F-measure, accuracy, and Jaccard Index. ROC is a 2-dimensional graph that plots between TPR and False Positive Rate (FPR). Similarly, it may be characterized as an exchange between sensitivity and specificity. The ROC curve contains TPR on the Y-axis and FPR on the X-axis. AUC-ROC is most suitable when both classes maintain approximately the same number of samples. In the case of data imbalance, majority samples have a higher impact on the curve than minority samples, which causes a biased result. However, AUC-PR is mostly used for class imbalance problems because it does not consider false positives and false negatives, which produces unbiased results without sample influence. Medical studies require higher AUC results.

The Jaccard Index, which is also known as the Jaccard similarity coefficient, is a statistic that is used to understand similarities between sample sets. The mathematical representation of the Jaccard Index is as follows:

$$ J(M,F)= \frac{|M \cap F|}{|M \cup F|} = \frac{|M \cap F|}{|M|+|F| - |M \cap F|} $$

(2)

where, M and F represent the sample sets (if M and F are both empty, define J(M, F)), and 0 ≤ J(M, F) ≤ 1.

Similarly, the formulae used to evaluate the performance of the model (e.g., TNR, PPV, TPR, f₁-score, and accuracy) are given as follows:

$$ TNR= \frac{TN}{TN+FP} $$

(3)

$$ PPV=\frac{TP}{TP+FP} $$

(4)

$$ TPR= \frac{TP}{TP+FN} $$

(5)

$$ f_{1}-score= 2*\frac{precision*recall}{precision+recall} $$

(6)

$$ Accuracy= \frac{TP+TN}{TP+TN+FP+FN} $$

(7)

Where TP, FP, TN, and FN represent True Positive, False Positive, True Negative, and False Negative in independent datasets, respectively.

5.2 Dataset details

This study uses the collection of 2905 X-ray images from the COVID-19 CXR dataset. The image dataset is a publicly accessible COVID-19 CXR dataset [8], which is derived from the COVID-19 Radiography Database.^{Footnote 1} It contains 219 COVID-19, 1345 viral pneumonia, and 1341 normal CXR images. Table 4 shows the details of the dataset. Figure 4 shows sample images from the X-ray dataset containing COVID-19, viral pneumonia, and normal types.

Table 4 COVID-19 CXR dataset details

Full size table

5.3 Resizing of images and normalization

Keeping the same-size ratio does not result in the loss of information in the image. Because all original images in the dataset have different sizes, a considerable computation time is required to train the data. Therefore, all images in the dataset are resized to the dimension (75 × 100 × 3).

Because each input image pixel value ranges from 0 to 255, and the neural networks do not support the format, the following normalization method is applied to all images.

$$ NI = \frac{(X - \mu)}{\sigma} $$

(8)

Here, NI represents the normalized image; X represents the original image pixel value; μ represents the mean of all corresponding pixel values, and σ represents the standard deviation of all corresponding pixel values.

After normalization, each color code format value changed to the range of -2 to 2, which is preferred by neural networks. The obtained data were further divided into training and testing sets, i.e., 80% and 20%, respectively. The training dataset is divided into training and validation parts at the 80:20 ratio. After analyzing the data, the datasets were determined to be imbalanced. Then, according to Section 3.1, data augmentation was applied only to the final training data. Figure 5 shows the number of training instances of each category obtained after data preprocessing.

5.4 Model training and testing

Testing is necessary to measure the classification accuracy of the testing data. The proposed model’s test accuracy is obtained using different combinations of the epochs and number of ensemble models. The proposed model is trained using an exponential learning rate reducer to improve the test accuracy. Apart from this, to reduce the computation time, the proposed model is trained with combinations of a different number of models and number of epochs so that the number of models × Number of epochs per model = 30, which makes the total number of epochs for the entire model to be 30. After reviewing the abovementioned combinations, the following optimal values are fixed: the number of models = 3, the number of epochs per model = 10, batch size = 10, and the maximum learning rate remains 0.001.

Furthermore, to determine the efficiency of the proposed model, performance evaluation is made for the following models, such as the proposed model (i.e., ResNet50 + improved Snapshot Ensemble + data balance), ResNet50 with data balance, and ResNet50 without data balance, on the CXR dataset. Table 5 shows the performance details of the proposed model on the CXR dataset. The ResNet50 model’s performance with data balance on the CXR dataset is shown in Table 6. Similarly, Table 7 shows the performance of the ResNet50 model without data balance on the CXR dataset. These results show that the proposed model can achieve an overall accuracy of 95% and specificity of 97% for the multi-class CXR dataset. Moreover, the micro average of precision, recall, f₁-score, and Jaccard similarity is determined to be 95%, 95%, 95%, and 91%, respectively. Whereas the ResNet50 model, with data balance, achieved an overall accuracy of 92% for three classes of the same dataset. The micro average of precision, recall, f₁-score, and Jaccard similarity obtained are 92%, 92%, 92%, and 86%, respectively.

Table 5 Test accuracy of the proposed model on the COVID-19 CXR dataset

Full size table

Table 6 Test accuracy of the ResNet50 model with data balance on the COVID-19 CXR dataset

Full size table

Table 7 Test accuracy of the ResNet50 model without data balance on the COVID-19 CXR dataset

Full size table

Similarly, the ResNet50 model without data balance achieved an overall accuracy of 91% for the same dataset. In addition, the micro average of precision, recall, f₁-score, and Jaccard similarity is determined to be 92%, 92%, 92%, and 85%, respectively. Tables 5, 6, and 7 show that the proposed model exhibits an efficient performance compared with ResNet50 with data balance and ResNet50 without data balance. In addition, by observing the proposed model’s class results, the COVID-19 class has achieved a 100% accuracy, 100% specificity, and 99% f₁-score. Moreover, the ensemble accuracy for three models, along with the improved accuracy using the proposed algorithm, are shown in Table 8. The proposed algorithm acquired a multi-class micro average accuracy of 95.18%. Therefore, the proposed model shows the potential to assist in COVID-19 treatment and decision making at critical stages.

Table 8 Improved accuracy with the best weights using the proposed approach

Full size table

In Fig. 6, the AU-ROC and AU-PR curves represent the performance analysis of the proposed model for individual classes such as COVID-19, pneumonia, and normal CXR data. Similarly, in Fig. 7, the AU-ROC and AU-PR curves are plotted for the ResNet50 model with data balance. In Fig. 8, the AU-ROC and AU-PR curves are plotted for the ResNet50 model without data balance. Moreover, the proposed model obtained AU-ROC values of 1.00 for COVID-19, 0.99 for viral pneumonia, and 0.99 for normal data. Similarly, it obtained the AU-PR values of 0.99 for the COVID-19 class, 0.99 for viral pneumonia, and 0.99 for the normal class. These AU-ROC and AU-PR curves show the strength of the proposed approach in dealing with different types of image data.

5.5 Comparison

Table 9 provides a detailed comparison of the proposed model with recent baseline models to demonstrate the effectiveness of the proposed model. All comparisons are made only for the multi-class data. The majority of investigations referenced to compare with the proposed work used the COVID-19 CXR data, which was acquired from different openly accessible sources. The proposed model utilized a total of 2905 CXR images [219 COVID-19 (+), 1345 Viral Pneumonia, and 1341 Normal]. Although many proposed models in the literature showed efficient results while classifying CXR images, the proposed model obtained a multi-class micro-average of 97.16% specificity, 95.23% precision, 95.63% recall, 95.42% f₁-score, and 95.18% classification accuracy.

Table 9 Comparison of the proposed model with baseline models

Full size table

6 Discussion

Because the scarcity of COVID-19 test kits necessitated the need for automated discovery systems that depend on AI methods, the proposed improved Snapshot Ensemble technique utilizes ResNet50 (which is a transfer learning approach) to provide a quick alternative to aid the diagnosis process. Many models described in the literature utilized the advantage of transfer learning technique and ResNet models to achieve better performance results. For example, Xiaowei Xu et al. in [60] obtained an 86.70% accuracy using the combination of ResNet18 model and location-attention mechanism for early screening and distinguishing COVID-19 from influenza-A viral pneumonia (IAVP) and healthy CT images. Further, a deep learning DarkCovidNet [38] model was utilized to detect COVID-19. This approach utilized 1125 CXR images consisting of 125 COVID-19 positive, 500 pneumonia, and 500 no-findings samples to develop the model. This approach showed an accuracy of 87.02%.

Similarly, Rahimzadeh and Attar [43] achieved a 91.40% accuracy using the concatenation of Xception and ResNet50V2 networks. Abbas et al. [1] obtained 93.10% accuracy and 100% recall using the DeTraC model. Furthermore, the COVID-Net [58] model achieved a 93.30% progress rate for COVID-19 detection using radiography pictures. Moreover, Apostolopoulos and Mpesiana developed a transfer learning approach using VGG19 and MobileNet v2 model [5] for a similar reason as COVID-Net. For this experiment, 224 COVID-19 positive, 700 cases of pneumonia, and 504 ordinary radiology pictures were used, and a 93.48% accuracy was achieved.

Furthermore, the studies presented in [55] and [41] have shown excellent performance by achieving greater than 95% accuracy. Most of these prior studies experienced data scarcity during the building of the models. The proposed model is developed to deal with the challenging COVID-19 problem by exploiting data augmentation for the class imbalance problem. Although many existing methods achieved excellent results, the proposed model demonstrated efficient results by achieving a high f₁-score of 95.42% compared to other similar methods applied to CXR images. The proposed method demonstrated its robustness in coping with the limited availability of training data and irregularities in the data distribution. More importantly, the proposed improved Snapshot Ensemble algorithm provides a generic solution to improve the model’s efficiency. Moreover, the utilized metrics (e.g., micro average accuracy, precision, recall, f₁-score, specificity, Jaccard similarity, AU-ROC, and AU-PR) showed excellent results to support the efficient performance of the proposed model.

The proposed method’s significant advantages are as follows. X-ray images are considered owing to the readily available disease diagnosis methods. It is an efficient approach to assist technicians with diagnosing to get fast predictions. CT scan is an expensive and not readily available procedure because this equipment is usually located in big hospitals. It is essential to collect balanced data for better predictions. Here, data augmentation and class balancing are essential in model performance, as has been previously observed. Moreover, another important benefit of the proposed method is that it does not depend on the disease stage; it can be applied even at an early stage.

In addition to classic image processing techniques, pre-defined generative models can be used to improve the model’s performance. The proposed approach used the ReLU activation function, which is the most commonly used activation function. However, it is recommended to try the available advanced activation functions. In the future, the proposed work can be addressed using various sources of data for COVID-19 diagnosis to compare the outputs with the current CXR image outcomes. It is also possible to get local CXR images of COVID-19 patients and evaluate them using the proposed model. After the evaluation, the proposed model can be deployed at local health centers.

7 Conclusion

In this study, a deep learning-based improved Snapshot Ensemble technique is proposed to efficiently classify COVID-19 CXR images. The proposed model takes advantage of the popular transfer learning approach (e.g., ResNet50) for efficient deep feature extraction. The proposed model enhanced the existing Snapshot Ensemble technique by providing an improved Snapshot Ensemble algorithm. The proposed model demonstrated efficient performance in classifying COVID-19, viral pneumonia, and normal CXR images. The model can obtain high micro average multi-class accuracy of 95% (with 97% specificity and 95% f₁-score). The model can also obtain AU-ROC and AU-PR values of 1.00 and 0.99 for the COVID-19 class. Moreover, the model achieved a high f₁-score compared to several modern methods. These results clearly show that this model can assist in COVID-19 treatment and decision making at critical stages.

Notes

https://www.kaggle.com/tawsifurrahman/covid19-radiography-database

References

Abbas A, Abdelsamea MM, Gaber MM (2021) Classification of COVID-19 in chest X-ray images using DeTraC deep convolutional neural network. Appl Intell 51:854–864. https://doi.org/10.1007/s10489-020-01829-7
Article Google Scholar
Alimadadi A, Aryal S, Manandhar I, Munroe PB, Joe B, Cheng X (2020) Artificial intelligence and machine learning to fight COVID-19. Physiol Genomics 52(4):200–202. https://doi.org/10.1152/physiolgenomics.00029.2020. PMID: 32216577
Article Google Scholar
Anthimopoulos M, Christodoulidis S, Ebner L, Christe A, Mougiakakou S (2016) Lung pattern classification for interstitial lung diseases using a deep convolutional neural network. IEEE Trans Med Imaging 35(5):1207–1216. https://doi.org/10.1109/TMI.2016.2535865
Article Google Scholar
Antoniou A, Storkey A, Edwards H (2017) Data augmentation generative adversarial networks. arXiv:1711.04340
Apostolopoulos ID, Mpesiana TA (2020) Covid-19: automatic detection from X-ray images utilizing transfer learning with convolutional neural networks. Phys Eng Sci Med 43:635–640. https://doi.org/10.1007/s13246-020-00865-4
Article Google Scholar
Caobelli F (2020) Artificial intelligence in medical imaging: game over for radiologists? Eur J Radiol 126:108940–108940
Article Google Scholar
Celik Y, Talo M, Yildirim O, Karabatak M, Acharya UR (2020) Automated invasive ductal carcinoma detection based using deep transfer learning with whole-slide images. Pattern Recognit Lett 133:232–239. https://doi.org/10.1016/j.patrec.2020.03.011. http://www.sciencedirect.com/science/article/pii/S0167865520300891 http://www.sciencedirect.com/science/article/pii/S0167865520300891
Article Google Scholar
Chowdhury MEH, Rahman T, Khandakar A, Mazhar R, Kadir MA, Mahbub ZB, Islam KR, Khan MS, Iqbal A, Emadi NA, Reaz MBI, Islam MT (2020) Can AI help in screening viral and COVID-19 pneumonia? IEEE Access 8:132665–132676. https://doi.org/10.1109/ACCESS.2020.3010287
Article Google Scholar
Codella NCF, Nguyen Q, Pankanti S, Gutman DA, Helba B, Halpern AC, Smith JR (2017) Deep learning ensembles for melanoma recognition in dermoscopy images. IBM J Res Develop 61(4/5):5:1–5:15. https://doi.org/10.1147/JRD.2017.2708299
Article Google Scholar
Cruz-Roa A, Basavanhally A, González F, Gilmore H, Feldman M, Ganesan S, Shih N, Tomaszewski J, Madabhushi A (2014) Automatic detection of invasive ductal carcinoma in whole slide images with convolutional neural networks. In: Gurcan MN, Madabhushi A (eds) Medical imaging 2014: digital pathology. https://doi.org/10.1117/12.2043872, vol 9041. International society for optics and photonics, SPIE, pp 1–15
Dandıl E, Çakiroğlu M, Ekşi Z, Özkan M, Kurt ÖK, Canan A (2014) Artificial neural network-based classification system for lung nodules on computed tomography scans. In: 2014 6th international conference of soft computing and pattern recognition (SoCPaR). https://doi.org/10.1109/SOCPAR.2014.7008037, pp 382–386
Elhadad MK, Li KF, Gebali F (2020) An ensemble deep learning technique to detect COVID-19 misleading information. In: International conference on network-based information systems. Springer, pp 163–175
Fanelli D, Piazza F (2020) Analysis and forecast of COVID-19 spreading in China, Italy and France. Chaos, Solitons & Fractals 134:109761. https://doi.org/10.1016/j.chaos.2020.109761. http://www.sciencedirect.com/science/article/pii/S0960077920301636
Article MathSciNet Google Scholar
Gaál G, Maga B, Lukács A (2020) Attention U-net based adversarial architectures for chest X-ray lung segmentation. arXiv:2003.10304
Gao M, Bagci U, Lu L, Wu A, Buty M, Shin HC, Roth H, Papadakis GZ, Depeursinge A, Summers RM, Xu Z, Mollura DJ (2018) Holistic classification of CT attenuation patterns for interstitial lung diseases via deep convolutional neural networks. Computer methods in biomechanics and biomedical engineering. Imaging Visual 6(1):1–6. https://doi.org/10.1080/21681163.2015.1124249. https://europepmc.org/articles/PMC5881940
Google Scholar
Glorot X, Bordes A, Bengio Y (2011) Deep sparse rectifier neural networks. In: Proceedings of the fourteenth international conference on artificial intelligence and statistics, pp 315–323
Gozes O, Frid-Adar M, Greenspan H, Browning PD, Zhang H, Ji W, Bernheim A, Siegel E (2020) Rapid AI development cycle for the coronavirus (COVID-19) pandemic:, Initial results for automated detection & patient monitoring using deep learning CT image analysis. arXiv:2003.05037
Hall LO, Paul R, Goldgof DB, Goldgof GM (2020) Finding COVID-19 from chest X-rays using deep learning on a small dataset. arXiv:2004.02060
Han SS, Kim MS, Lim W, Park GH, Park I, Chang SE (2018) Classification of the clinical images for benign and malignant cutaneous tumors using a deep learning algorithm. J Invest Derma 138(7):1529–1538. https://doi.org/10.1016/j.jid.2018.01.028. http://www.sciencedirect.com/science/article/pii/S0022202X18301118
Article Google Scholar
Hansen LK, Salamon P (1990) Neural network ensembles. IEEE Trans Pattern Anal Machine Intell 12(10):993–1001. https://doi.org/10.1109/34.58871
Article Google Scholar
Haskins G, Kruger U, Yan P (2020) Deep learning in medical image registration: a survey. Mach Vis Appl 31(1):8
Article Google Scholar
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: 2016 IEEE conference on computer vision and pattern recognition (CVPR). https://doi.org/10.1109/CVPR.2016.90, pp 770–778
Hemdan EED, Shouman MA, Karar ME (2020) COVIDX-Net: a framework of deep learning classifiers to diagnose COVID-19 in X-ray images. arXiv:2003.11055
Hernandez-Matamoros A, Fujita H, Hayashi T, Perez-Meana H (2020) Forecasting of COVID19 per regions using ARIMA models and polynomial functions. Applied Soft Computing 96:106610. https://doi.org/10.1016/j.asoc.2020.106610. http://www.sciencedirect.com/science/article/pii/S1568494620305482
Article Google Scholar
Hopfield JJ (1988) Artificial neural networks. IEEE Circ Devices Magazine 4(5):3–10. https://doi.org/10.1109/101.8118
Article Google Scholar
Huang C, Wang Y, Li X, Ren L, Zhao J, Hu Y, Zhang L, Fan G, Xu J, Gu X, Cheng Z, Yu T, Xia J, Wei Y, Wu W, Xie X, Yin W, Li H, Liu M, Xiao Y, Gao H, Guo L, Xie J, Wang G, Jiang R, Gao Z, Jin Q, Wang J, Cao B (2020) Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. The Lancet 395(10223):497–506. https://doi.org/10.1016/S0140-6736(20)30183-5. http://www.sciencedirect.com/science/article/pii/S0140673620301835
Article Google Scholar
Huang G, Li Y, Pleiss G, Liu Z, Hopcroft JE, Weinberger KQ (2017) Snapshot ensembles: train 1, get M for free. In: 5th international conference on learning representations, ICLR 2017, Toulon, France, April 24-26, 2017, Conference Track Proceedings. OpenReview.net. https://openreview.net/forum?id=BJYwwY9ll
JHU (2020) Johns Hopkins University coronavirus resource center. Accessed 12 Aug 2020. https://coronavirus.jhu.edu/
Kingma DP, Ba J (2015) Adam: a method for stochastic optimization. CoRR arXiv:1412.6980
Krizhevsky A, Sutskever I, Hinton GE (2017) Imagenet classification with deep convolutional neural networks. Commun ACM 60(6):84–90. https://doi.org/10.1145/3065386
Article Google Scholar
Kuruvilla J, Gunavathi K (2014) Lung cancer classification using neural networks for CT images. Comput Methods Programs Biomed 113(1):202–209. https://doi.org/10.1016/j.cmpb.2013.10.011. http://www.sciencedirect.com/science/article/pii/S0169260713003532 http://www.sciencedirect.com/science/article/pii/S0169260713003532
Article Google Scholar
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436–444
Article Google Scholar
Lee BH, Jeong JH, Lee SW (2020) Sessionnet: feature similarity-based weighted ensemble learning for motor imagery classification. IEEE Access 8:134524–134535. https://doi.org/10.1109/ACCESS.2020.3011140
Article Google Scholar
Loshchilov I, Hutter F (2016) SGDR: stochastic gradient descent with warm restarts. arXiv:1608.03983
Manikandan T, Bharathi N (2016) Lung cancer detection using fuzzy auto-seed cluster means morphological segmentation and SVM classifier. J Med Syst 40(7):181
Article Google Scholar
Narin A, Kaya C, Pamuk Z (2020) Automatic detection of coronavirus disease (COVID-19) using X-ray images and deep convolutional neural networks. arXiv archivePrefix
Organization WH (2020) Naming the coronavirus disease (covid-19) and the virus that causes it. Accessed 02 Nov 2020. https://www.who.int/emergencies/diseases/novel-coronavirus-2019/technical-guidance/naming-the-coronavirus-disease-(covid-2019)-and-the-virus-that-causes-it https://www.who.int/emergencies/diseases/novel-coronavirus-2019/technical-guidance/naming-the-coronavirus-disease-(covid-2019)-and-the-virus-that-causes-it https://www.who.int/emergencies/diseases/novel-coronavirus-2019/technical-guidance/naming-the-coronavirus-disease-(covid-2019)-and-the-virus-that-causes-it
Ozturk T, Talo M, Yildirim EA, Baloglu UB, Yildirim O, Rajendra Acharya U (2020) Automated detection of COVID-19 cases using deep neural networks with X-ray images. Comput Biol Med 121:103792. https://doi.org/10.1016/j.compbiomed.2020.103792. http://www.sciencedirect.com/science/article/pii/S0010482520301621
Article Google Scholar
Pan SJ, Yang Q (2010) A survey on transfer learning. IEEE Trans Knowl Data Eng 22 (10):1345–1359. https://doi.org/10.1109/TKDE.2009.191
Article Google Scholar
Panwar H, Gupta P, Siddiqui MK, Morales-Menendez R, Singh V (2020) Application of deep learning for fast detection of COVID-19 in X-Rays using nCOVnet. Chaos, Solitons & Fractals 138:109944. https://doi.org/10.1016/j.chaos.2020.109944. http://www.sciencedirect.com/science/article/pii/S096007792030343X
Article MathSciNet Google Scholar
Pereira RM, Bertolini D, Teixeira LO, Silla CN, Costa YM (2020) COVID-19 identification in chest X-ray images on flat and hierarchical classification scenarios. Comput Methods Programs Biomed 194:105532. https://doi.org/10.1016/j.cmpb.2020.105532. http://www.sciencedirect.com/science/article/pii/S0169260720309664
Article Google Scholar
Perumal V, Narayanan V, Rajasekar SJS (2021) Detection of COVID-19 using CXR and CT images using Transfer Learning and Haralick features. Appl Intell 51:341–358. https://doi.org/10.1007/s10489-020-01831-z
Article Google Scholar
Rahimzadeh M, Attar A (2020) A modified deep convolutional neural network for detecting COVID-19 and pneumonia from chest X-ray images based on the concatenation of Xception and ResNet50V2. Inform Med Unlocked 19:100360. https://doi.org/10.1016/j.imu.2020.100360. http://www.sciencedirect.com/science/article/pii/S2352914820302537
Article Google Scholar
Rajpurkar P, Irvin J, Zhu K, Yang B, Mehta H, Duan T, Ding D, Bagul A, Langlotz C, Shpanskaya K, Lungren MP, Ng AY (2017) CheXNet: radiologist-level pneumonia detection on chest X-rays with deep learning. arXiv archivePrefix
Rubin GD, Ryerson CJ, Haramati LB, Sverzellati N, Kanne JP, Raoof S, Schluger NW, Volpi A, Yim JJ, Martin IBK, Anderson DJ, Kong C, Altes T, Bush A, Desai SR, Goldin O, Goo JM, Humbert M, Inoue Y, Kauczor HU, Luo F, Mazzone PJ, Prokop M, Remy-Jardin M, Richeldi L, Schaefer-Prokop CM, Tomiyama N, Wells AU, Leung AN (2020) The role of chest imaging in patient management during the COVID-19 pandemic: a multinational consensus statement from the fleischner society. Radiology 296(1):172–180. https://doi.org/10.1148/radiol.2020201365. PMID: 32255413
Article Google Scholar
Russakovsky O, Deng J, Su H, Krause J, Satheesh S, Ma S, Huang Z, Karpathy A, Khosla A, Bernstein M, et al. (2015) Imagenet large scale visual recognition challenge. Int J Comput Vision 115(3):211–252
Article MathSciNet Google Scholar
Self WH, Courtney DM, McNaughton CD, Wunderink RG, Kline JA (2013) High discordance of chest X-ray and computed tomography for detection of pulmonary opacities in ED patients: implications for diagnosing pneumonia. Amer J Emerg Med 31(2):401–405. https://doi.org/10.1016/j.ajem.2012.08.041. http://www.sciencedirect.com/science/article/pii/S0735675712004639
Article Google Scholar
Sethy PK, Behera SK (2020) Detection of coronavirus disease (COVID-19) based on deep features. Preprints 2020030300 :2020
Google Scholar
Shin H, Roth HR, Gao M, Lu L, Xu Z, Nogues I, Yao J, Mollura D, Summers RM (2016) Deep convolutional neural networks for computer-aided detection: CNN architectures, dataset characteristics and transfer learning. IEEE Trans Med Imaging 35(5):1285–1298. https://doi.org/10.1109/TMI.2016.2528162
Article Google Scholar
Singh KK, Kumar S, Dixit P, Bajpai MK (2020) Kalman filter based short term prediction model for COVID-19 spread. Appl Intell
Sohrabi C, Alsafi Z, O’Neill N, Khan M, Kerwan A, Al-Jabir A, Iosifidis C, Agha R (2020) World Health Organization declares global emergency: a review of the 2019 novel coronavirus (COVID-19). Int J Surg 76:71–76. https://doi.org/10.1016/j.ijsu.2020.02.034. http://www.sciencedirect.com/science/article/pii/S1743919120301977
Article Google Scholar
Souza JC, Bandeira Diniz JO, Ferreira JL, França da Silva GL, Corrêa Silva A, de Paiva AC (2019) An automatic method for lung segmentation and reconstruction in chest X-ray using deep neural networks. Comput Methods Programs Biomed 177:285–296. https://doi.org/10.1016/j.cmpb.2019.06.005. http://www.sciencedirect.com/science/article/pii/S0169260719303517
Article Google Scholar
Sun D, Li H, Lu XX et al (2020) Clinical features of severe pediatric patients with coronavirus disease 2019 in Wuhan: a single center’s observational study. World J Pediatr 16:251–259. https://doi.org/10.1007/s12519-020-00354-4
Article Google Scholar
Sun W, Zheng B, Qian W (2016) Computer aided lung cancer diagnosis with deep learning algorithms. In: Tourassi GD, III SGA (eds) Medical imaging 2016: computer-aided diagnosis. https://doi.org/10.1117/12.2216307, vol 9785. International Society for Optics and Photonics, SPIE, pp 241–248
Tahir A, Qiblawey Y, Khandakar A, Rahman T, Khurshid U, Musharavati F, Kiranyaz S, Chowdhury ME (2020) Coronavirus: comparing COVID-19. SARS and MERS in the eyes of AI. arXiv:2005.11524
Talo M, Yildirim O, Baloglu UB, Aydin G, Acharya UR (2019) Convolutional neural networks for multi-class brain disease detection using MRI images. Computerized Medical Imaging and Graphics 78:101673. https://doi.org/10.1016/j.compmedimag.2019.101673. http://www.sciencedirect.com/science/article/pii/S0895611119300886
Article Google Scholar
Tuncer T, Dogan S, Ozyurt F (2020) An automated residual exemplar local binary pattern and iterative ReliefF based COVID-19 detection method using chest X-ray image. Chemomet Intell Labor Syst 203:104054. https://doi.org/10.1016/j.chemolab.2020.104054. http://www.sciencedirect.com/science/article/pii/S0169743920301970
Article Google Scholar
Wang L, Wong A (2020) COVID-net: a tailored deep convolutional neural network design for detection of COVID-19 cases from chest X-Ray images. arXiv:2003.09871
WHO (2020) World health organization- coronavirus disease (COVID-2019) R & D. Accessed 24 June 2020. https://www.who.int/teams/blueprint/covid-19
Xu X, Jiang X, Ma C, Du P, Li X, Lv S, Yu L, Ni Q, Chen Y, Su J, Lang G, Li Y, Zhao H, Liu J, Xu K, Ruan L, Sheng J, Qiu Y, Wu W, Liang T, Li L (2020) A deep learning system to screen novel coronavirus disease 2019 pneumonia. Engineering. https://doi.org/10.1016/j.eng.2020.04.010. http://www.sciencedirect.com/science/article/pii/S2095809920301636
Xue D, Zhou X, Li C, Yao Y, Rahaman MM, Zhang J, Chen H, Zhang J, Qi S, Sun H (2020) An application of transfer learning and ensemble learning techniques for cervical histopathology image classification. IEEE Access 8:104603–104618. https://doi.org/10.1109/ACCESS.2020.2999816
Article Google Scholar
Zhang J, Wang Y, Sun Y, Li G (2020) Strength of ensemble learning in multiclass classification of rockburst intensity. Int J Numer Anal Methods Geomech 44(13):1833–1853
Article Google Scholar
Zu ZY, Jiang MD, Xu PP, Chen W, Ni QQ, Lu GM, Zhang LJ (2020) Coronavirus disease 2019 (COVID-19): a perspective from China. Radiology 296(2):E15–E25
Article Google Scholar

Download references

Funding

This study was not funded by any grants.

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Indian Institute of Technology (ISM), Dhanbad, 826004, India
Samson Anosh Babu P & Chandra Sekhara Rao Annavarapu

Authors

Samson Anosh Babu P
View author publications
You can also search for this author in PubMed Google Scholar
Chandra Sekhara Rao Annavarapu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chandra Sekhara Rao Annavarapu.

Ethics declarations

Conflict of Interests

The authors declare that they have no conflicts of interest.

Ethical Approval

We further confirm that any aspect of the work covered in this manuscript has not involved human patients.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article belongs to the Topical Collection: Artificial Intelligence Applications for COVID-19, Detection, Control, Prediction, and Diagnosis

Rights and permissions

Reprints and permissions

About this article

Cite this article

P, S.A.B., Annavarapu, C.S.R. Deep learning-based improved snapshot ensemble technique for COVID-19 chest X-ray classification. Appl Intell 51, 3104–3120 (2021). https://doi.org/10.1007/s10489-021-02199-4

Download citation

Accepted: 06 January 2021
Published: 23 March 2021
Issue Date: May 2021
DOI: https://doi.org/10.1007/s10489-021-02199-4

Deep learning-based improved snapshot ensemble technique for COVID-19 chest X-ray classification

Abstract

Similar content being viewed by others

DeepCOVNet Model for COVID-19 Detection Using Chest X-Ray Images

Automated COVID-19 Detection from Chest X-Ray Images: A High-Resolution Network (HRNet) Approach

A Novel Deep Learning Model for COVID-19 Detection from Combined Heterogeneous X-ray and CT Chest Images

1 Introduction

1.1 Contributions

1.2 Roadmap

2 Related work

3 Preliminaries

3.1 Data augmentation

3.2 CNN

3.3 Transfer learning

3.4 Cyclic learning rate scheduling

3.5 Snapshot ensemble technique

4 Proposed method

4.1 Model creation

4.2 Improved snapshot ensemble technique

4.2.1 Improved snapshot ensemble calculation

5 Experimental evaluation

5.1 Evaluation metrics

5.2 Dataset details

5.3 Resizing of images and normalization

5.4 Model training and testing

5.5 Comparison

6 Discussion

7 Conclusion

Notes

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of Interests

Ethical Approval

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation