RETRACTED ARTICLE: GraphCovidNet: A graph neural network based model for detecting COVID-19 from CT scans and X-rays of chest

Saha, Pritam; Mukherjee, Debadyuti; Singh, Pawan Kumar; Ahmadian, Ali; Ferrara, Massimiliano; Sarkar, Ram

doi:10.1038/s41598-021-87523-1

Download PDF

Article
Open access
Published: 15 April 2021

RETRACTED ARTICLE: GraphCovidNet: A graph neural network based model for detecting COVID-19 from CT scans and X-rays of chest

Scientific Reports volume 11, Article number: 8304 (2021) Cite this article

18k Accesses
35 Citations
16 Altmetric
Metrics details

Subjects

This article was retracted on 30 November 2021

This article has been updated

Abstract

COVID-19, a viral infection originated from Wuhan, China has spread across the world and it has currently affected over 115 million people. Although vaccination process has already started, reaching sufficient availability will take time. Considering the impact of this widespread disease, many research attempts have been made by the computer scientists to screen the COVID-19 from Chest X-Rays (CXRs) or Computed Tomography (CT) scans. To this end, we have proposed GraphCovidNet, a Graph Isomorphic Network (GIN) based model which is used to detect COVID-19 from CT-scans and CXRs of the affected patients. Our proposed model only accepts input data in the form of graph as we follow a GIN based architecture. Initially, pre-processing is performed to convert an image data into an undirected graph to consider only the edges instead of the whole image. Our proposed GraphCovidNet model is evaluated on four standard datasets: SARS-COV-2 Ct-Scan dataset, COVID-CT dataset, combination of covid-chestxray-dataset, Chest X-Ray Images (Pneumonia) dataset and CMSC-678-ML-Project dataset. The model shows an impressive accuracy of 99% for all the datasets and its prediction capability becomes 100% accurate for the binary classification problem of detecting COVID-19 scans. Source code of this work can be found at GitHub-link.

Multimodal graph attention network for COVID-19 outcome prediction

Article Open access 09 November 2023

Detection and analysis of COVID-19 in medical images using deep learning techniques

Article Open access 04 October 2021

Development and evaluation of an artificial intelligence system for COVID-19 diagnosis

Article Open access 09 October 2020

Introduction

Recently, Coronavirus (COVID-19) disease has created an unprecedented situation across the world. Severe Acute Respiratory Syndrome coronavirus 2 (SARS-CoV-2), a novel virus enveloped with large single stranded RNA genome¹ is the root cause for this disease². Although this virus is originated from Wuhan in China, in December 2019, later America and several other countries of Europe have severely affected in early days of the year 2020³. According to recent statistics, both America and India have more number of confirmed cases than other affected countries. World Health Organization (WHO)⁴ announced COVID-19 as a global health emergency on January 30, 2020 considering the adverse effects of this situation.

To diagnose the SARS-CoV-2, it has been observed that both CXRs as well as CT-scans are found to be beneficial^5,6. CXR images are more appreciated by the medical practitioners, since it can be obtained easily from the radiology departments. According to radiologists, CXR images help to understand the chest pathology clearly¹. However, CT scans provide high sensitivity, for example, 97% of the positive CT scans are confirmed in a case study in Wuhan⁷. Due to the exponential growth in cases, it is required to develop a automated and fast paced system which can identify COVID-19 from chest CT-scans or CXR images. Figure 1 shows some samples of these CT-scan and CXR images.

SARS-CoV-2 generally affects the lungs and turbid formation of cough around lungs can be detected from CT-scans and CXRs. The usual symptoms of COVID-19 are related to fever, dry cough and tiredness. The severity of COVID-19 symptoms can range from very mild to critical. Some people may show only a few symptoms, and sometimes no symptoms can be observed at all. In some cases, symptoms start worsen mere after a week and frequent shortness of breath and Pneumonia may happen. Elders and people with chronic medical conditions may possess a higher risk of serious illness from COVID-19. Now in case of mild COVID-19, CT-scans and CXR images may be inefficient as the cough clouds may not be prominent. Another failed case for CT-scan and CXR diagnosis may be any other kind of Pneumonia because of the presence of turbid lungs. So, an advanced classification model is needed to classify these CT-scans and CXR properly.

Due to this pandemic, a lot of people have been affected around the world and rapid tests are required with proper treatment and quarantine. Though Real Time Reverse Transcription Polymerase Chain Reaction (RT-PCR) test is the most common way to detect the virus, but the time required to get the results is around 1–2 days¹⁰. So, an automatic and accurate classification method, where CT-scans or CXRs are used, can be considered as an alternative approach in order to deal with this pandemic whose turn around time is significantly less.

Although spread of COVID-19 has started recently, many research works have already been performed by the researchers during this short time span. Since the current problem consists of classifying COVID-19 images, various machine learning as well as deep learning methods have been proposed. In this section, a few works have been mentioned in brief.

Soares et al.⁸ have used an explainable deep learning model called xDNN on the SARS-CoV-2 CT-scan dataset⁸ and have achieved 97.31% accuracy in the binary classification of scans between COVID and Non-COVID. Yang et al.¹¹ have introduced the COVID-CT dataset in their work. Since the total number of images are approximately 700 in the original dataset, they have used segmentation masks for lungs and lesion region to gain additional information from the original images. In their work, Contrastive Self-Supervised Learning (CSSL), an unsupervised learning approach¹² has been implemented to fine-tune the ImageNet pretrained models DenseNet-169¹³ and ResNet-50¹⁴. Overall, they have achieved best accuracy as 89.1%. Pedro et al.¹⁵ have utilized the EfficientNet¹⁶ model along with transfer learning citetranferlearning and have achieved accuracies 87.60% and 98.99% for COVID-CT dataset¹¹ and SARS-CoV-2 CT-scan dataset⁸ respectively. Sharma et al.¹⁷ have applied ResNet¹⁴ on the database consisting of datasets: (i) GitHub COVID-CT dataset¹¹, (ii) COVID dataset provided by Italian Society of Medical and Interventional Radiology¹⁸, (iii) dataset provided by hospitals of Moscow, Russia¹⁹, (iv) dataset provided by SAL Hospital, Ahmedabad, India²⁰ and have obtained almost 91% accuracy.

Elaziz et al.²¹ have used a modified version of Manta-Ray Foraging Optimization (MRFO) for feature selection and later have used K-Nearest neighbor (KNN)²² for classification. They have considered two COVID datasets: (i) combined database of: covid-chestxray-dataset²³ and ChestX-Ray Images (Pneumonia) dataset²⁴, (ii) dataset provided by Chowdhury et al.²⁵. They have achieved 96.09% and 98.09% accuracies on those two datasets respectively. Turkoglu et al.²⁶ have proposed an COVIDetectioNet model where they have used transfer learning on a pre-trained Convolutional Neural Network (CNN)²⁷ called AlexNet. They have used Relief feature selection algorithm from all the layers of the architecture, and for the classification they have used Support Vector Machine (SVM)²⁸. They have conducted their experiments on the combined dataset of: (i) dataset provided by Chowdhury et al.²⁵, (ii) Chest X-RayImages (Pneumonia) dataset²⁴ and (iii) COVID-19 Radiography Database²⁹. Their proposed model has predicted 99.18%. accurate results on the combined dataset. Oh et al.³⁰ have performed semantic segmentation by using an extended fully convolutional (FC)-DenseNet103¹³ and later have used ResNet-18¹⁴ on the combined database of: (i) JSRT dataset³¹, (ii) SCR dataset³², (iii) NLM(MC) dataset³³, (iv) covid-chestxray-dataset²³, (v) CoronaHack dataset³⁴. In their work, 88.9% accurate results have been achieved.

Nour et al.³ have proposed a five-layer CNN model on the COVID-19 radiology database²⁵. This dataset is composed of different benchmark datasets^18,23,35. After extracting features from the proposed CNN model, basic machine learning algorithms KNN²², SVM²⁸ and Decision Tree (DT)³⁶ are applied on the extracted features. State-of-the-art result is achieved using SVM with accuracy 98.97%. Chandra et al.³⁷ have used majority voting based ensemble of five classifiers—SVM²⁸, KNN²², DT³⁶, Artificial Neural Network (ANN)³⁸, Naive Bayes (NB)³⁹ on the database consisting of three publicly available CXR image datasets: covid-chestxray dataset²³, Montgomery dataset⁴⁰, and NIH ChestX-ray14 dataset⁴¹. Among the total 8196 features extracted from all the pre-processed images, 8 are First Order Statistical Features (FOSF)⁴², 88 are Grey Level Co-occurrence Matrix (GLCM)⁴³ based features and the rest 8100 are Histogram of Oriented Gradients (HOG)⁴⁴ features. The proposed classifier ensemble has predicted with 98.06% and 93.41% accuracy for 2 class (normal and abnormal) and 3 class (i.e., normal, COVID-19 and Pneumonia) classification problems respectively.Hemdam et al.⁴⁵ have used seven benchmark image classifier models: VGG19⁴⁶, DenseNet201⁴⁷, InceptionV3⁴⁸, ResNetV2¹⁴, Inception-ResNet-V2⁴⁹, Xception⁵⁰, MobileNetV2⁵¹ on the dataset combined from covid-chestxray-dataset²³ and dataset provided by Dr. Rosebrock⁵². VGG19 and DenseNet201 have provided results with best accuracy as 90%.

Makris et al.⁵³ have used various existing CNN models along with transfer learning on the CXR images collected from sources: covid-chestxray dataset²³ and Chest X-Ray Images dataset by Mooney et al.²⁴. Among all the used models, VGG16 and VGG19⁴⁶ have provided the best accuracy as 95%. Zhong et al.⁵⁴ have used a CNN model based on VGG16⁴⁶ architecture on the database consisted of- covid-chestxray-dataset²³, ChestX-RayImages (Pneumonia) dataset²⁴, Figure 1 COVID-19 Chest X-ray Dataset Initiative dataset⁵⁵ and ActualMed COVID-19 Chest X-ray Dataset Initiative dataset⁵⁶. Finally, they have obtained 87.3 % accurate results on their work. Sun et al.⁶ have proposed an Adaptive Feature Selection guided Deep Forest (AFS-DF) algorithm and have achieved 91.79% accurate results on the CT scan database collected from the Third Hospital of Jilin University, Ruijin Hospital of Shanghai Jiao Tong University, Tongji Hospital of Huazhong University of Science and Technology, Shanghai Public Health Clinical Center of Fudan University, Hangzhou First People’s Hospital of Zhejiang University, and Sichuan University West China Hospital.

Chattopadhyay et al.⁵⁷ have contributed in two-ways in their work in this domain. After extracting deep features from the original image dataset, they have applied a completely novel meta-heuristic feature selection approach named Clustering-based Golden Ratio Optimizer (CGRO). They have conducted the necessary experiments on the SARS-COV-2 Ct-Scan Dataset⁸, COVID-CT dataset¹¹ and Chest X-Ray dataset²⁴ and have achieved the state-of-the-art accuracies of 99.31%, 98.65%, 99.44% respectively.

Sen et al.⁵⁸ have proposed a CNN architecture and bi-stage Feature Selection (FS) approach to extract the most relevant features from the chest CT-scan images. Initially, they have applied a guided FS methodology by employing two filter procedures: (i) Mutual Information (MI), (ii)Relief-F. In the second stage, Dragonfly algorithm (DA) has been used for the further selection of the most relevant features. Finally, SVM has been applied to the overall feature set. The proposed model has been tested on two open-access datasets: SARS-CoV-2⁸ CT images and COVID-CT¹¹ datasets and has got 98.39% and 90.0% accuracies on the said datasets respectively.

Besides classification of CT-scans and CXRs, there are other research fields related to COVID-19. One such field is mask detection. Loey et al.⁵⁹ have used first ResNet50¹⁴ and then an ensemble of DT and SVM for the final classification. They have achieved best results for the SVM classifier with 99.64%, 99.49% and 100% accuracies for the three datasets: e Real-World Masked Face Dataset (RMFD)⁶⁰, the Simulated Masked Face Dataset (SMFD)⁶¹, and the Labeled Faces in the Wild (LFW)⁶² respectively.

From the above mentioned works, it is clear that in most of the cases pre-existing or novel CNN²⁷ models are used as a classifier since this is basically an image classification problem. However, CNN has some limitations, for example, it can be overfitted when there is some class imbalance in the dataset⁶³. On the other hand, Graph Neural Network (GNN)⁶⁴ based models can overcome the problems like: overfitting and class imbalance. From the experimental results found in other fields, it is evident that a GNN based model generally works fast⁶⁵. GNN, a relatively new approach in the field of deep learning domain, is applied for graph classification problems. So, GNN requires input data represented in the form of graph data structure. Whereas, any 2D-CNN model directly accepts a 2D image matrix as input. Therefore, we need a proper technique for mapping an image classification problem to a graph classification one. We have resolved this issue with the help of an appropriate pre-processing technique to convert an image into a graph data. Considering all the advantages and novelties of GNN approach, we have implemented our proposed GraphCovidNet, a Graph Isomorphism Network (GIN)⁶⁶ based model (a special category of GNN) called GraphCovidNet.

The experimental results show that our proposed model performs very well with respect to time-requirement by the model. Our architecture has also performed well for highly class imbalanced dataset due to the injective nature of the aggregation function. The architecture is able to map different graphs into different representations in the embedding space properly. Hence, the proposed model is able to identify the class with a lesser image count perfectly. We have used four publicly available datasets: (i) SARS-COV-2 Ct-Scan Dataset⁸, (ii) COVID-CT dataset¹¹, (iii) 3-class and 4-class datasets under CMSC-678-ML-Project⁹, (iv) combination of two datasets: (1) covid-chestxray-dataset available on GitHub²³, (2) Chest X-Ray Images (Pneumonia) dataset available on Kaggle²⁴. The main contributions of our work can be summarized as follows:

In our work, we have introduced a new classification model, called GraphCovidNet, for screening COVID-19 CT-scan and CXR images.
In the proposed model, we have used GIN as its backbone architecture which falls under a specialized category of GNN. Based on authors’ knowledge, any GNN based architecture has not been used previously in this domain.
We have mapped image classification problem into a graph classification problem with proper pre-processing technique.
We have also reduced the space complexity of our model by considering only the edges of an image instead of the whole image which, in turn, makes our approach computationally inexpensive.
Our approach is not limited to a particular type of input as we have considered both CT-scan and CXR images and we have also worked binary to multi-class classification problem.
Our model has also surpassed the existing state-of-the-art approaches.

Our proposed method is diagrammatically represented in Fig. 2.

Our entire work has several sections that include: (1) Introduction, (2) Results and discussion, (3) Methodology, (4) Conclusion, (5) Data availability and finally, (6) Code availability.

Results and discussion

In our experiments, we have used 5-fold cross-validation for evaluating the model. During each fold, the training is done for 10 epochs. We have used Adam optimizer and stochastic gradient descent (SGD) approach with a learning rate of 0.001 to train our model.

Here we have used five standard evaluation metrics such as Accuracy, Precision, Recall, F1 Score and Receiver Operating Characteristic (ROC) curve to assess our model performance. Table 1 shows the performance results as well as the average time taken for both training and testing in each fold given by our proposed GraphCovidNet model for all the four datasets.

Table 1 Detailed results of the proposed GraphCovidNet model for all the four datasets in terms of some standard evaluation metrics.

Full size table

From Table 1, it is clear that the GraphCovidNet model has achieved at least 99% accuracy for all the datasets, whereas it gives 100% accuracy for the 2-class datasets. Generally, with increase in number of classes, our proposed model’s prediction capability drops from 100 to 99%. One notable point is that our proposed model provides nearly perfect (99.84%) accuracy for the heavily class imbalanced combined database of- covid-chestxray-dataset, Chest X-Ray Images (Pneumonia) dataset. Intuitively it can be said that a powerful GNN maps two nodes to the same location only if they have identical sub-trees with identical features on the corresponding nodes. Sub-tree structures are defined recursively via node neighborhoods. Thus, we can reduce our analysis to the question whether a GNN maps two neighborhoods (i.e., two multi-sets) to the same embedding or representation. A maximally powerful GNN would never map two different neighborhoods, i.e., multi-sets of feature vectors to the same representation. This means its aggregation scheme must be injective. Thus, it can be said that a powerful GNN’s aggregation scheme is able to represent injective multi-set functions.

Theorem

Let A : $G \rightarrow R^d$ be a GNN. With a sufficient number of GNN layers, A maps any graphs, say, G1 and G2 such that the Weisfeiler–Lehman test of isomorphism decides as non-isomorphic, to different embeddings if the following conditions hold:

A aggregates and updates node features iteratively with $h_v^{(k)} = \phi (h_v^{(k-1)},f({h_u^{(k-1)} : u \in N(v)})$, where the function,f, which operates on multi-sets, and $\phi$ are injective.
$A's$ graph-level readout, which operates on the multi-set of node features, is injective.

The mathematical proof of the above theorem is already reported in⁶⁶. The GIN follows this theorem. As this network is able to map any two different graphs into different embeddings, which helps to solve the challenging graph isomorphism problem. That is, isomorphic graphs are required to be mapped to the same representation, whereas the non-isomorphic ones to different representations. Due to these reasons, the proposed model even works well on heavily class imbalanced datasets. Based on the data from Table 1, it is also notable that our proposed model takes considerably less time both in training (1–18 min) and testing (0.6–7 s) phases. Less number of epochs is also responsible for such low training time. But again, training loss becomes very less from the very beginning. So, there is no need to consider a large number of epochs for training purpose. We can visualize this low training loss from Fig. 3.

In Fig. 3, it is evident that at the first epoch, accuracy is at least 99%, whereas the loss is barely 0.4 for each of the datasets. Further, training reduces the loss value to almost 0, whereas the classification accuracy remains either almost the same or slightly increases with increasing epoch size. Since the change in loss is more prominent as compared to the change in overall accuracy, however, the accuracy seems constant as seen from Fig. 3. Due to proper pre-processing, the proposed architecture is able to understand the input graphs properly. Thus the loss becomes very low from beginning and training gets completed in at most 10 epochs. To verify more about the goodness of our classification model, we have generated Receiver Operating Characteristic (ROC) curves for each of the datasets which are shown in Fig. 4. Additionally, we have conducted experiments by varying the training to testing ratio from 10% to 90% with an interval of 10%. To have a better visualization, we have generated graphs of training and testing accuracies vs training to testing ratio for each of the datasets which are shown in Fig. 5.

So, from Fig. 4, it is evident that for all kind of training to testing ratios, the GraphCovidNet model predicts at least 95% samples correctly, which is a sign of its robustness. Figure 5 further proves its success as a classifier because the Area Under the Curve (AUC) for each of the ROC curves is 0.97 units at worst. the AUC for both 2-class datasets is 1 unit and ROC is also perfect. In short, the GraphCovidNet model is able to deal with both of the 2-class datasets regardless of the training to testing ratio. We have also conducted experiments on different datasets having equal number of classes for both training and testing purposes. The results of all such training-testing combinations are enlisted in Table 2.

Table 2 Detailed results of the proposed GraphCovidNet model for all combination of different train-test datasets having same number of classes.

Full size table

Table 2 shows that proposed model ensures accuracy above 98% even when training and testing data are from two different sources. Such highly accurate results further confirm the validity of GraphCovidNet.

To further ensure the superiority of our proposed model, we have also compared its performance against some pretrained CNN models such as Inception-ResNet-V2⁴⁹, VGG19⁴⁶, ResNet152¹⁴, DenseNet201⁴⁷, Xception⁵⁰, MobileNetV2⁵¹ for both raw and edge-mapped images. Table 3 shows the accuracies (%) obtained in all the experiments considering the mentioned CNN models.

Table 3 Accuracies(%) obtained by applying Inception-ResNet-V2, VGG19, ResNet152, DenseNet201, Xception, MobileNetV2 models for both raw and edge-mapped images.

Full size table

Comparison between Tables 1 and 3 validates that GraphCovidNet outperforms all these conventional CNN models which gives a more clear view about the robustness of our proposed model.

We have also compared the results of our proposed GraphCovidNet model with some past works done on the chosen datasets. Table 4 demonstrates such comparative results.

Table 4 Comparison of our proposed GraphCovidNet model with some previous works on all the datasets (Oh et al.³⁰, Chandra et al.³⁷, Nour et al.³, Hemdam et al.⁴⁵, Turkoglu et al.²⁶ have combined other dataset; Oh et al.³⁰, Chandra et al.³⁷, Hemdam et al.⁴⁵ have considered the first dataset only; Nour et al.³, Turkoglu et al.²⁶ have considered the second dataset only).

Full size table

From Table 4, it is clear that our proposed approach surpasses all the previous works considered here for comparison in terms of accuracy. Although some of the listed previous works are done on database different or even larger than ours, the GraphCovidNet model still outperforms the ones on the same dataset. Based on our knowledge, there are no previous works performed on the CMSC-678-ML-Project GitHub dataset⁹. Still there are very few works previously done on a 4-class database in the domain of COVID-19 classification. So, we have considered to note down the results of CMSC-678-ML-Project GitHub dataset⁹. Not only that, any deep learning network generally is unable to achieve high accuracy for very less number of input samples such as CMSC-678-ML-Project GitHub dataset⁹. But GraphCovidNet is able to predict with 99% and 99.11% accuracy for its 3-class and 4-class cases respectively as shown in Table 1. So, our proposed model is able to perform very well even in case of datasets having very small number of samples.

In a nutshell, we can say that our proposed model is very accurate, and robust with respect to other existing models.

Methodology

In this section, we have discussed our proposed work along with the proper pre-processing required for COVID-19 image classification. We have also described the benchmark datasets briefly. This section consists of three subsections: (i) Datasets used, (ii) Pre-processing, and (iii) Proposed model.

Datasets used

In our work, we have selected the following four datasets to conduct the experiments individually-

1.
SARS-COV-2 Ct-Scan Dataset⁸, a 2-class CT scan dataset collected by Plamen et al. available on Kaggle.
2.
COVID-CT dataset¹¹, a 2-class CT scan dataset introduced by Yang et al. available on GitHub.
3.
3-class dataset which is consisted of CXR from the two sources-
- covid-chestxray-dataset²³ collected by Cohen et al. available on GitHub.
- Chest X-Ray Images (Pneumonia) dataset²⁴ collected by Mooney et al. available on Kaggle.
For combining these two datasets, we have considered COVID-19 patients’ scans from the covid-chestxray-dataset and normal, Pneumonia patients’ scans from the Chest X-Ray Images (Pneumonia) dataset.
4.
Finally, 3-class and 4-class CXR datasets under the CMSC-678-ML-Project available on GitHub⁹

In this section, at first we look at the datasets used in the present work than training constraints alongside the detailed results of the experiments.

Basically, all the 2-class datasets contain classes of COVID and Non-COVID whereas the 3-class datasets contain: Normal, COVID, and Pneumonia. For the 4-class dataset of CMSC-678-ML Project, there are two separate classes of Pneumonia, which are: Bacterial Pneumonia and Viral Pneumonia. Table 5 illustrates the details of these datasets.

Table 5 Statistical description of all the datasets used for experimentation.

Full size table

Pre-processing

As mentioned earlier, the CT scans or CXRs are first pre-processed in order to apply our proposed GraphCovidNet model. We have considered two stages for pre-processing, which are illustrated as follows:

1.
Edge detection: First, the edges of the raw images are estimated using Prewitt filter⁶⁷.
2.
Graph preparation: Next, these edge maps are converted into graph dataset by proper means.

Now these two stages are explained to have a better understanding of the whole pre-processing part.

Edge detection

Basically, an edge is a region denoting a local change of intensity in an image which means that a local maxima or minima will occur for the change of intensity in the edge region. By applying proper filter on the original image, the edges can be prominent. In our work, we have convoluted the original image matrix with 3*3 Prewitt filter⁶⁷ for both horizontal and vertical edge detection which are defined as: $\begin{bmatrix} -1 &{}&{} -1 &{}&{} -1\\ 0 &{}&{} 0 &{}&{} 0\\ 1 &{}&{} 1 &{}&{} 1\\ \end{bmatrix}$ and $\begin{bmatrix} -1 &{}&{} 0 &{}&{} 1\\ -1 &{}&{} 0 &{}&{} 1\\ -1 &{}&{} 0 &{}&{} 1\\ \end{bmatrix}$ respectively. We have selected Prewitt operator for this experiment because it is easy to implement and it detects the edges quite efficiently⁶⁸. Comparison among the three most popular edge filters: Canny, Sobel and Prewitt applied on a COVID-CT image is shown in Fig. 6. Figure 6 reveals that Sobel filter is the most noisy one, whereas Canny filter produces the least noisy image. Although image produced by Prewitt filter is more noisy than Canny, all edges have different pixel intensity in the case of Prewitt unlike Canny. So choosing pixel value as feature would be wiser for Prewitt filter.

After applying convolution on each 3 $\times$ 3 sub-matrix by both of the horizontal and vertical filters, gradient for each sub-matrix has been evaluated. Since all the images are in grayscale, we have considered that a pixel would be situated in an edge if the magnitude of the gradient crosses halfway i.e., the gradient value is greater than or equal to 128. We can get a more clear view of the edge-detection step from Fig. 7.

Graph preparation

After the Prewitt filter⁶⁷ is applied on an image, each image is converted to graph. The graph preparation is done using a 3-step procedure which is discussed below:

1.
Each pixel having grayscale intensity value greater than or equal to 128 is qualified as a node or a graph vertex. This implies that nodes reside only on the prominent edges of the edge image. Feature of a node consists of the grayscale intensity of the corresponding pixel.
2.
Edge exists between the two nodes which represent neighboring pixels in the original image.
3.
For each image, one graph is formed. This means that all the nodes as well as the edges constructed from a single image belongs to the same graph. The node attributes, which are simply grayscale values, are normalized graph-wise. Finally, normalization is done by subtracting the mean of all attributes under a graph from the original value and then dividing it by the standard deviation.

Since nodes are formed only from edges present in an image instead of the whole image, so less memory is consumed to prepare such data. Since COVID-19 and any kind of Pneumonia scans contain cloudy region for coughs, detected edges would be different as well as the nature of the graph. This difference might be useful later for classification. Overall five kind of datasets are formed to represent the graph data of all the scans, which are-

1.
Node-attribute-dataset: Here the attribute value (in this case the normalized grayscale value) of each node is stored.
2.
Graph-indicator-dataset: Here the graph-id for each node is stored.
3.
Node-label-dataset: Here the class-label for each node is stored. Since this is a graph level classification, each node under same graph would have same label which is actually the class-label for the corresponding graph.
4.
Graph-label-dataset: Here the class-label for each graph is stored.
5.
Adjacency-dataset: Here the adjacency sparse matrix for all the graphs is stored.

Figure 8 summarizes the whole edge-preparation process.

Proposed model

We have introduced our novel approach named as GraphCovidNet, where we have implemented GIN for classification and prediction tasks. So, before we move deeper into the architecture we will briefly discuss about the graphs, GNN and GIN.

Graph neural network

A graph g can be described by set of components, nodes (V) and edges (E) as $g=(V,E)$, where V is the set of vertices and E is the set of edges. The GNN can be used to classify an unlabelled node in a graph, where some nodes in the graph are labeled using a supervised learning technique. Also, it can do graph classification tasks where each graph has its corresponding labels. Now here, we have formed one graph from each labelled image and have used supervised learning to classify these graphs.

Embeddings and graph isomorphism network

In GNN, the nodes of a graph are embedded into a d-dimensional embedded space denoted as $h_v$. These nodes are encoded in such a way that the connected nodes or the nodes which have same neighbors are close to each other in embedded space and vice versa. Every node uses its own feature vector $f_v$ and its neighborhood embedding vector $h_{nev}$ to find out it own embedding vector $h_v$.

GNNs uses the graph structure and node features to learn a representation vector of a node, $f_v$, where each node contains the feature vectors, $f_v$ $\forall$ v $\in$ V and each edge contains the feature vectors, $f_e$, $\forall$ e $\in$ E or the entire graph, $h_g$, where $h_g$ = $Readout({h_v, \forall v \in V})$ , where $h_v$ is the final embeddings of the node V is set of all nodes in the graph g. Now every node defines a computation graph based on its neighborhood i.e., every node has its own neural network architecture⁶⁴. This is shown in Fig. 9.

The model for each node can be of arbitrary length. GNN follows a neighborhood aggregation strategy, where we iteratively update the representation of a node by aggregating representations of its neighbors. Nodes have embeddings at each layer. First layer of node is the input feature of that node and after k iterations of aggregation, a node’s representation captures the structural information within its k-hop network neighborhood. Let $x_v$ be the feature vector of the node and $h_{v}^{0}$ be the initial layer embedding. Now, $h_{v}^{0}$ = $x_v$, initial layer embeddings are equal to feature vectors. Formally, the k-th layer of a GNN is

$a_v^{k} = Aggregate^{(k)}({h_u^{(k-1)} : u \in N(v)}) , Combine^{(k)}(h_v^{(k-1)},a_v^{k})$ where, $h_v^{(k)}$ is the feature vector of node v at the k-th layer and $ha_v^{(k)}$ is the aggregated message from its neighborhoods. N(v) is a set of nodes adjacent to v. The choice of $Aggregate^{(k)}$ ($\cdot$) and $Combine^{(k)}$ ($\cdot$) in GNNs are crucial.

Different architectures for Aggregate function have been proposed. In the pooling variant of GraphSAGE⁶⁹, Aggregate has been calculated as $a_v^{k} = Max({ReLU(b^{(k)} * h_u^{(k-1)}) : u \in N(v)})$ where $b^{(k)}$ is a parameter metrics, and Max represents an element-wise max-pooling. The Combine step could be a concatenation of its neighborhood aggregation and its previous layer’s embedding$\cdot$ $h_v^{(k)} = w^{(k)} * Concat(h_v^{(k)},a_v^{k})$ as in GraphSAGE, here $w^{(k)}$ is a parameter metrics. In Graph Convolutional Networks (GCN)⁷⁰, the element-wise mean pooling is used instead, and the Aggregate and Combine steps are integrated as follows: $h_v^{(k)} = ReLU(W^{(k)} * mean({h_u^{(k-1)}), \forall u \in N(v) })$ .

Mean and max-pooling aggregators are still well-defined multi-set (contains the feature vectors of adjacent nodes of a particular node) functions because they are permutation invariant. But, they are not injective. When performing neighborhood aggregation, the mean(GCN) or max(GraphSage) pooling always obtains the same node representation everywhere. Thus, in this case mean and max pooling aggregators fail to capture any structural information of the graph⁶⁶. GNNs and the Weisfeiler–Lehman (WL) graph isomorphism test⁷¹, a powerful test known to distinguish a broad class of graphs⁷², are very closely connected.

The WL test has aggregated the labels of nodes and their neighborhoods iteratively and then it hashed the aggregated labels into unique new labels. The algorithm decides that two graphs are non-isomorphic if at some iteration the labels of the nodes between the two graphs differ.Each iteration of WL test has been described as follows: FOR ALL vertices v $\in$ g

1.
Compute a hash of $(h_v,h_{v_1},\ldots ,h_{v_n})$ where $h_{v_i}$ are the attributes of the neighbors of vertex v.
2.
Use the computed hash as vertex attribute for v in the next iteration.

The algorithm will terminate when this iteration has converged in terms of unique assignments of hashes to vertices.

The WL test is so powerful due to its injective aggregation update that maps different node neighborhoods to different feature vectors. Our key insight is that a GNN can have as large distinguishable power as the WL test if the GNN’s aggregation scheme is highly expressive and can model injective functions. This task to map any two different graphs to different embedding have implied solving graph isomorphism problem. That is, we want isomorphic graphs to be mapped to the same representation and non-isomorphic ones to different representations. Now, the GIN that satisfies the conditions for WL test and generalizes it and hence achieves maximum discriminative power among GNNs. The k-th layer embedding of GIN is given by: $h_v^{(k)} = MLP^{(k)}((1+\epsilon ^{(k)}) * h_v^{(k)} + \sum _{u \in N(v)} h_u^{(k-1)})$, where MLP stands for Multi Layer Perception and $\epsilon ^{(k)}$ is a floating point value.

Now for node classification, the node representation $h_v^{(k)}$ of the kth layer is used for prediction. For graph classification, the Readout function aggregates node features from the final iteration to obtain the entire graph’s embedding $h_g$ that is given by the following equation :

$h_g = Readout({h_v^{(k}, \forall v \in V})$. After we have got the embedding of the final layer, a supervised learning for node or graph classification (in our case) needs to be performed.

Architecture of our proposed GraphCovidNet model

Our architecture consists of a block of GINConv layer which uses MLP⁶⁶ in its subsequent layers for the neighborhood aggregation. In MLP, we have used a block of sequential layers which consist of a linear layer, then a Rectangular Linear Unit (ReLU) layer, followed by another linear layer. It is shown in Fig. 10.

GINConv layer basically takes two different inputs:

1.
x which is the feature matrix of each node with dimension v*d, where V is the total number of nodes in the graph and d is embedded dimension.
2.
The edge index E has a dimension of 2*L consisting of all edges present in the entire graph in the form of pair (v1, v2), where v1 and v2 are two nodes connected by an edge and L is the total number of edges in the entire graph.

The output of the GINConv layer is passed through ReLU activation function to introduce non-linearity and then we apply a dropout of 0.5 and it is followed by a normalization (norm) layer, which applies layer normalization over a mini-batch of inputs. This output (out1) is passed on to another block of the same GINConv-ReLU-dropout-norm layers whose output is out2. Now, this out2 is passed onto a block which consists of GINConv-ReLU-dropout layers and then it is followed by a global mean pooling layer. After that, a linear layer followed by a dropout layer with dropout rate is equal to 0.5, and then a linear layer with dimension is equal to that of the number of classes of the problem under consideration. Finally, we have used a Log Softmax as the activation function that is used to produce the final probability vector, z. The whole architecture is shown in Fig. 11

$$\begin{aligned} logsoftmax(z_i) = \log \left( \frac{e^{z_i}}{\sum _{j=1}^{c} e^{z_j}}\right) \end{aligned}$$

(1)

where, $z_i$ is the probability of the ith element in the last linear layer vector and $\sum _{j=1}^{c} e^{z_j}$ is the sum of all probability values of all the elements including in the vector for the number of classes. We have used negative log likelihood (nll) function as the objective function for classification which needs to be minimized and can be represented as follows: nll(z) $=$ − $\sum _{i=1}^{c} (y_i * logsoftmax(z_i))$ where, $y_i$ is the ground truth label of the ith graph.

Conclusion

For the past one year, COVID-19 has affected our social and economical lives greatly. In this situation, researchers are focusing on CT scan and CXR images for screening COVID-19 cases of the affected persons. In this paper, we have proposed a novel model, named as GraphCovidNet, which basically deals with classification of COVID-19 or any kind of Pneumonia patients from healthy people. Prewitt filter⁶⁷ has been used in the pre-processing stage which produces the edges of an image. Thus our proposed approach utilizes the memory more optimally than the typical CNN based models. Proposed model performs impressively well over different dataset considered in the present work. For some cases, its prediction accuracy even reaches to 100% and it can easily overcome the problems like overfitting and class imbalance. The proposed model has also outperformed many past models in terms of accuracy, precision, recall and f1-score. In future, we can apply the proposed GraphCovidNet in other COVID-19 or other medical datasets having CT-scans or CXRs. To be precise, GNN based models are applicable in any kind of image classification problems. We have conducted the present experiments using only 10 epochs to build the training model. So in future, we shall try to improve our model’s speed so that it can be trained in very less time even for larger number of samples.

Data availability

No datasets are generated during the current study. The datasets analyzed during this work are made publicly available in this published article.

Code availability

The codes used for this research work are made publicly available in the GitHub repository: https://github.com/debadyuti23/GraphCovidNet.

Change history

30 November 2021
This article has been retracted. Please see the Retraction Notice for more detail: https://doi.org/10.1038/s41598-021-02469-8

References

Goel, T., Murugan, R., Mirjalili, S. & Chakrabartty, D. K. Optconet: An optimized convolutional neural network for an automatic diagnosis of covid-19. Appl. Intell., 1–16 (2020).
https://covid19.who.int (2020).
Nour, M., Cömert, Z. & Polat, K. A novel medical diagnosis model for covid-19 infection detection based on deep features and bayesian optimization. Appl. Soft Comput. 106580. https://doi.org/10.1016/j.asoc.2020.106580 (2020).
https://www.who.int/ (2020).
Wu, J. et al. Chest CT findings in patients with coronavirus disease 2019 and its relationship with clinical features. Investig. Radiol. 55, 257 (2020).
Article CAS Google Scholar
Liang, S., Zhanghao, M. & Fuhua, Y. Adaptive Feature Selection Guided Deep Forest for COVID-19 Classification With Chest CT (IEEE, 2020).
Hope, M. D. et al. A role for CT in COVID-19? What data really tell us so far. Lancet (London, England) 395, 1189–1190 (2020).
Article CAS Google Scholar
Soares, E., Angelov, P., Biaso, S., Higa Froes, M. & Kanda Abe, D. SARS-CoV-2 CT-scan dataset: A large dataset of real patients CT scans for SARS-CoV-2 identification. medRxiv. https://doi.org/10.1101/2020.04.24.20078584 (2020). https://www.medrxiv.org/content/early/2020/05/14/2020.04.24.20078584.full.pdf.
Jamdade, V. CMSC-678-ML-Project. https://github.com/vj2050/Transfer-Learning-COVID-19 (2020).
Perumal, V., Narayanan, V. & Rajasekar, S. J. S. Detection of COVID-19 using CXR and CT images using transfer learning and Haralick features. Appl. Intell. 51, 341–358 (2021).
Article Google Scholar
Yang, X. et al. Covid-CT-dataset: A CT scan dataset about COVID-19. ArXiv e-prints arXiv-2003 (2020).
He, K., Fan, H., Wu, Y., Xie, S. & Girshick, R. Momentum contrast for unsupervised visual representation learning. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 9729–9738 (2020).
Jégou, S., Drozdzal, M., Vazquez, D., Romero, A. & Bengio, Y. The one hundred layers tiramisu: Fully convolutional DenseNets for semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 11–19 (2017).
He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 770–778 (2016).
Silva, P. et al. COVID-19 detection in CT images with deep learning: A voting-based scheme and cross-datasets analysis. Inf. Med. Unlocked 20, 100427 (2020).
Article Google Scholar
Tan, M. & Le, Q. V. Efficientnet: Improving accuracy and efficiency through AutoML and model scaling. arXiv preprint arXiv:1905.11946 (2019).
Sharma, S. Drawing insights from COVID-19-infected patients using CT scan images and machine learning techniques: A study on 200 patients. Environ. Sci. Pollut. Res. 27, 37155–37163 (2020).
Article CAS Google Scholar
Radiology IS of M and I. Italian society of medical and interventional radiology. https://www.sirm.org/category/senza-categoria/covid-19/ (2020).
https://mosmed.ai/en/ (2020).
http://www.salhospital.com/ (2020).
Elaziz, M. A. et al. New machine learning method for image-based diagnosis of COVID-19. PLoS ONE 15, 1–18. https://doi.org/10.1371/journal.pone.0235187 (2020).
Article CAS Google Scholar
Guo, G., Wang, H., Bell, D., Bi, Y. & Greer, K. KNN model-based approach in classification. In OTM Confederated International Conferences ”On the Move to Meaningful Internet Systems”, 986–996 (Springer, 2003).
Cohen, J. P. et al. COVID-19 image data collection: Prospective predictions are the future (2020). arXiv:2006.11988.
Mooney, P. Chest X-ray images (pneumonia) [Online]. https://www.kaggle.com/paultimothymooney/chest-xray-pneumonia, tanggal akses (2018).
Chowdhury, M. E. H. et al. Can AI help in screening viral and COVID-19 pneumonia?. IEEE Access 8, 132665–132676. https://doi.org/10.1109/access.2020.3010287 (2020).
Article Google Scholar
Turkoglu, M. Covidetectionet: COVID-19 diagnosis system based on X-ray images using features selected from pre-learned deep features ensemble. Appl. Intell., 1–14 (2020).
O’Shea, K. & Nash, R. An introduction to convolutional neural networks (2015). arXiv:1511.08458
Cortes, C. & Vapnik, V. Support-vector networks. Mach. Learn. 20, 273–297 (1995).
Article MATH Google Scholar
Rahman, T. COVID-19 Radiography Database. https://www.kaggle.com/tawsifurrahman/covid19-radiography-database (2020).
Oh, Y., Park, S. & Ye, J. C. Deep learning COVID-19 features on CXR using limited training data sets. IEEE Trans. Med. Imaging (2020).
Shiraishi, J. et al. Development of a digital image database for chest radiographs with and without a lung nodule: Receiver operating characteristic analysis of radiologists’ detection of pulmonary nodules. Am. J. Roentgenol. 174, 71–74 (2000).
Article CAS Google Scholar
Van Ginneken, B., Stegmann, M. B. & Loog, M. Segmentation of anatomical structures in chest radiographs using supervised methods: A comparative study on a public database. Med. Image Anal. 10, 19–40 (2006).
Article Google Scholar
Jaeger, S. et al. Two public chest X-ray datasets for computer-aided screening of pulmonary diseases. Quant. Imaging Med. Surg. 4, 475 (2014).
PubMed PubMed Central Google Scholar
Praveen. CoronaHack—Chest X-ray-dataset. https://www.kaggle.com/praveengovi/coronahack-chest-xraydataset (2020).
Kermany, D. S. et al. Identifying medical diagnoses and treatable diseases by image-based deep learning. Cell 172, 1122–1131 (2018).
Article CAS Google Scholar
Rokach, L. & Maimon, O. Decision trees. In Data Mining and Knowledge Discovery Handbook, 165–192 (Springer, 2005).
Chandra, T. B., Verma, K., Singh, B. K., Jain, D. & Netam, S. S. Coronavirus disease (COVID-19) detection in chest X-ray images using majority voting based classifier ensemble. Expert Syst. Appl. 165, 113909 (2020).
Article Google Scholar
Mishra, M. & Srivastava, M. A view of artificial neural network. In 2014 International Conference on Advances in Engineering Technology Research (ICAETR-2014), 1–3. https://doi.org/10.1109/ICAETR.2014.7012785 (2014).
Webb, G. I. Naïve bayes. Encycl. Mach. Learn. 15, 713–714 (2010).
Google Scholar
Candemir, S. et al. Lung segmentation in chest radiographs using anatomical atlases with nonrigid registration. IEEE Trans. Med. Imaging 33, 577–590 (2013).
Article Google Scholar
Wang, X. et al. Chestx-ray8: Hospital-scale chest X-ray database and benchmarks on weakly-supervised classification and localization of common thorax diseases. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2097–2106 (2017).
Srinivasan, G. & Shobha, G. Statistical texture analysis. Proc. World Acad. Sci. Eng. Technol. 36, 1264–1269 (2008).
Google Scholar
Gómez, W., Pereira, W. C. A. & Infantosi, A. F. C. Analysis of co-occurrence texture statistics as a function of gray-level quantization for classifying breast ultrasound. IEEE Trans. Med. Imaging 31, 1889–1899 (2012).
Article Google Scholar
Dalal, N. & Triggs, B. Histograms of oriented gradients for human detection. In 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05), Vol. 1, 886–893. https://doi.org/10.1109/CVPR.2005.177 (2005).
Hemdan, E. E.-D., Shouman, M. A. & Karar, M. E. Covidx-net: A framework of deep learning classifiers to diagnose COVID-19 in X-ray images. arXiv preprint arXiv:2003.11055 (2020).
Simonyan, K. & Zisserman, A. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014).
Huang, G., Liu, Z., Van Der Maaten, L. & Weinberger, K. Q. Densely connected convolutional networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 4700–4708 (2017).
Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J. & Wojna, Z. Rethinking the inception architecture for computer vision. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2818–2826 (2016).
Too, E. C., Yujian, L., Njuki, S. & Yingchun, L. A comparative study of fine-tuning deep learning models for plant disease identification. Comput. Electron. Agric. 161, 272–279 (2019).
Article Google Scholar
Szegedy, C., Ioffe, S., Vanhoucke, V. & Alemi, A. Inception-v4, inception-resnet and the impact of residual connections on learning. arXiv preprint arXiv:1602.07261 (2016).
Chollet, F. Xception: Deep learning with depthwise separable convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 1251–1258 (2017).
Rosebrock, A. https://www.pyimagesearch.com/category/medical/ (2020).
Makris, A., Kontopoulos, I. & Tserpes, K. Covid-19 detection from chest X-ray images using deep learning and convolutional neural networks. In 11th Hellenic Conference on Artificial Intelligence, 60–66 (2020).
Zhong, Y. Using deep convolutional neural networks to diagnose COVID-19 from chest X-ray images. arXiv preprint arXiv:2007.09695 (2020).
Chung, A. Figure 1 COVID-19 chest X-ray data initiative. https://github.com/agchung/Figure1-COVID-chestxray-dataset (2020).
Chung, A. Actualmed COVID-19 chest X-ray data initiative. https://github.com/agchung/Actualmed-COVID-chestxray-dataset (2020).
Chattopadhyay, S., Dey, A., Singh, P. K., Geem, Z. W. & Sarkar, R. COVID-19 detection by optimizing deep residual features with improved clustering-based golden ratio optimizer. Diagnostics 11, 315 (2021).
Article CAS Google Scholar
Sen, S., Saha, S., Chatterjee, S., Mirjalili, S. & Sarkar, R. A bi-stage feature selection approach for COVID-19 prediction using chest CT images. Appl. Intell. (2021).
Loey, M., Manogaran, G., Taha, M. H. N. & Khalifa, N. E. M. A hybrid deep transfer learning model with machine learning methods for face mask detection in the era of the COVID-19 pandemic. Measurement 167, 108288 (2020).
Article Google Scholar
Wang, Z. et al. Masked face recognition dataset and application. arXiv preprint arXiv:2003.09093 (2020).
prajnasb, “observations,” observations. https://github.com/prajnasb/observations (2020).
Learned-Miller, E., Huang, G. B., RoyChowdhury, A., Li, H. & Hua, G. Labeled faces in the wild: A survey. In Advances in Face Detection and Facial Image Analysis, 189–248 (Springer, 2016).
Joshi, S., Verma, D. K., Saxena, G. & Paraye, A. Issues in training a convolutional neural network model for image classification. Springer 1046, 282–293 (2019).
Google Scholar
Scarselli, F., Gori, M., Tsoi, A. C., Hagenbuchner, M. & Monfardini, G. The graph neural network model. IEEE Trans. Neural Netw. 20, 61–80. https://doi.org/10.1109/TNN.2008.2005605 (2009).
Article PubMed Google Scholar
Mondal, R., Mukherjee, D., Singh, P. K., Bhateja, V. & Sarkar, R. A new framework for smartphone sensor based human activity recognition using graph neural network. IEEE Sens. J. (2020).
Xu, K., Hu, W., Leskovec, J. & Jegelka, S. How powerful are graph neural networks? arXiv preprint arXiv:1810.00826 (2018).
Prewitt, J. M. S. Picture Processing and Psychopictorics (Academic Press, New York, 1970).
Google Scholar
Priyam, P., Dey, D. & Shreya, D. P. Edge detection by using canny and Prewitt. Int. J. Sci. Eng. Res. 7 (2016).
Hamilton, W., Ying, Z. & Leskovec, J. Inductive representation learning on large graphs. Adv. Neural Inf. Process. Syst., 1024–1034 (2017).
Berg, R. V. D., Kipf, T. N. & Welling, M. Graph convolutional matrix completion. arXiv preprint arXiv:1706.02263 (2017).
Weisfeiler, B. & Lehman, A. A. A reduction of a graph to a canonical form and an algebra arising during this reduction. Nauchno-Technicheskaya Informatsia 2, 12–16 (1968).
Google Scholar
Babai, L. & Kucera, L. Canonical labelling of graphs in linear average time. In 20th Annual Symposium on Foundations of Computer Science (SFCS 1979), 39–46 (IEEE, 1979).

Download references

Acknowledgements

We would like to thank the CMATER research laboratory of the Computer Science and Engineering Department, Jadavpur University, India for providing us the infrastructural support.

Author information

These authors contributed equally: Pritam Saha and Debadyuti Mukherjee.

Authors and Affiliations

Department of Electrical Engineering, Jadavpur University, Kolkata, 700032, India
Pritam Saha
Department of Computer Science and Engineering, Jadavpur University, Kolkata, 700032, India
Debadyuti Mukherjee & Ram Sarkar
Department of Information Technology, Jadavpur University, Kolkata, 700106, India
Pawan Kumar Singh
Institute of IR 4.0, The National University of Malaysia, Bangi, 43600 UKM, Selangor, Malaysia
Ali Ahmadian
School of Mathematical Sciences, College of Science and Technology, Wenzhou-Kean University, Wenzhou, China
Ali Ahmadian
ICRIOS-The Invernizzi Centre for Research in Innovation, Organization, Strategy and Entrepreneurship, Department of Management and Technology, Bocconi University, Via Sarfatti, 25, 20136, Milan (MI), Italy
Massimiliano Ferrara

Authors

Pritam Saha
View author publications
You can also search for this author in PubMed Google Scholar
Debadyuti Mukherjee
View author publications
You can also search for this author in PubMed Google Scholar
Pawan Kumar Singh
View author publications
You can also search for this author in PubMed Google Scholar
Ali Ahmadian
View author publications
You can also search for this author in PubMed Google Scholar
Massimiliano Ferrara
View author publications
You can also search for this author in PubMed Google Scholar
Ram Sarkar
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

P.K.S. and R.S. conceived the experiment(s); P.S. and D. M. conducted the experiment(s); A.A. and M.F. provided the funding acquisition, supervision and validation of the data; P.S., D.M., P.K.S. and R.S. analysed the results. All authors reviewed the manuscript.

Corresponding author

Correspondence to Ali Ahmadian.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article has been retracted. Please see the retraction notice for more detail:https://doi.org/10.1038/s41598-021-02469-8

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Saha, P., Mukherjee, D., Singh, P.K. et al. RETRACTED ARTICLE: GraphCovidNet: A graph neural network based model for detecting COVID-19 from CT scans and X-rays of chest. Sci Rep 11, 8304 (2021). https://doi.org/10.1038/s41598-021-87523-1

Download citation

Received: 15 December 2020
Accepted: 29 March 2021
Published: 15 April 2021
DOI: https://doi.org/10.1038/s41598-021-87523-1

This article is cited by

Dental Caries diagnosis from bitewing images using convolutional neural networks
- Parsa ForouzeshFar
- Ali Asghar Safaei
- Sedighe Sadat Hashemikamangar
BMC Oral Health (2024)
How intra-source imbalanced datasets impact the performance of deep learning for COVID-19 diagnosis using chest X-ray images
- Zhang Zhang
- Xiaoyong Zhang
- Noriyasu Homma
Scientific Reports (2023)
Biased, wrong and counterfeited evidences published during the COVID-19 pandemic, a systematic review of retracted COVID-19 papers
- Angelo Capodici
- Aurelia Salussolia
- Davide Golinelli
Quality & Quantity (2023)
Combined Cloud-Based Inference System for the Classification of COVID-19 in CT-Scan and X-Ray Images
- Ankit Kumar Dubey
- Krishna Kumar Mohbey
New Generation Computing (2023)
Efficient Deep Neural Networks for Classification of Alzheimer’s Disease and Mild Cognitive Impairment from Scalp EEG Recordings
- Saman Fouladi
- Ali A. Safaei
- M. J. Ebadi
Cognitive Computation (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results and discussion

Theorem

Methodology

Datasets used

Pre-processing

Edge detection

Graph preparation

Proposed model

Graph neural network

Embeddings and graph isomorphism network

Architecture of our proposed GraphCovidNet model

Conclusion

Data availability

Code availability

Change history

30 November 2021

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links