Deep Transfer Learning Pipelines with Apache Spark and Keras TensorFlow combined with Logistic Regression to Detect COVID-19 in Chest CT Images

Authors

  • Houssam BENBRAHIM Engineering Sciences Laboratory, National School of Applied Sciences, Ibn Tofail University, Kenitra, Morocco
  • Hanaa HACHIMI Systems Engineering Laboratory, Sultan Moulay Slimane University, Beni Mellal, Morocco
  • Aouatif AMINE Engineering Sciences Laboratory, National School of Applied Sciences, Ibn Tofail University, Kenitra, Morocco

DOI:

https://doi.org/10.48048/wjst.2021.13109

Keywords:

COVID-19, Deep Transfer Learning Pipelines, CNN, Apache Spark, Logistic Regresion

Abstract

The SARS-CoV-2 (COVID-19) has propagated rapidly around the world, and it became a global pandemic. It has generated a catastrophic effect on public health. Thus, it is crucial to discover positive cases as early as possible to treat touched patients fastly. Chest CT is one of the methods that play a significant role in diagnosing 2019-nCoV acute respiratory disease. The implementation of advanced deep learning techniques combined with radiological imaging can be helpful for the precise detection of the novel coronavirus. It can also be assistive to surmount the difficult situation of the lack of medical skills and specialized doctors in remote regions. This paper presented Deep Transfer Learning Pipelines with Apache Spark and KerasTensorFlow combined with the Logistic Regression algorithm for automatic COVID-19 detection in chest CT images, using Convolutional Neural Network (CNN) based models VGG16, VGG19, and Xception. Our model produced a classification accuracy of 85.64, 84.25, and 82.87 %, respectively, for VGG16, VGG19, and Xception.

HIGHLIGHTS

  • Deep Transfer Learning Pipelines with Apache Spark and Keras TensorFlow combined with Logistic Regression using CT images to screen for Corona Virus Disease (COVID-19)      
  • Automatic detection of  COVID-19 in chest CT images
  • Convolutional Neural Network (CNN) based models VGG16, VGG19, and Xception to predict COVID-19 in Computed Tomography image

GRAPHICAL ABSTRACT

Downloads

Download data is not yet available.

Metrics

Metrics Loading ...

References

JFW Chan, S Yuan, KH Kok, KKW To, H Chu, J Yang, F Xing, J Liu, CCY Yip, RWS Poon, HW Tsoi, SKF Lo, KH Chan, VKM Poon, WM Chan, JD Ip, JP Cai, VCC Cheng, H Chen, CKM Hui and KY Yuen. A familial cluster of pneumonia associated with the 2019 novel coronavirus indicating person-to-person transmission: A study of a family cluster. Lancet 2020; 395, 514-23.

N Zhu, D Zhang, W Wang, X Li, B Yang, J Song, X Zhao, B Huang, W Shi, R Lu, P Niu, F Zhan, X Ma, D Wang, W Xu, G Wu, GF Gao, D Phil and W Tan. A novel coronavirus from patients with pneumonia in China, 2019. N. Engl. J. Med. 2020; 382, 727-33.

World Health Organization, Coronavirus disease (COVID-19), Available at: https://covid19.who.int, accessed September 2020.

Ministry of Health of Morocco. The official portal of Coronavirus in Morocco, Available at: http://www.covidmaroc.ma/Pages/LESINFOAR.aspx, accessed September 2020.

W Wang, J Tang and F Wei. Updated understanding of the outbreak of 2019 novel coronavirus (2019-nCoV) in Wuhan, China. J. Med. Virol. 2020; 92, 441-7.

M Shen, Y Zhou, J Ye, AAA Al-Maskri, Y Kang, S Zeng and S Cai. Recent advances and perspectives of nucleic acid detection for coronavirus. J. Pharm. Anal. 2020; 10, 97-101.

T Ai, Z Yang, H Hou, C Zhan, C Chen, W Lv, Q Tao, Z Sun and L Xia. Correlation of chest CT and RT-PCR testing in coronavirus disease 2019 (COVID-19) in China: A report of 1014 cases. Radiology 2020; 296, E32-E40.

Y Fang, H Zhang, J Xie, M Lin, L Ying, P Pang and W Ji. Sensitivity of chest CT for COVID-19: Comparison to RT-PCR. Radiology 2020; 296, E115-E117.

JP Kanne, BP Little, JH Chung, BM Elicker and LH Ketai. Essentials for radiologists on COVID-19: An update-radiology scientific expert panel. Radiology 2020; 296, E113-E114.

X Xie, Z Zhong, W Zhao, C Zheng, F Wang and J Liu. Chest CT for typical 2019-nCoV pneumonia: Relationship to negative RT-PCR testing. Radiology 2020; 296, E41-E45.

S Wang, B Kang, J Ma, X Zeng, M Xiao, J Guo, M Cai, J Yang, Y Li, X Meng and B Xu. A deep learning algorithm using CT images to screen for Corona Virus Disease (COVID-19). Eur. Radiol. 2021. https://doi.org/10.1007/s00330-021-07715-1.

O Gozes, M Frid-Adar, H Greenspan, PD Browning, H Zhang, W Ji, A Bernheim and E Siegel. Rapid AI development cycle for the coronavirus (covid-19) pandemic: Initial results for automated detection & patient monitoring using deep learning CT image analysis. arXiv preprint 2020. Available at: https://arxiv.org/abs/2003.05037, accessed September 2020.

A Abbas, MM Abdelsamea and MM Gaber. Classification of COVID-19 in chest X-ray images using DeTraC deep convolutional neural network. Appl. Intell. 2021; 51, 854-64.

L Wang and A Wong. COVID-Net: A tailored deep convolutional neural network design for detection of COVID-19 cases from chest X-ray images. Sci. Rep. 2020; 10, 1-12.

H Benbrahim, H Hachimi and A Amine. Deep transfer learning with Apache Spark to detect COVID-19 in chest X-ray images. Rom. J. Inf. Sci. Tech. 2020; 23, S117-S129.

AA Ardakani, AR Kanafi, UR Acharya, N Khadem and A Mohammadi. Application of deep learning technique to manage COVID-19 in routine clinical practice using CT images: Results of 10 convolutional neural networks. Comput. Biol. Med. 2020; 121, 103795.

CM Bishop. Pattern recognition and machine learning. Springer, New York, 2006, p. 738.

Y Anzai. Pattern recognition and machine learning. Elsevier, Amsterdam, 2012, p. 407.

H Wang, C Ma and L Zhou. A brief review of machine learning and its application. In: Proceedings of the International Conference on Information Engineering and Computer Science, Wuhan, China. IEEE. 2009, p. 1-4.

J Watt, R Borhani and A Katsaggelos. Machine learning refined: Foundations, algorithms, and applications, Cambridge University Press, Cambridge, 2016, p. 286.

E Alpaydin. Introduction to machine learning. MIT press, Cambridge, 2020, p. 712.

CYJ Peng, KL Lee and GM Ingersoll. An introduction to logistic regression analysis and reporting. J. Educ. Res. 2002; 96, 3-14.

JI Hoffman. Biostatistics for medical and biomedical practitioners. Academic Press, California, 2015, p. 734.

FE Harrell. Binary logistic regression. In: Regression modeling strategies. Springer, Cham. 2015, p. 219-74.

A Tattersall and MJ Grant. Big data: What is it and why it matters. Health Info. Libr. J. 2016; 33, 89-91.

LV Satyanarayana. A Survey on challenges and advantages in big data. Int. J. Comput. Sci. Inf. Technol. 2015; 6, 115-9.

K Wangand MMH Khan. Performance prediction for apache spark platform. In: Proceedings of the 17th International Conference on High Performance Computing and Communications, 7th International Symposium on Cyberspace Safety and Security, and 12th International Conference on Embedded Software and Systems. IEEE, 2015, p. 166-73.

M Frampton. Mastering apache spark. Packt Publishing, Birmingham, 2015, p. 318.

HP Sahana, MS Sanjana, NM Muddasir and KP Vidyashree. Apache spark methods and techniques in big data: A review. In: Proceedings of the Inventive Communication and Computational Technologies. Springer, Singapore. 2020, p. 721-6.

I Goodfellow, Y Bengio, A Courville and Y Bengio. Deep learning. Vol I. MIT press, Cambridge, 2016, p. 800.

NF Hordri, SS Yuhaniz and SM Shamsuddin. Deep learning and its applications: A review. In: Proceedings of Conference on Postgraduate Annual Research on Informatics Seminar, Kuala Lumpur, 2016.

Y Bengio, A Courville and P Vincent. Representation learning: A review and new perspectives. IEEE Trans. Pattern Anal. Mach. Intell. 2013; 35, 1798-828.

JG Lee, S Jun, YW Cho, H Lee, GB Kim, JB Seo and N Kim. Deep learning in medical imaging: General overview. Korean J. Radiol. 2017; 18, 570-84.

S Rao. Building a data pipeline for deep learning. NetApp 2019, Available at: lenovonetapp.com/pdf/wp-7299.pdf, accessed September 2020.

H El-Amir and M Hamdy. Deep learning pipeline: Building a deep learning model with TensorFlow. Apress, New York, 2019, p. 576.

Databricks. A vision for making deep learning simple, Available at: https://databricks.com/blog/ 2017/06/06/databricks-vision-simplify-large-scale-deep-learning.html, accessed September 2020.

Y Kim. Convolutional neural networks for sentence classification. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing. Doha, Qatar, 2014, p. 1746-51.

A Bhandare, M Bhide, P Gokhale and R Chandavarkar. Applications of convolutional neural networks. Int. J. Comput. Sci. Inf. Technol. 2016; 7, 2206-15.

A Hidaka and T Kurita. Consecutive dimensionality reduction by canonical correlation analysis for visualization of convolutional neural networks. In: Proceedings of the ISCIE International Symposium on Stochastic Systems Theory and its Applications, Kusatsu, Japan, 2017, p. 160-7.

A Khan, A Sohail, U Zahoora and AS Qureshi. A survey of the recent architectures of deep convolutional neural networks. Artif. Intell. Rev. 2020; 53, 1-62.

K Simonyan and A Zisserman. Very deep convolutional networks for large-scale image recognition. In: Proceedings of the International Conference on Learning Representations, California, USA, 2015.

M Jaderberg, K Simonyan, K Kavukcuoglu and A Zisserman. Spatial transformer networks. In: Advances in neural information processing systems. Google DeepMind, London, 2015, p. 2017-25.

F Chollet. Xception: Deep learning with depthwise separable convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Hawaii, USA, 2017, p. 1251-8.

F Zhuang, Z Qi, K Duan, D Xi, Y Zhu, H Zhu, H Xiong and Q He. A comprehensive survey on transfer learning. Proc. IEEE 2020; 109, 43-76.

Z Fuzhen and L Ping. Some applications in transfer learning, Available at: http://www.intsci.ac.cn/users/zhuangfuzhen/TransferApplication.pdf, accessed September 2020.

C Tan, F Sun, T Kong, W Zhang, C Yang and C Liu. A survey on deep transfer learning. In: Proceedings of the International Conference on Artificial Neural Networks. Springer, Cham. 2018, p. 270-9.

Databricks. Featurization for transfer learning, Available at: https://docs.databricks.com/ applications/machine-learning/preprocess-data/transfer-learning-tensorflow.html, accessed September 2020.

M Abadi, A Agarwal, P Barham, E Brevdo, Z Chen, C Citro, GS Corrado, A Davis, J Dean, M Devin, S Ghemawat, I Goodfellow, A Harp, G Irving, M Isard, Y Jia, R Jozefowicz, L Kaiser, M Kudlur, J Levenberg, D Mane, R Monga, S Moore, D Murray, C Olah, M Schuster, J Shlens, B Steiner, I Sutskever, K Talwar, P Tucker, V Vanhoucke, V Vasudevan, F Viegas, O Vinyals, P Warden, M Wattenberg, M Wicke, Y Yu and X Zheng. TensorFlow: Large-scale machine learning on heterogeneous distributed systems. arXiv preprint 2016. Available at: https://arxiv.org/abs/1603.04467, accessed September 2020.

MT Martinez. An overview of Google’s machine intelligence software TensorFlow. Sandia National, New Mexico, 2016.

G Zaccone. Getting started with TensorFlow. Packt Publishing, Birmingham, 2016, p. 180.

NK Manaswi. Understanding and working with Keras. In: Deep learning with applications using python. Springer, California, 2018, p. 31-43.

A Gulli and S Pal. Deep learning with Keras. Packt Publishing, Birmingham, 2017, p. 318.

J Zhao, Y Zhang, X He and P Xie. COVID-CT-Dataset: A CT scan dataset about COVID-19. arXiv preprint 2020. Available at: https://arxiv.org/abs/2003.13865, accessed September 2020.

Github. COVID CT, Available at: https://github.com/UCSD-AI4H/COVID-CT, accessed September 2020.

M Polsinelli, L Cinque and G Placidi. A light CNN for detecting COVID-19 from CT scans of the chest. Pattern Recognit. Lett. 2020; 140, 95-100.

L Sarker, MM Islam, T Hannan and Z Ahmed. COVID-DenseNet: A deep learning architecture to detect COVID-19 from chest radiology images. Preprint 2020. https://doi.org/10.20944/preprints202005.0151.v1

S Wang, YZha, W Li, Q Wu, X Li, M Niu, M Wang, X Qiu, H Li, H Yu, and W Gong. A fully automatic deep learning system for COVID-19 diagnostic and prognostic analysis. Eur. Respir. J. 2020; 56, 2000775.

Downloads

Published

2021-05-25

How to Cite

BENBRAHIM, H. ., HACHIMI, H. ., & AMINE, A. . (2021). Deep Transfer Learning Pipelines with Apache Spark and Keras TensorFlow combined with Logistic Regression to Detect COVID-19 in Chest CT Images. Walailak Journal of Science and Technology (WJST), 18(11), Article 13109 (14 pages). https://doi.org/10.48048/wjst.2021.13109