To read this content please select one of the options below:

Retrospective and prospective approaches of coronavirus publications in the last half-century: a Latent Dirichlet allocation analysis

Farshid Danesh (Information Management Research Department, Regional Information Center for Science and Technology, Shiraz, Iran)
Meisam Dastani (Gonabad University of Medical Sciences, Gonabad, Iran)
Mohammad Ghorbani (Gonabad University of Medical Sciences, Gonabad, Iran)

Library Hi Tech

ISSN: 0737-8831

Article publication date: 1 April 2021

Issue publication date: 13 September 2021

2571

Abstract

Purpose

The present article's primary purpose is the topic modeling of the global coronavirus publications in the last 50 years.

Design/methodology/approach

The present study is applied research that has been conducted using text mining. The statistical population is the coronavirus publications that have been collected from the Web of Science Core Collection (1970–2020). The main keywords were extracted from the Medical Subject Heading browser to design the search strategy. Latent Dirichlet allocation and Python programming language were applied to analyze the data and implement the text mining algorithms of topic modeling.

Findings

The findings indicated that the SARS, science, protein, MERS, veterinary, cell, human, RNA, medicine and virology are the most important keywords in the global coronavirus publications. Also, eight important topics were identified in the global coronavirus publications by implementing the topic modeling algorithm. The highest number of publications were respectively on the following topics: “structure and proteomics,” “Cell signaling and immune response,” “clinical presentation and detection,” “Gene sequence and genomics,” “Diagnosis tests,” “vaccine and immune response and outbreak,” “Epidemiology and Transmission” and “gastrointestinal tissue.”

Originality/value

The originality of this article can be considered in three ways. First, text mining and Latent Dirichlet allocation were applied to analyzing coronavirus literature for the first time. Second, coronavirus is mentioned as a hot topic of research. Finally, in addition to the retrospective approaches to 50 years of data collection and analysis, the results can be exploited with prospective approaches to strategic planning and macro-policymaking.

Keywords

Acknowledgements

Researchers appreciate the Vice-Chancellor for Research and Technology of Gonabad University of Medical Sciences for its financial support and the respiratory infectious diseases specialists for their expert advice.

Research funding: The present article is extracted from a research project with code A‐10–1263–2 and research ethics ID. I.R.GMU.REC.1398.189 approved by the Research Council of Gonabad University of Medical Sciences and conducted with financial support.

Citation

Danesh, F., Dastani, M. and Ghorbani, M. (2021), "Retrospective and prospective approaches of coronavirus publications in the last half-century: a Latent Dirichlet allocation analysis", Library Hi Tech, Vol. 39 No. 3, pp. 855-872. https://doi.org/10.1108/LHT-09-2020-0216

Publisher

:

Emerald Publishing Limited

Copyright © 2021, Emerald Publishing Limited

Related articles