Next Article in Journal
A Sisterhood of Hope: How China’s Transgender Sex Workers Cope with Intimate Partner Violence
Previous Article in Journal
The Unexpected Holiday Souvenir: The Public Health Risk to UK Travellers from Ticks Acquired Overseas
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Discovering Correlations between the COVID-19 Epidemic Spread and Climate

1
Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China
2
Beijing Institute of Smart City, Beijing University of Technology, Beijing 100124, China
3
Beijing Big Data Center, Beijing 100142, China
4
Green Intelligence Environmental School, Yangtze Normal University, Chongqing 408107, China
5
Institute of Remote Sensing and Geographical Information Systems, School of Earth and Space Sciences, Peking University, Beijing 100871, China
*
Authors to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2020, 17(21), 7958; https://doi.org/10.3390/ijerph17217958
Submission received: 27 September 2020 / Revised: 22 October 2020 / Accepted: 23 October 2020 / Published: 29 October 2020

Abstract

:
The outbreak of Corona Virus Disease 2019 (COVID-19) has affected the lives of people all over the world. It is particularly urgent and important to analyze the epidemic spreading law and support the implementation of epidemic prevention measures. It is found that there is a moderate to high correlations between the number of newly diagnosed cases per day and temperature and relative humidity in countries with more than 10,000 confirmed cases worldwide. In this paper, the correlation between temperature/relative humidity and the number of newly diagnosed cases is obvious. Governments can adjust the epidemic prevention measures according to climate change, which will more effectively control the spread of COVID-19.

1. Introduction

Corona Virus Disease 2019 (COVID-19) is a global pandemic and serious threat to human health which halt the economic activities [1]. The COVID-19 outbreak has become a global public health emergency [2]. Corona Virus Disease 2019 (COVID-19) is pneumonia caused by SARS-CoV-2 infection. Since December 2019, there have been many cases of pneumonia caused by this virus infection all over the world. The COVID-19 pandemic forced many countries to implement full or partial lockdown, causing a substantial reduction in the anthropogenic activities due to being prohibited from outdoor invasion which resulted in less transportation and shutting down of industries [3]. On 11 March 2020, the World Health Organization (WHO) declared COVID-19 a world pandemic [4]. To date, COVID-19 infection has been reported worldwide and may be further transmitted by air travel [5]. Currently, many scholars are conducting research on the spatial spread of COVID-19. For example, Jelodar used LSTM Recurrent Neural Network (RNN) to carry out deep emotional classification of COVID-19 [6]; Alsaeedy used the existing cellular wireless network function to detect COVID-19 risk areas [7].
According to the data on “Baidu COVID-19 Epidemic Real-time Big Data Report”, from 15 February to 22 June, the total number of confirmed cases of the COVID-19 in the United States exceeded 2.31 million, and the cumulative number of deaths exceeded 120,000, reaching 123,053. The cumulative number of confirmed cases of COVID-19 in Brazil exceeded 1.1 million, and the cumulative number of deaths exceeded 50,000, reaching 51,271. As of 22 June, three countries in the southern hemisphere were among the top five in the daily list of newly diagnosed cases. On 22 June, there were 26,616 new cases in Brazil. Therefore, there is an urgent need to evaluate the correlation between the spread of COVID-19 and climate conditions, so as to better support the decision making and response measures and control the further spread of COVID-19.

2. Related Research Work

2.1. Research on Epidemic Situation and Climate

Since 2000, Severe Acute Respiratory Syndrome (SARS), bird flu and other epidemics have broken out successively. The SARS epidemic broke out in Guangdong, China in mid-November 2002, and then spread to other parts of China and the world. Under the action of a series of countermeasures, SARS disappeared in mid-June 2003. SARS lasted for about 7 months, and reached its peak from April to May 2003. Bird flu epidemics mostly occur in winter and spring, and they all end in late spring to early summer. SARS was first discovered in Guangdong and Hong Kong, China in 2003 with the warm climate, where the average temperature from January to February is above 10 °C. The bird flu was first discovered in Guangxi, Hubei and other provinces in 2004 with a suitable climate, while the average temperature from January to February is around 10 °C. The COVID-19 epidemic was first reported in Wuhan, Hubei Province, while the average temperature from January to February was also around 10 °C [8].
In China, the epidemics such as SARS and bird flu have the common characteristics of beginning in winter, ending in summer, and originating in southern China. These viruses have high activity and transmission capacity in low-temperature and high-humidity environment [9]. Some reference believe that the increase temperature will the virus lose its infective activity.
At present, there have been some studies on the correlation between the COVID-19 epidemic spread and climate. Zhu et al. [10] confirmed the highly significant correlation between absolute humidity and daily new COVID-19 cases using Multiple Linear Regression Model, after collecting the daily number of new cases and corresponding climate data from eight regions in four countries in South America. David et al. [11] utilized the generalized additive model (GAM) to explore the linear and non-linear relationship between the annual average temperature compensation and the confirmed COVID-19 cases in the capital city of Brazil. It was found that the daily cumulative number of confirmed cases decreased by 4.8951% when the temperature increased by 1 °C. Goswami [12] used Sen’s Slope and Man-Kendall test and generalized additive regression model (GAM) to detect the impact of daily temperature and relative humidity on incidence rate in India countries. Lowen [13], Barreca [14] and Żuk [15] pointed out that environmental temperature plays an important role in the survival and transmission of viruses.
A large number of studies reveal that temperature and humidity can affect the spread of epidemics, thus prompting this study to further explore the global impact of environmental factors on COVID-19.
At present, all the current literature selected samples are limited to local areas, which may lead to the problem that the conclusions are not universal. Therefore, this study collected the epidemic situation and meteorological data in the high incidence area of global epidemic situation, analyzed the development trend of the epidemic situation on a global scale, including more climatic conditions that may affect the spread of the virus, so as to study the climatic factors affecting the activity of SARS-CoV-2.

2.2. Multiple Regression Analysis

Multiple Linear Regression Models are suitable for scenarios where multiple variables affect single outcome. It can accurately measure the correlation degree and regression fitting degree of each variable, and improve the prediction model effect. In this study, climate factors have an impact on the spread of the epidemic in many aspects. As it was necessary to evaluate the correlation between various climatic factors and the spread of SARA-CoV-2. Multiple Linear Regression Models was selected for analysis.
Multiple Regression Analysis Models have been widely used in various scenarios of COVID-19. Rath [16] used Multiple Linear Regression Model to predict that the number of daily active cases in India would reach 52,290 by 15 August. Ayyoubzadeh et al. [17] used Multiple Linear Regression Method to predict the spread of COVID-19 in Iran, and found that in addition to the incidence of the previous day, factors that can effectively improve the accuracy of the prediction also include hand sanitizer usage and hand washing frequency. Kass et al. [18] analyzed the relationship between Body Mass Index (BMI) and age in patients diagnosed with COVID-19 through Multiple Linear Regression Model, and concluded that obesity may increase the infection rate of COVID-19. Yang et al. [19] estimated the early mortality of COVID-19 by linear regression model, and concluded that the mortality of COVID-19 was lower than that of coronavirus epidemic caused by SARS-CoV and MERS-CoV. Xiong et al. [20] analyzed the correlation between initial CT features and turbidity progression in COVID-19 patients by linear regression models and Spearman correlation coefficient.

3. Data Sources and Research Methods

3.1. Data Sources

The open data sets of confirmed cases published by the Center for System Science and Engineering (CSSE) of Johns Hopkins University were used in this study. The data from all over the world were collated from 22 January, the early stage of the epidemic. Considering that countries with less confirmed epidemic cases might have problems with less obvious climate characteristics, and it is necessary to avoid the problem that too few objects lead to the research results not being universal, 65 countries with more than 10,000 confirmed cases from 22 March to 22 June were selected. The total number of confirmed cases per day is subtracted from that of the previous day to get the new daily number of confirmed cases in each country, so as to reflect the epidemic transmission capacity.
The climate data in this study comes from the daily records of weather stations around the world collected by China Meteorological Data Network (http://data.cma.cn/). We selected high average monthly temperature, low average monthly temperature, sea level pressure, altitude, wind speed, rainfall, dew point temperature and relative humidity as the climatic indicators of each region from 22 March to 22 June during the epidemic periods in each region to reflect the regional weather changes.
In this experiment, 65 countries with more than 10,000 confirmed COVID-19 cases at 24:00 on 22 June were selected as experimental subjects. The number of newly diagnosed cases per day and 8 climate factors were collected for experimental analysis. Set the number of new daily confirmed cases (New) as the dependent variable y, the monthly average maximum temperature during the epidemic periods (Tmax), the monthly average minimum temperature during the epidemic periods (Tmin), sea level pressure (Sea_Pressure), Wind_Speed, Elevation, Rainfall, Dew point temperature (DP), and Relative humidity (Humidity) are respectively Arguments x1, x2, x3, x4, x5, x6, x7, x8. The samples of observation data are shown in Table 1.

3.2. Methodology

We chose the Multiple Linear Regression Analysis Method to analyze the correlation between the number of daily increased confirmed cases in each region and the climate indicators of the region. Firstly, the relevant Multiple Linear Regression Method was used to perform a series of verifications and establish a multiple regression equation. Then the Pearson correlation coefficient was used to evaluate the relative importance of the influence of each independent variable on the dependent variable, i.e., the correlation coefficient between each independent variable and the dependent variable. The linear relationship between them was discovered and the correlation between the observed variables was determined. The advantage of using this method is that the relationship between the variables can be clearly defined and expressed quantitatively, and the influence of different climatic factors on the number of new diagnosed cases per day can be clearly shown.
According to the selected observation variable data, the Multiple Linear Regression Models as shown in Equation (1) can be constructed.
y = β 0 + β 1 x 1 + β 2 x 2 + β 3 x 3 + β 4 x 4 + β 5 x 5 + β 6 x 6 + β 7 x 7 + β 8 x 8 + ε
where y is the dependent variable, which represents the number of newly increased confirmed cases every day in this experiment; x 1 , x 2 , x 3 , x 4 , x 5 , x 6 , x 7 , x 8 are independent variables, respectively representing the monthly average maximum temperature, monthly average minimum temperature, sea level pressure, wind speed, elevation, rainfall, dew point temperature and relative humidity in this experiment. β 0 , β 1 , β 2 , β 3 , β 4 , β 5 , β 6 , β 7 , β 8 is the unknown parameters of the corresponding independent variables; ε is called the error term, which is an unobservable random variable with a mean value of zero and a variance of σ 2 > 0 , and ε N ( 0 , σ 2 ) . The above Linear Regression Model can be used to predict the number of new daily confirmed cases and determine the correlation between each independent variable and the dependent variable. Therefore, for different dates, n groups of different data can be obtained, as shown in Equation (2).
y 1 = β 0 + β 1 x 11 + . . . + β 8 x 18 + ε 1 y 2 = β 0 + β 1 x 21 + . . . + β 8 x 28 + ε 2 y n = β 0 + β 1 x n 1 + . . . + β 8 x n 8 + ε n
where ε 1 , ε 2 , , ε n is independent of each other and ε N ( 0 , σ 2 ) .
The Pearson correlation coefficient is calculated on the basis of the above data. For a single independent variable, the calculation method is shown in Equation (3).
R = i = 1 n ( x i x ¯ ) ( y i y ¯ ) i = 1 n ( x i x ¯ ) 2 i = 1 n ( y i y ¯ ) 2
R (Pearson correlation coefficient) is used to describe the correlation between two groups of different data. When the development trend of two different groups of data presents weak correlation, 0 R < 0.3 . When the development trend of two different groups of data show medium correlation, 0.3 R < 0.6 . And When the development trendof two different groups of data presents high correlation, 0.6 R < 1.0 .
Pearson correlation coefficient was utilized to analyze the correlation between variables, and the correlation coefficient values are shown in Table 2.
In Table 2, R is the correlation coefficient, R 01 is the correlation coefficient between the monthly average maximum temperature (Tmax) and the number of the daily increased confirmed cases (New), and R 10 is the correlation coefficient between the daily increased confirmed cases (New) and the monthly average maximum temperature (Tmax). Because the correlation between the monthly average maximum temperature (Tmax) and the number of the daily increased confirmed cases (New) is equivalent to the correlation between the number of the daily increased confirmed cases (New) and the monthly average maximum temperature (Tmax). Therefore, in the above table, R 01 = R 10 , R 12 = R 21 , and so on. The closer the absolute value of the correlation coefficient to 1, the stronger the correlation between the two observed variables in the region.

3.3. Model Testing

After the establishment of Multiple Linear Regression Model, it is necessary to test the Linear Regression Model. In this experiment, the modified determination coefficient R ¯ 2 is selected to determine the performance of the model.The formula for selecting the modified determination coefficient is shown in Equation (4).
R ¯ 2 = 1 S S E / ( n k 1 ) S S T / ( n 1 )
where SSE represents the Sum of Squares of Residuals, and SST represents the Sum of Squares of Deviations, while nk − 1 represents the degree of freedom of the Sum of Squares of Residuals, and n − 1 is the degree of freedom of the Sum of the Squares of Deviation. By dividing the SSE by their degrees of freedom and dividing the SST by their degrees of freedom, the influence of the number of variables on the coefficient of determination can be suppressed, and the fitting degree of the linear regression model for the relationship between variables can be better reflected. The closer the correction coefficient is to 1, the higher the fitting degree of the relationship between variables is, and the more accurate the model effect is.
R 2 = S S R S S T = i = 1 n y ^ i y ¯ 2 i = 1 n y i y ¯ 2
The difference between the modified determination coefficient R ¯ 2 and the coefficient of determination R 2 is that a penalty term is introduced. The function of the penalty term is that our coefficient of determination can be increased only when variables that are really helpful for analysis are introduced, effectively solving the problem of As the number of independent variables in the model increases, the coefficient of determination also gradually increases.
In order to ensure that there are enough data for model training, 70% of the observation data is selected as the training set, and the remaining 30% of the observation data are used as the test set. The climate data of the day in the test set is input into the establish multiple linear regression model, and the predicted value of the number of new confirmed cases on that day is obtained, and compare with the actual observation value of the day. This is to test the constructed Multiple Linear Regression Model is in line with the actual situation.

4. Experimental Results and Discussion

4.1. Regression Analysis between the Number of New Daily Confirmed Cases and Climate Variables

After data processing with Multiple Linear Regression Models, the fitting function between the number of new daily confirmed cases and each climate variable is obtained. Among the 65 linear regression models (each model for a country), the correlation coefficient of 42 models were greater than 0.5, showing that these models have good fitting effect. Countries with the top six number of confirmed cases as of 22 June were selected for display. Table 3 and Table 4 shows the Multiple Linear Regression Model constructed by the training set of data from United States, Brazil, India, Mexico, South Africa and Peru.
Eight climatic parameters were inputted into the established Multiple Linear Regression Model to calculate the daily number of new confirmed cases. The results were compare with the daily number of new confirmed cases in the observation, to detect whether the Multiple Linear Regression Models can effectively reflect the relationship between the daily number of new confirmed cases and the climate data. The prediction of daily number of new confirmed cases provides a theoretical basis. Figure 1, Figure 2 and Figure 3 show the comparison between test set and model prediction of the United States, Brazil, India, Mexico, South Africa and Peru; The red curve is the observation data of the test set, and the blue curve is the daily increased number of confirmed cases predicted by the constructed multiple regression model.
According to Figure 1, it can be noted that the Multiple Linear Regression Model constructed by the experiment fits with the observation data, and can predict the number of new confirmed cases every day based on the relevant climate data.

4.2. The Correlation between the New Confirmed Case Number and Climate in Different Countries

We explore the relationship between the daily number of new confirmed cases (New) and climate data in different countries, and select the countries with the top six number of confirmed epidemic cases as of 24:00 on 22 June for demonstration. The correlation coefficient between the daily number of the confirmed cases and the climate data is shown in Table 5.
As can be seen in Table 5, the correlation coefficient between New of countries and each climatic variable illustrates significant correlations between New and Tmax, Tmin, Humidity.
  • For Tmax and Tmin, four of the top six countries in the total number of confirmed cases showed a strong correlation, including Brazil, India, Mexico and Peru. In addition, South Africa showed a moderate correlation, while the United States, which had the largest number of confirmed cases, showing a weak correlation.
  • For Humidity, four of the top six countries in the total number of confirmed cases showed strong correlation, including Brazil, India, Mexico and South Africa, while the United States showed moderate correlation, and Peru showed weak correlation.

4.3. The Correlation between the Number of New Daily Confirmed Cases and Various Climate Variables in Different Countries

In the 65 countries selected in this experiment, the experimental results of the correlation coefficient between the New and Various Climate Parameters are shown in Figure 2.
  • The New and Tmax generally show a medium or high correlation. There are 54 countries where the New and Tmax show medium or high correlations, accounting for about 83%. Among them, there are 31, 23, and 11 countries showing high, medium, and low correlations respectively, accounting for about 48%, 35%, and 17% respectively, as shown in Figure 4a.
  • The New and Tmin generally show a medium or high correlation. There are 53 countries where the New and Tmin show medium or high correlations, accounting for about 82%. Among them, there are 31, 22, and 12 countries showing high, medium, and low correlations respectively, accounting for about 48%, 34%, and 18% respectivle, as shown in Figure 4b.
  • There is a low correlation between the New and Sea Level Pressure overall. There are only 14 countries where the New and Sea Level Pressure show medium or high correlations, accounting for about 22%. Among them, there are 5, 9, and 51 countries showing high, medium, and low correlations respectively, accounting for about 8%, 14%, and 78% respectively, as shown in Figure 4c.
  • There is a low correlation between the New and Wind Speed overall. There are only 10 countries where the New and Wind Speed show medium or high correlations, accounting for about 15%. Among them, there are 1, 9, and 55 countries showing high, medium, and low correlations respectively, accounting for about 1%, 14%, and 85% respectively, as shown in Figure 4d.
  • There is a low correlation between the New and Elevation. There is no country where the New and Elevation show medium or high correlation, while all countries show a low correlation, as shown in Figure 4e.
  • There is a low correlation between the New and Rainfall overall. There are only 9 countries where the New and Rainfall show medium or high correlations, accounting for about 14%. Among them, there are 1, 8, and 56 countries showing high, medium, and low correlations respectively, accounting for about 2%, 12%, and 86% respectively, as shown in Figure 4f.
  • The New and the Dew Point Temperature generally show a medium or high correlation. There are 41 countries where the New and the Dew Point Temperature show a medium or high correlations, accounting for about 63%. Among them, there are 23, 18, and 24 countries showing high, medium, and low correlations respectively, accounting for about 35%, 28%, and 37% respectively, as shown in Figure 4g.
  • The New and Relative Humidity generally show a medium or high correlation. There are 46 countries where the New and Relative Humidity show medium or high correlations, accounting for about 71%. Among them, there are 21, 25, and 19 countries showing high, medium, and low correlations respectively, accounting for about 32%, 39%, and 29% respectively, as shown in Figure 4h.
Based on the above analysis, the following further inferences can be drawn:
  • The activity of the COVID-19 have little correlation with Elevation, Sea Level Pressure, Wind Speed, and Rainfall. The New in all selected 65 countries shows low correlations with Elevation. Only 22% of the selected countries have medium or high correlations between the New and Sea Level Pressure, and only 8% of the countries show high correlations. Only 15% of the countries have medium or high correlations between the New and Wind Speed, and only 1% of the countries show high correlations. 14% of the countries have medium or high correlations between the New and Rainfall, and only 2% of the countries show high correlations.
  • The activity of the COVID-19 is correlated with Tmax, Tmin, Dew Point Temperature and, Relative Humidity. 82% of the selected 65 countries have medium or high correlations between the New and Tmax or Tmin, and 48% of them have high correlations. 71% of the countries have medium or high correlations between the New and the Dew Point Temperature or Relative Humidity, and about 32% of them show high correlations.
  • The activity of the COVID-19 is mainly related to Temperature and Humidity. Since both Tmax and Tmin belong to air temperature, and the Dew Point Temperature can be obtained from the Relative Humidity and Temperature [21]. It is inferred that the activity of the COVID-19 is mainly related to temperature and humidity. It is worth noting that the temperature and humidity compared with the correlation between the number of new daily confirmed cases, more countries show medium or high correlations between temperature and the number of new daily confirmed cases. The temperature seems to have a more obvious impact on virus activity. However, the correlation between the number of new daily confirmed cases and humidity should not be ignored. It is necessary to consider the impact of climate factors on the spread of the epidemic in combination with temperature and humidity.

4.4. Geospatial Analysis of the Correlation between the New and Climate Variables

Figure 3, Figure 4, Figure 5 and Figure 6 show the maps of the correlation between New and Tmax, Tmin, Relative Humidity, verifying and demonstrating the analysis and inference above on the whole. In the maps, red color indicates high correlation, yellow color indicates medium correlation, and blue-green color means low correlation, while white color means unselected countries. We select most countries in the world for analysis, trying to reveal the impact of various climate factors on the activity of COVID-19 in the global range, and effectively reduce the risk of erroneous conclusions because the occasional weather conditions in individual countries are similar to the climatic conditions related to virus activity, so that the experimental results are more reliable and universal. Integrating with the spatial visualization analysis in Figure 3, Figure 4, Figure 5 and Figure 6, we further analyze and infer the following points of view:
  • The intervention of epidemic prevention and control measures can restrain the influence of climatic factors on the spread of the epidemic. China, where the epidemic broke out in January, shows a low correlation between the New and various climate factors. In the analysis of Israel, South Korea, and Singapore, which had good epidemic prevention and control, the correlation analysis between the New and various climate factors also shows a low correlation. Therefore, this article speculates that government interventions in epidemic prevention and control measures can effectively reduce the spread of the epidemic and restrain the impact of climate on the spread of the epidemic.
  • At the stage of a pandemic, climate factors are not enough to restrain the spread of the epidemic. As of 22 June, the United States, which has the largest number of confirmed cases of the epidemic, does not have a high correlation between the New and Temperature and Humidity. This article speculates that the United States shows a low correlation because it was already in the pandemic stage. The main reason affecting the number of new confirmed cases daily is the large number of confirmed cases and their interpersonal and social contact. In the case of a large number of confirmed cases and failure to effectively prevent interpersonal communication, Climate factors have a low impact on the transmission speed of the epidemic. The impact of temperature and humidity on the spread of COVID-19 is not enough to completely suppress the pandemic.
  • Temperature and Humidity in tropical areas have a more obvious impact on the spread of the epidemic. Cases of COVID-19 have also been confirmed in the equatorial Africa and Amazon tropical rainforest. Countries in the relevant regions such as Brazil, Colombia, Ecuador, Nigeria and other countries have reported moderate or above correlation between the numbers of new confirmed cases and temperature, humidity. It can also be seen from the map that more countries in the equator and South America show medium or high correlation. This article speculates that temperature and humidity in the equator of Africa and the Amazon rainforest have a greater influence on the spread of the epidemic.

5. Discussion

According to the research results, temperature and humidity have a high correlation with the activity of SARS-CoV-2. Other literature shows that in addition to temperature and humidity, biological gender [22], obesity rate [23], age [24], comorbidities [25] can affect the transmission of COVID-19. Therefore, the government should pay special attention to some special environment when formulating measures to prevent and control the epidemic situation, which can make SARS-CoV-2 have high activity. Countries should strengthen the control of disinfection and social isolation on the environment. The development of the epidemic can be predicted according to the changes in temperature and humidity, and corresponding prevention and control measures can be taken. According to the climate change in the area where the virus activity changes, epidemic prevention, and control measures should be strengthened when the virus activity is low, so as to suppress the trend of a virus outbreak and achieve the purpose of epidemic prevention and control. At the same time, attention should be paid to the prevention and control of the environmental factors such as temperature and humidity suitable for virus survival and transmission channels. Therefore, countries must seize the influence of climatic factors to take active measures to control the first COVID-19 epidemic when the virus activity is at a low level.
Therefore, the development of the epidemic can be predicted according to the changes of temperature and humidity, and corresponding prevention and control measures can be taken. According to the climate change in the area where the virus activity changes, the epidemic prevention and control measures should be strengthened when the virus activity is low, so as to suppress the trend of virus outbreak and achieve the purpose of epidemic prevention and control. At the same time, Attention should be paid to the prevention and control of the environmental factors such as temperature and humidity suitable for virus survival and transmission channels. Therefore, countries must seize the influence of climatic factors to take active measures to control the first COVID-19 epidemic when the virus activity is at a low level. At the same time, they should not relax the prevention and control measures to prevent the second COVID-19 epidemic caused by the increase of virus activity due to the change of climate factors.
At present, the specific mechanism of the interaction between Temperature, Humidity and Virus activity is unknown. But like influenza virus, COVID-19 can be transmitted by aerosol [26]. Casanova believes that compared with medium relative humidity (50%), COVID-19 has a greater survival rate or greater protection at high relative humidity (80%) [27]. So this research speculated that low temperature and high humidity lead to the increase of suspended solids in the atmosphere, which provides the ideal conditions for virus attachment, replication and transmission. Low temperature can also dry the mucous membrane, reduce the function of cilia, and support the survival and transmission of virus and the spread of disease [28]. Therefore, it is speculated that temperature and humidity can affect the spread of COVID-19 by affecting the spread of COVID-19. On the one hand, temperature affects the human mucous membrane to reduce the human resistance to viruses. On the other hand, high humidity can increase the quality of aerosols in the air. The concentration increases the number of aerosol particles in the air [29], which leads to an increase in the speed of virus transmission. The combination of temperature and humidity reduces human resistance to viruses and enhances the speed of virus transmission, thus showing that the speed of virus transmission is accelerated, and the number of new confirmed cases increases daily.

6. Conclusions and Future Work

The analysis in this paper shows that the activity of the new coronavirus has the following characteristics:
  • There is a high correlation between the activity of the COVID-19 and temperature and humidity, while temperature is more correlated with the activity of the COVID-19. Wind speed, sea level pressure, altitude, and rainfall have little effect on the spread of the epidemic.
  • The intervention of epidemic prevention and control measures can restrain the influence of climatic factors on the spread of the virus. But climatic factors alone are not enough to restrain the spread of the epidemic.
  • Temperature and humidity in tropical areas have a more obvious impact on the spread of the epidemic.
Accordingly, this paper proposes the following conclusions, recommendations for global COVID-19 prevention and control.
  • It is better for each country to take appropriate or even more stringent prevention and control measures to minimize the risk of outbreak the epidemic by taking into account the development and changes of climatic factors such as temperature and humidity in the early stage of the epidemic.
  • As time goes by, the climate changes in the northern and southern hemispheres will affect the activity of the virus. It is necessary to pay special attention to the prevention and control of the virus, so as to prevent the spread of the COVID-19 epidemic in the southern hemisphere and secondary outbreak in some parts of the northern hemisphere due to the increased activity of the virus.
  • Countries with better epidemic prevention and control or countries with less serious epidemic situation shouldn’t take it lightly. It is necessary to strictly control various public areas to prevent the risk of re-outbreak caused by the enhancement of virus activity due to the climatic factors such as temperature and humidity.
This paper assesses the correlation analysis between the spread of COVID-19 epidemic and climatic factors. The number of confirmed cases is inevitably underestimated due to different detection coverage rates of COVID-19 in different countries, and the impact of changes in policies and local prevention and control strategies on the spread of the epidemic was not assessed in this study. Therefore, in the future work these issues need to be more detailed exploration.

Author Contributions

Y.F. collected literature and wrote the manuscript; S.L. reviewed and edited the manuscript; X.J., S.D., Y.W. and Z.H. Put forward suggestions for revision of the article. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Muhammad, S.; Long, X.; Salman, M. COVID-19 pandemic and environmental pollution: A blessing in disguise? Sci. Total Environ. 2020, 728, 138820. [Google Scholar] [CrossRef] [PubMed]
  2. Xiang, Y.; Wang, S. Chinaś city novel coronavirus pneumonia epidemic spread and population flow spatial relationship and its implications for the classification and governance of urban public health in China. Trop. Geogr. 2020, 17, 3903. [Google Scholar]
  3. Gautam, A.S.; Dilwaliya, N.K.; Srivastava, A.; Kumar, S.; Bauddh, K.; Siingh, D.; Gautam, S. Temporary reduction in air pollution due to anthropogenic activity switch-off during COVID-19 lockdown in northern parts of India. Environ. Dev. Sustain. 2020. [Google Scholar] [CrossRef] [PubMed]
  4. World Health Organization. WHO Characterizes COVID-19 as a Pandemic—11 March 2020. [EB/OL]. Available online: https://www.who.int/emergencies/diseases/novel-coronavirus-2019/events-as-they-happen (accessed on 20 March 2020).
  5. Wang, C.; Horby, P.W.; Hayden, F.G.; Gao, G.F. A novel coronavirus outbreak of global health concern. Lancet 2020, 395, 470–473. [Google Scholar] [CrossRef] [Green Version]
  6. Jelodar, H.; Wang, Y.; Orji, R.; Huang, H. Deep sentiment classification and topic discovery on novel coronavirus or covid-19 online discussions: Nlp using lstm recurrent neural network approach. arXiv 2020, arXiv:2004.11695. [Google Scholar] [CrossRef] [PubMed]
  7. Alsaeedy, A.A.; Chong, E.K. Detecting Regions at Risk for Spreading COVID-19 Using Existing Cellular Wireless Network Functionalities. IEEE Open J. Eng. Med. Biol. 2020, 1, 187–189. [Google Scholar] [CrossRef]
  8. Li, W.; Zhang, G. Chinaś novel coronavirus pneumonia and SARS epidemic contrast and its impact on Chinaś economy. First Financ. Dly. 2020. [Google Scholar]
  9. Gu, P.; Xu, X. Temperature characteristics and countermeasures in epidemic seasons such as new crown pneumonia. Life Disasters 2020. [Google Scholar]
  10. Zhu, L.; Liu, X.; Huang, H. Meteorological impact on the COVID-19 pandemic: A study across eight severely affected regions in South America. Sci. Total Environ. 2020, 744, 140881. [Google Scholar] [CrossRef]
  11. Prata, D.N.; Rodrigues, W.; Bermejo, P.H. Temperature significantly changes COVID-19 transmission in (sub) tropical cities of Brazil. Sci. Total Environ. 2020, 729, 138862. [Google Scholar] [CrossRef]
  12. Goswami, K.; Bharali, S.; Hazarika, J. Projections for COVID-19 pandemic in India and effect of temperature and humidity. Diabetes Metab. Syndr. Clin. Res. Rev. 2020, 14, 801–805. [Google Scholar] [CrossRef] [PubMed]
  13. Lowen, A.C.; Mubareka, S.; Steel, J.; Palese, P. Influenza virus transmission is dependent on relative humidity and temperature. PLoS Pathog. 2007, 3, e151. [Google Scholar] [CrossRef] [PubMed]
  14. Barreca, A.I.; Shimshack, J.P. Absolute humidity, temperature, and influenza mortality: 30 years of county-level evidence from the United States. Am. J. Epidemiol. 2012, 176, S114–S122. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Żuk, T.; Rakowski, F.; Radomski, J.P. Probabilistic model of influenza virus transmissibility at various temperature and humidity conditions. Comput. Biol. Chem. 2009, 33, 339–343. [Google Scholar] [CrossRef]
  16. Rath, S.; Tripathy, A.; Tripathy, A.R. Prediction of new active cases of coronavirus disease (COVID-19) pandemic using multiple linear regression model. Diabetes Metab. Syndr. Clin. Res. Rev. 2020. [Google Scholar] [CrossRef]
  17. Ayyoubzadeh, S.M.; Ayyoubzadeh, S.M.; Zahedi, H.; Ahmadi, M.; Kalhori, S.R.N. Predicting COVID-19 incidence through analysis of google trends data in iran: Data mining and deep learning pilot study. JMIR Public Health Surveill. 2020, 6, e18828. [Google Scholar] [CrossRef]
  18. Kass, D.A.; Duggal, P.; Cingolani, O. Obesity could shift severe COVID-19 disease to younger ages. Lancet 2020, 395, 1544–1545. [Google Scholar] [CrossRef]
  19. Yang, S.; Cao, P.; Du, P.; Wu, Z.; Zhuang, Z.; Yang, L.; Yu, X.; Zhou, Q.; Feng, X.; Wang, X.; et al. Early estimation of the case fatality rate of COVID-19 in mainland China: A data-driven analysis. Ann. Transl. Med. 2020, 8, 128. [Google Scholar] [CrossRef]
  20. Xiong, Y.; Sun, D.; Liu, Y.; Fan, Y.; Zhao, L.; Li, X.; Zhu, W. Clinical and high-resolution CT features of the COVID-19 infection: Comparison of the initial and follow-up changes. Investig. Radiol. 2020. [Google Scholar] [CrossRef]
  21. Xu, X.; Yu, J.; Li, H.; Yang, L. Comparative study on calculation methods of dew point temperature. J. Meteorol. Environ. 2016, 32, 107–111. [Google Scholar]
  22. Klein, S.L.; Dhakal, S.; Ursin, R.L.; Deshpande, S.; Sandberg, K.; Mauvais-Jarvis, F. Biological sex impacts COVID-19 outcomes. PLoS Pathog. 2020, 16, e1008570. [Google Scholar] [CrossRef] [PubMed]
  23. Dietz, W.; Santos-Burgoa, C. Obesity and its Implications for COVID-19 Mortality. Obesity 2020, 28, 1005. [Google Scholar] [CrossRef] [Green Version]
  24. Lighter, J.; Phillips, M.; Hochman, S.; Sterling, S.; Johnson, D.; Francois, F.; Stachel, A. Obesity in Patients Younger Than 60 Years Is a Risk Factor for COVID-19 Hospital Admission. Clin. Infect. Dis. 2020, 71, 896–897. [Google Scholar] [CrossRef] [Green Version]
  25. Robilotti, E.V.; Babady, N.E.; Mead, P.A.; Rolling, T.; Perez-Johnston, R.; Bernardes, M.; Joanow, A. Determinants of COVID-19 disease severity in patients with cancer. Nat. Med. 2020, 26, 1218–1223. [Google Scholar] [CrossRef]
  26. Anderson, E.L.; Turnham, P.; Griffin, J.R.; Clarke, C.C. Consideration of the Aerosol Transmission for COVID-19 and Public Health. Risk Anal. 2020, 40, 902–907. [Google Scholar] [CrossRef] [PubMed]
  27. Casanova, L.M.; Jeon, S.; Rutala, W.A.; Weber, D.J.; Sobsey, M.D. Effects of air temperature and relative humidity on coronavirus survival on surfaces. Appl. Environ. Microbiol. 2010, 76, 2712–2717. [Google Scholar] [CrossRef] [Green Version]
  28. Gardner, E.G.; Kelton, D.; Poljak, Z.; Van Kerkhove, M.; Von Dobschuetz, S.; Greer, A.L. A case-crossover analysis of the impact of weather on primary cases of Middle East respiratory syndrome. BMC Infect. Dis. 2019, 19, 113. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  29. Sun, Y.; Wang, Z.; Fu, P.; Jiang, Q.; Yang, T.; Li, J.; Ge, X. The impact of relative humidity on aerosol composition and evolution processes during wintertime in Beijing, China. Atmos. Environ. 2013, 77, 927–934. [Google Scholar] [CrossRef]
Figure 1. Comparison of test sets and model predictions in Different Countries.
Figure 1. Comparison of test sets and model predictions in Different Countries.
Ijerph 17 07958 g001
Figure 2. Correlations between the number of daily new confirmed cases and climate variables.
Figure 2. Correlations between the number of daily new confirmed cases and climate variables.
Ijerph 17 07958 g002
Figure 3. Correlation between New and Tmax.
Figure 3. Correlation between New and Tmax.
Ijerph 17 07958 g003
Figure 4. Correlation between New and Tmin.
Figure 4. Correlation between New and Tmin.
Ijerph 17 07958 g004
Figure 5. Correlation between New and DP.
Figure 5. Correlation between New and DP.
Ijerph 17 07958 g005
Figure 6. Correlation between New and Humidity.
Figure 6. Correlation between New and Humidity.
Ijerph 17 07958 g006
Table 1. Observation Variable Table.
Table 1. Observation Variable Table.
DataNewTmaxTminSea_PressureWind_SpeedElevationRainfallDPHumidity
03/22 Y 1 X 11 X 12 X 13 X 14 X 15 X 16 X 17 X 18
03/23 Y 2 X 21 X 22 X 23 X 24 X 25 X 26 X 27 X 28
..............................
06/22 Y n X n 1 X n 2 X n 3 X n 4 X n 5 X n 6 X n 7 X n 8
Table 2. Correlation Coefficient Table.
Table 2. Correlation Coefficient Table.
NewTmaxTminSea_PressureWind_SpeedElevationRainfullDPHumidity
New1.00 R 01 R 02 R 03 R 04 R 05 R 06 R 07 R 08
Tmax R 10 1.00 R 12 R 13 R 14 R 15 R 16 R 17 R 18
Tmin R 20 R 21 1.00 R 23 R 24 R 25 R 26 R 27 R 28
Sea_Pressure R 30 R 31 R 32 1.00 R 34 R 35 R 36 R 37 R 38
Wind_Speed R 40 R 41 R 42 R 43 1.00 R 45 R 46 R 47 R 48
Elevation R 50 R 51 R 52 R 53 R 54 1.00 R 56 R 57 R 58
Rainfull R 60 R 61 R 62 R 63 R 64 R 65 1.00 R 67 R 68
DP R 70 R 71 R 72 R 73 R 74 R 75 R 76 1.00 R 78
Humidity R 80 R 81 R 82 R 83 R 84 R 85 R 86 R 87 1.00
Table 3. Multiple Linear Regression Models of Some Countries.
Table 3. Multiple Linear Regression Models of Some Countries.
Country β 1 β 2 β 3 β 4 β 5 β 6 β 7 β 8 β 0
Brazil−863555442482600−1409−755 2.7 × e 5
India−12861259−119100 2.1 × e 5 0 7.6 × e 7 2.1 × e 9
Peru 4.4 × e 14 2.3 × e 14 7.7 × e 10 388 4.9 × e 10 −6.3 6.9 × e 14 2.2 × e 14 2.7 × e 16
Mexico5368100−1789−352793−4548
South Africa139−7220−44059188−1645518
US 3.8 × e 9 9.0 × e 9 −7184 6.1 × e 11 593 7.0 × e 8 8.9 × e 8 1.4 × e 14
Table 4. Modified Determination Coefficients of Some Countries.
Table 4. Modified Determination Coefficients of Some Countries.
ItemBrazilIndiaPeruMexicoSouth AfricaUS
R ¯ 2 0.600.850.850.750.800.43
Table 5. Correlation Coefficient between New and Climate Variables in the Top Six Countries.
Table 5. Correlation Coefficient between New and Climate Variables in the Top Six Countries.
CountryTmaxTminSea_PressureWind_SpeedElevationRainfullDPHumidity
Brazil−0.81−0.840.570.48 2.0 × e 16 0.52−0.84−0.85
India0.710.760.000.00 2.0 × e 16 −0.840.000.00
Peru−0.69−0.720.000.39 9.3 × e 17 −0.13−0.38−0.21
Mexico0.750.73−0.55−0.31 6.7 × e 16 −0.08−0.84−0.81
South Africa−0.41−0.370.137 0.1 × e 16 −0.17−0.53−0.80
United States0.110.11−0.020.19 4.9 × e 16 0.02−0.32−0.46
* Due to the lack of India’s dew point temperature and relative humidity data, the correlation is set to 0 to avoid affecting the experimental results.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Lin, S.; Fu, Y.; Jia, X.; Ding, S.; Wu, Y.; Huang, Z. Discovering Correlations between the COVID-19 Epidemic Spread and Climate. Int. J. Environ. Res. Public Health 2020, 17, 7958. https://doi.org/10.3390/ijerph17217958

AMA Style

Lin S, Fu Y, Jia X, Ding S, Wu Y, Huang Z. Discovering Correlations between the COVID-19 Epidemic Spread and Climate. International Journal of Environmental Research and Public Health. 2020; 17(21):7958. https://doi.org/10.3390/ijerph17217958

Chicago/Turabian Style

Lin, Shaofu, Yu Fu, Xiaofeng Jia, Shimin Ding, Yongxing Wu, and Zhou Huang. 2020. "Discovering Correlations between the COVID-19 Epidemic Spread and Climate" International Journal of Environmental Research and Public Health 17, no. 21: 7958. https://doi.org/10.3390/ijerph17217958

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop