Forecasting the COVID-19 transmission in Italy based on the minimum spanning tree of dynamic region network

View article
Bioinformatics and Genomics

Introduction

The world is currently witnessing a major and devastating pandemic with substantial mortality and morbidity–Coronavirus Disease 2019 (COVID-19) (Mohanty et al., 2020). It was declared by the World Health Organization (WHO) as a public health emergency of international concern in January 2020 (Team, 2020; Lai et al., 2020). As of November 28, 2020, about 60 million cases and 1.45 million deaths were confirmed globally. Italy is one of the most affected countries. In about two months, i.e., from mid-February 2020 to mid-April 2020, it has been one of the main epicenters of the COVID-19 pandemic when the epidemic reached its first peak. Then, the epidemic curve gradually decreased until early October 2020, after which the spread of infection accelerated again until today (Perone, 2020). As shown in Fig. 1A, a new and more severe epidemic is sweeping Italy. As of November 28, Italy had suffered 54,363 deaths and 1,564,532 cases (Dong, Du & Gardner, 2020). Therefore, there is an urgent need for an effective and low-cost model to build an epidemic surveillance system to help countries severely affected by the epidemic like Italy to warn of a new wave of COVID-19 outbreaks in the future.

Schematic diagram of COVID-19 pandemic in Italy and prediction principle.

Figure 1: Schematic diagram of COVID-19 pandemic in Italy and prediction principle.

(A) From mid-February 2020 to mid-April 2020, the epidemic in Italy reached its first peak. Since then, the epidemic curve gradually declined until early October, and then the spread of COVID-19 infection accelerated again until today. A new wave of COVID-19 outbreaks is affecting Italy. (B) According to the MST-DNM model, the process of COVID-19 outbreaks is divided into three stages, including the normal stage, the pre-outbreak stage and the outbreak stage. The sudden increase in the MST-DNM indicator indicates a transition from the normal stage to the pre-outbreak stage, i.e., the critical point before the upcoming outbreak of COVID-19 that results in an increase in daily cases.

The COVID-19 pandemic has sparked an intense debate about the factors underlying the dynamics of the outbreak (Pacheco Coelho et al., 2020; Pequeno et al., 2020). Meanwhile, the study of mathematical models of epidemiology is helpful to understand the dynamics of epidemics, being an important tool to evaluate the potential effects of preventive and controlled measures, especially when their characteristics are still unclear (Chatterjee et al., 2020; Khan et al., 2019; Cheng et al., 2020). Under such circumstances, by exploring the dynamical information from region networks and time-series data, we employed a combined model, the minimum-spanning-tree-based dynamical network marker (MST-DNM) (Yang et al., 2020), to quantitatively describe the dynamics of COVID-19 transmission and thus identify the early-warning signal of a new wave of COVID-19 pandemic in Italy. This is the first time that this model has been applied to the study of the emerging infectious disease like COVID-19. This model is improved from our recently proposed concept, the so-called dynamical network biomarker (DNB), which determines the critical state of complex diseases by analyzing the dynamics of driven biomolecules (i.e., a group of genes and proteins that are the leading factors to the critical state transition) (Chen et al., 2012). The DNB based methods have been applied to a number of biological progresses and obtained remarkable results, including identifying the critical points of cell fate decision (Mojtahedi et al., 2016) and cellular differentiation (Richard et al., 2016), detecting the critical period during various biological processes (Liu et al., 2014; Liu et al., 2019; Chen et al., 2015; Chen et al., 2017), and predicting the warning signals of influenza outbreaks (Yang et al., 2020; Chen et al., 2019; Chen et al., 2020).

By extracting the minimum spanning tree from the dynamic region network, we could typically describe the dynamics of the spread of the infectious diseases among regions. Specifically, we collected the historical information of daily cases by COVID-19 infection in Italy from February 24, 2020 to November 28, 2020. When applied to the region networks constructed based on geographical location and traffic conditions, the MST-DNM model has the ability to monitor the whole process of epidemic spread in Italy, and successfully identify the early-warning signals about two weeks in advance. It is worth noting that in the previous research, the key role of the minimum spanning tree in this model has been described in detail (Yang et al., 2020), which can avoid issuing wrong warning signals due to the appearance of local abnormal correlation. Therefore, we pay more attention to the interpretability and practical significance of the MST-DNM model in our study, as detailed in the Materials & Methods section and in the Results section. Consequently, our model is quite suitable for predicting the potential outbreak of COVID-19 in Italy with the characteristics of nonlinearity time series and small sample size based on Italian region network, which may help to develop new control strategies for COVID-19 before its new wave of outbreaks.

Materials & Methods

Theoretical background

The spread of infectious diseases in a region is described as the dynamic evolution of a nonlinear system, while the outbreak of COVID-19 is regarded as a qualitative state transition of a dynamic system (Yang et al., 2020). From the perspective of dynamic modelling, the stage before the COVID-19 outbreak is regarded as a pre-outbreak stage, immediately after which the system undergoes a critical transition. Then the dynamical process of an epidemic system can be roughly modelled as three stages similar to the dynamics of disease progression (Chen et al., 2012): (i) the normal stage, which is a stable stage with high resilience; (ii) the pre-outbreak stage, which is dynamically unstable. At this stage, the epidemic is still controllable through appropriate measures; and (iii) the outbreak stage, which is an uncontrolled stage with high elastic dynamic features. As shown in Fig. 1B, when the system transits from the normal stage to the pre-outbreak stage, the region network changes significantly and the indicator of our model rises sharply. Different from the traditional detection of the outbreak stage, our model could determine the pre-outbreak stage which generally has no obvious abnormalities but with high potential of state transition into a severe and irreversible stage.

Data collection

Data about the COVID-19 pandemic can be obtained from the GitHub repository managed by Johns Hopkins University for modeling (Dong, Du & Gardner, 2020), which contains publicly available data from multiple sources. The open dataset of Italy we use is available on the website COVID-19 in Italy at: https://github.com/pcm-dpc/COVID-19. This database has been created and managed by the Italian Civil Protection Department, which is updated and integrated daily (Italian Civil Protection D et al., 2020).

Algorithm applied in Italy

The MST-DNM model is illustrated in Fig. 2 and the detailed process of our model applied in Italy is described in the following four steps.

i. Modeling and mapping

It is noted that our model is applied to the region network to monitor the COVID-19 transmission and outbreak in Italy. Therefore, it is necessary to construct the regional network based on the Italian regions’ geographic distribution and their adjacent information at first. The adjacent information is shown in Table S1 of Supplementary Information. In the network, each node represents a region or an autonomous province, while each edge represents the adjacent relation between two regions. Then a 21 ∗ 279 data matrix formed by the records of COVID-19 daily confirmed cases is mapped to the network. The region network model of Italy is presented as in Fig. 3.

ii. Weighting and extracting

The region network in the first step can be represented as an undirected graph G = V , E , which contains a collection V = v i i = 1 M composed of M vertexes and a collection E = e j j = 1 N composed of N edges in this network. In the Italian region network (Fig. 3), M is 21 and N is 34. It should be noted that in order to make the application of our model easier to understand, Node 15 is not considered in the subsequent calculations. When the model is applied to other regions, the construction of regional network should be reconsidered, instead of simply excluding islands. In addition, as shown in Fig. S1, as the COVID-19 epicenter is located in northern Italy, whether Sardinia is included in the model has little influence on the final warning results.

The flow chart of algorithm application in Italy.

Figure 2: The flow chart of algorithm application in Italy.

The flow chart above shows how the algorithm works in Italy based on region networks and minimum spanning tree. The model is driven by time series and dynamic region networks over time. Regarding a point T = t as a candidate tipping point, MST-DNM indications can be calculated based on weighted region networks. If the indication increases significantly, the candidate T = t is determined as the identified tipping point, and the algorithm ends. Otherwise, if there is no significant change, then T = t is classified as a time point belonging to the normal stage, and the algorithm continues with T = t+1 being a new candidate tipping point.
The region network model of Italy.

Figure 3: The region network model of Italy.

(A) A 21-node network model is constructed based on the geographic information and adjacent relationships of 21 regions/autonomous provinces in Italy. (B) A detailed list of correspondences between regions/autonomous provinces and nodes.

The number of daily confirmed cases of a region is considered as a sample s to form a set of time series data. Therefore, for each vertex vi of the region network on the day t, there is a corresponding time series of confirmed cases S t v i = s 1 , s 2 , , s t . In order to assign a weight W t k to each edge ek of the region network on the day t, it is necessary to calculate the correlations between the two vertexes vivj of the edge ek as follows:

W t k = Δ S D t i , j Δ P C C t i , j ,

where

Δ S D t i , j = S D t S t v i + S D t S t v j ¯ S D t 1 S t 1 v i + S D t 1 S t 1 v j ¯

is the differential standard deviation of the nodes vivj on day t and t − 1, and S D t S t v i and S D t S t v j represent the standard deviation of the time series data of the two vertices vivj. Δ P C C t i , j = | P C C t S t v i , S t v j | | P C C t 1 S t 1 v i , S t 1 v j |

is the differential Pearson’s correlation coefficient between the two vertices vivj of the edge ek, where P C C t S t v i , S t v j and P C C t 1 S t 1 v i , S t 1 v j represent the Pearson’s correlation coefficient between the two vertices vivj on day t and t − 1 respectively.

Based on the above work, an undirected and edge-weighted network that changes dynamically over time is obtained. The next step is to extract the minimum spanning tree from the dynamic region network at each moment. In detail, when Italy is on the t day of the COVID-19 pandemic, we could extract the minimum spanning tree M S T t = V , E to better describe the evolution of Italian regional network with the change of daily cases. In this work, a classical minimum spanning tree algorithm, the Prim’s algorithm, is applied to the differential weighted network Gt at a specific time t to obtain its minimum spanning tree MSTt.

iii. Calculating early warning indicators It

Then, the MST-DNM indicator It can be obtained by calculating the weight sum of the minimum spanning tree. I t = i = 1 K W t i , where W t i represents the weight of the edge ei of the minimum spanning tree MSTt at time t and K is the total number of edges of the minimum spanning tree MSTt. The specific algorithm flow is shown in Algorithm 1.

 
_______________________ 
Algorithm 1 The indicator It using Prim’s algorithm________________________________ 
Require: The nodes of the weighted undirected graph V ; the function w(u,v) 
     which means the weight of the edge (u,v); the function adj(v) which means 
     the nodes adjacent to v. 
Ensure: The sum of weights of the minimum spanning tree in the input graph, 
     It. 
  1:  It ← 0 
  2:  choose an arbitrary node in V to be the root 
  3:  dis(root) ← 0 
  4:  for each node v ∈ (V −{root}) do 
 5:       dis(v) ←∞ 
 6:  end for 
 7:  rest ← V 
 8:  while rest ⁄= Φ do 
 9:       cur ← the node with the minimum dis in rest 
10:       It ← It + dis(cur) 
11:       rest ← rest − cur 
12:       for each node v ∈ adj(cur) do 
13:            dis(v) ← min(dis(v),g(cur,v)) 
14:       end for 
15:  end while 
16:  return It________________________________________________________________________________________    

According to DNB theory, during the critical stage, there are two cases for the minimum spanning tree MSTt at time point t:

  • In the MSTt, all of the nodes are DNB members;

  • In the MSTt, DNB and non-DNB members both exist.

For the above two cases respectively, the statistical indicator It has significant changes as presented in Table 1. Obviously, the MSTt based on the indicator It and the edges’ weight Wt has the ability to monitor the dynamical process of COVID-19 spread between regions and issue a warning signal timely.

iv. Identifying early-warning points

In previous studies, machine learning methods, i.e., logistic regression (Yang et al., 2020), have been applied to identify the appearance of critical points based on years of high-dimensional data. However, for COVID-19 which originated at the beginning of 2020, the time series data obtained is of quite small scale, which is difficult for machine learning algorithm to learn the appropriate parameters and features. Therefore, the fold-change threshold, an index of volatility, is used to detect the early-warning signal. Specifically, a 2-fold change threshold is applied to identify the significant changes of the indicator It in our study.

Table 1:
Critical behaviours of the MST-DNM indicator It for different cases.
Case Nodes SDt Δ SD t i . j PCC t i , j Δ PCC t i , j W t k It
1 All DNM
2 DNM and non-DNM D ↗ D ↗
N → N ↘
DOI: 10.7717/peerj.11603/table-1

Notes:

When the system moves from time point t − 1 to t, it is approaching the critical point.

“ ↗” represents the increase of variables; “ ↘” represents the decrease of variables; “ →” represents that there is no significant change in the variables.

“D” represents the DNM members; “N” represents the non-DNM members.

SDt is the standard deviation at time point t; P C C t i , j is the Pearson’s correlation coefficient between two nodes vivj.

The significance of indicator It

The transmission of COVID-19 is a complicated dynamic system, which contains many biomedical and social factors. Due to the massive number of influencing factors, it is difficult to describe the transport dynamics in high-dimensional space mathematically. The sharp or qualitative transition of regional network from the normal state to the outbreak state corresponds to the bifurcation point in dynamic system theory (Gilmore, 1993). According to this theory, if the system approaches the critical point, it will eventually be confined to one-dimensional or two-dimensional space, where the dynamic system can be represented in quite simple forms. This is the theoretical basis for developing a general indicator that can describe the dynamics of COVID-19 transmission.

It’s clear from the above statement that the meaning of the variables in Formula (1) is as follows: (i) Δ S D t i , j can describe the differential fluctuation of cases growth in two adjacent regions compared with the previous time point. (ii) Δ P C C t i , j can describe the difference of the COVID-19 interaction between two adjacent regions compared with the previous time point. Apparently, attention should be paid to the edge with larger weight. Because it means that the regions associated with this edge not only worsen their own epidemic situation, but also have a great impact on the surrounding regions. Therefore, it’s obvious that the indicator It, the weight sum W t k of all edges in MSTt, has the ability to observe the change of a group of weighted differential networks.

Results

Early warning of COVID-19 outbreaks in Italy

We collect the historical data of daily cases infected by COVID-19 from February 24, 2020 to November 28, 2020 in Italy. The outbreak points of COVID-19 are defined as the peak of the daily cases.

Provided as in Fig. 4, the early-warning signals are identified through the MST-DNM model for each outbreak of COVID-19. For the first wave of COVID-19 outbreak from mid-February to mid-April, the early-warning signal was issued on March 6, which is about 15 days ahead of the outbreak point. This means that our model has successfully played an early warning role.

Forecast of the COVID-19 outbreaks in Italy.

Figure 4: Forecast of the COVID-19 outbreaks in Italy.

In each subgraph, the left y-axis is the number of daily cases in Italy and the right y-axis is the MST-DNM indicator of corresponding date. (A) For the first wave of COVID-19 outbreak in Italy, the early-warning signal was about 15 days ahead of the outbreak point. (B) For the second wave of COVID-19 epidemic, the MST-DNM indicator significantly increases about 10 days before the actual number of confirmed cases skyrockets.

On 9 March 2020, the Italian prime minister Mr. G Conte announced the implementation of placing the country into lockdown to restrict the movement of people, thus reducing the possibility of human to human infection (Chintalapudi, Battineni & Amenta, 2020; Remuzzi & Remuzzi, 2020). Since the last week of March, the statistics have become consciously optimistic, and the number of daily cases has been stabilizing. For the second wave of epidemic since early October, later developing into a larger outbreak, the indicator It was sensitive and significantly increased about 10 days before the actual number of confirmed cases skyrockets. In addition, the indicator showed a continuous downward trend with wave type after November 5, which means that the number of daily cases in Italy has initially peaked. The successful prediction of each wave of COVID-19 outbreaks in Italy demonstrates the robustness and effectiveness of the MST-DNM model in detecting real-time warning signals for emerging infectious diseases.

The dynamics of COVID-19 transmission in Italy

Dynamic monitoring map

To better illustrate the MST-DNM model’s principle, we introduce the dynamic evolution of the COVID-19 transmission network in Italy. As shown in Fig. 5, the daily number of newly confirmed cases with MinMaxScaler in each region is mapped to each node and the correlation between two vertices of an edge is mapped to the thickness of the edge in the tree network. The specific method of MinMaxScaler is to subtract the minimum value of the feature from the processed value and divide it by the feature range, which is the difference between the original maximum and the original minimum. MinMaxScaler can keep the shape of the original data distribution of each region, and make the coloring of each node unaffected by other regions. It is clear that the edges became thicker before the nodes turned darker on October 20, which indicated our model identified the early-warning signal in the pre-outbreak stage when the actual number of confirmed cases did not increase significantly. After that, i.e., on October 26, the edges continued to become thicker, which meant that the epidemic might continue to worsen in Italy.

Dynamic evolution of the minimum spanning tree extracted from Italian region network.
Figure 5: Dynamic evolution of the minimum spanning tree extracted from Italian region network.
The tree networks are respectively selected on September 26th (the normal stage), October 12th (the normal stage), October 20th (the pre-outbreak stage), October 26th (the pre-outbreak stage), November 11th (the outbreak stage) and November 28th (the outbreak stage). In each subgraph, each node is colored by the number of daily cases with MinMaxScaler in each region and the thickness of the edge represents the correlation between two vertices of the edge. It is clear that the edges become thicker before the nodes turn darker, which indicates our model can identify the early-warning signal when the actual number of confirmed cases do not increase significantly.

Warning function of MST-DNM in Italy

As of November 28, the five regions with the highest cumulative number of confirmed cases in Italy are Lombardy, Piedmont, Campania, Veneto and Emilia-Romagna, which correspond to Nodes 10, 13, 5, 21 and 6 in Fig. 6A, respectively. Among them, Lombardy region is considered as the epicenter of COVID-19 outbreak in Italy (Grasselli, Pesenti & Cecconi, 2020; Tuite et al., 2020).

Demonstration of early warning function in Italy.
Figure 6: Demonstration of early warning function in Italy.
(A) The minimum spanning tree of Italian dynamic region network on October 26th, colored by the scaled value of daily cases and the weight of edges, is divided into two local networks according to the thickness of the edges. (B) The stacked bar chart of new cases in Italy shows the severity of the COVID-19 outbreak after the early warning, where the blue columns represent the cumulative number of confirmed cases in each regions/ autonomous provinces in Italy as of October 26, and the orange columns represent the total number of new confirmed cases from October 27 to November 28. In particular, the red regional names highlight the five regions with the most confirmed cases. In combination, it is clear that two high-risk regions are identified by the minimum spanning tree extracted from the dynamic region network.

As shown in Fig. 6A, the dynamic region network is divided into two local networks according to the thickness of the edges, which are centered on Node 10 (Lombardy) and Node 5 (Campania) respectively. It’s obvious that our model has successfully warned two high-risk regions. In fact, a large number of new cases have been confirmed in these regions over the next month as shown in Fig. 6B. In addition, the nodes sandwiched between two local networks, i.e., Node 17 (Tuscany) and Node 19 (Umbria), should also be focused on. It turns out that from October 26 to November 28, the growth rate of new case in Tuscany and Umbria exceeded 200%.

As described in Table 2, the thicker edges in two local networks on October 26, such as e v 10 , v 21 , e v 6 , v 21 , e v 13 , v 20 , e v 5 , v 8 and e v 12 , v 14 , should be focused on. As of November 28, the number of confirmed cases in the regions, like Lombardy, Campania, Lazio and Piedmont, corresponding to the nodes associated with these edges has a high growth rate and a large number of new cases. This also verify the early warning function of MST-DNM, which not only measures its own epidemic situation, but also reflects the regional impact on the surrounding areas.

Table 2:
Description of the early warning function of edge weight.
Local region network # Edge Related regions Newly added cases Growth rate
1 e v 10 , v 21 Lombardy 244,726 154.96%
Veneto 95,506 210.07%
e v 6 , v 21 Emilia-Romagna 71,379 148.93%
Veneto 95,506 210.07%
e v 13 , v 20 Piedmont 107,150 187.46%
Valle d’Aosta 3747 140.39%
2 e v 5 , v 8 Campania 111,077 273.91%
Lazio 80,262 223.35%
e v 12 , v 14 Molise 3193 242.08%
Puglia 37,341 249.39%
DOI: 10.7717/peerj.11603/table-2

Notes:

“Newly added cases” refers to the cumulative number of confirmed cases in the corresponding region from October 26 to November 28.

“Growth rate” refers to the newly confirmed cases in corresponding regions from October 26 to November 28 divided by the cumulative cases on October 26.

Application of MST-DNM in northern Italy

As shown in Fig. 6B, several areas in northern Italy, such as Lombardy, Veneto, etc., are the most severely affected by COVID-19. Our model can be applied not only to the entire country of Italy, but also to an area. In order to verify the effectiveness of our model, it has also been applied to identify the early-warning signals of COVID-19 outbreaks in northern Italy. The results are presented in Figs. S2S3 of Supplementary Information.

Performance comparison

The machine learning algorithms are also used to forecast the COVID-19 epidemic situation (Parhusip, 2020; Singh et al., 2020; Parbat & Chakraborty, 2020). Regarding the identification of early warning signals of COVID-19 outbreaks as a binary classification problem, we compare the performance of our combined model with the support vector machine (SVM). The AUC of MST-DNM is 0.9318, while that of SVM is 0.9076. It’s clear that the performance of a system based on MST-DNM is better than that based on SVM when only the data of daily confirmed cases is provided. In addition, the SVM model issue an early warning signal on September 28, 2020, which is too early to be of practical significance for the second wave outbreak; and our adaptive model can actually issue an early warning signal about 10 days before the actual number of confirmed cases skyrockets. Actually, compared with traditional machine learning algorithms, the MST-DNM model has the following internal strengths. First of all, it is a model-free approach without any training and testing processes. There is no feature selection in MST-DNM strategy, which solely depends on the statistical conditions of our model. Second, it’s noted that there is no limitation of the data sample size for our approach, which means that our model could achieve a good performance with only small sample data. Therefore, it can be applied to describe and monitor the emerging infectious diseases like COVID-19. In addition, our combined model is capable to describe the dynamic process of the spread of COVID-19 through the minimum spanning tree of dynamic region networks.

Discussion

A new wave of COVID-19 epidemic is sweeping the world. On November 26, more than 1.5 million people were diagnosed with COVID-19 infection in Italy, and the death toll rose rapidly in the second wave of COVID-19 epidemic, bringing a heavy burden to hospitals. In order to prevent a new wave of COVID-19 pandemic or at least reduce the magnitude of COVID-19 outbreaks, it is essential to build a surveillance system that relies entirely on reliable and available information, such as the number of daily cases.

Specifically, unlike the critical transformation analysis based on DNB of complex diseases with genomic datasets, the DNB method has been improved and applied to the macro regional networks. The successful application in Italy shows that the MST-DNM is a model-free method with data-driven characteristics and has great potential in actual real-time monitoring for emerging infectious diseases. Moreover, this is the first time that the improved method based on DNB has been applied to predict the outbreak of COVID-19. Unlike previous studies that used the DNB based methods to predict influenza outbreaks (Yang et al., 2020; Chen et al., 2019; Chen et al., 2020), our study is based on small time series samples rather than years of time series data. Therefore, it could be employed to describe and monitor emerging infectious diseases like COVID-19. In addition, this paper introduces the practical significance and early warning function of the MST-DNM model in detail. It is believed that this is an important step from theory to practice. It should be noted that the MST-DNM model in our work is completely based on the records of confirmed cases per day, and has achieved satisfactory performance. Given more information on the spread of the COVID-19 epidemic, the monitoring model is expected to reliably forecast the transmission and outbreak of COVID-19 in terms of sensitivity and accuracy.

Although the proposed model have achieved good results, there are some limitations of the project:

  • The MST-DNM model in our work is completely based on the records of daily cases. If we could get data on the number of people tested in a region, we could measure the epidemic situation in this region more accurately. This may be the direction of model improvement in the future.

  • As for the recognition of early warning signals, we can take into consideration any alternative to that criterion and the effect that a different choice could have on the prediction results in the future work.

  • Experiments in this paper were performed on COVID-19 outbreaks in Italy. The future work could involve to exam the proposed model on other regions or countries.

Conclusions

In this study, we developed a combined model with dynamic network marker and minimum spanning tree solely based on the daily cases to describe and forecast the COVID-19 outbreaks in Italy. In order to put theory into practice, we also explain the significance and warning function of the model indicators in detail. By extracting the minimum spanning tree from the dynamic region network, the model can effectively identify the early-warning signals with an average of 2-week window lead prior to the catastrophic transition into COVID-19 outbreaks in Italy. Through the study of the network dynamics in Italy, this paper reveals the spread of COVID-19 on the network level. It is noteworthy that the driving force of MST-DNM only relies on small samples, rather than multi-year data. Therefore, it has great potential to monitor emerging infectious diseases timely.

Supplemental Information

Raw data and code

The daily cases data of COVID-19 in Italy from February 24, 2020 to November 28, 2020 is in ‘Italy_covid19_data.csv’. The rows represent the historical time and the columns represent different regions. The data in the table represents the number of daily cases.

The code is located in ‘myTools.py’, ‘calculate_Italy.py’ and ‘calculate_Northern_Italy.py’. ‘myTools.py’ contains the necessary calculation functions. With the environment for python and the corresponding package, this code could be run to get the results of calculations. ‘Italy_mst.json’ contains the weighted minimum spanning tree for each day, ‘Italy_daily_cases.xls’ contains the sum of daily confirmed cases in all regions of Italy for each day and ‘Italy_weight_sum.xls’ contains the daily MST-DNM indicator. For the calculation of northern Italy, the meaning is similar. The ‘data’ and code must be in the same directory.

DOI: 10.7717/peerj.11603/supp-1

Supplemental Figure and Tables

DOI: 10.7717/peerj.11603/supp-2
7 Citations   Views   Downloads