Keywords
SARS nCoV-2; COVID-19; ARIMA Forecast; COVID-19 in the Caribbean; Confirmed Cases; SARS nCoV-2 Death; Recovered COVID-19 cases
This article is included in the Emerging Diseases and Outbreaks gateway.
This article is included in the Coronavirus collection.
SARS nCoV-2; COVID-19; ARIMA Forecast; COVID-19 in the Caribbean; Confirmed Cases; SARS nCoV-2 Death; Recovered COVID-19 cases
According to the World Health Organization (WHO) and the Caribbean Public Health Agency (CARPHA), the ongoing pandemic caused by the SARS nCoV-2 virus is a grave concern worldwide, including in the 13 Caribbean countries, namely Antigua and Barbuda, Bahamas, Barbados, Cuba, Dominica, Dominican Republic, Grenada, Haiti, Jamaica, Saint Kitts and Nevis, Saint Lucia, Saint Vincent and the Grenadines, and Trinidad and Tobago. The risk of transmission of the COVID-19 virus in the Caribbean is high due to the high inflow of tourists and the return of nationals from all over the world. The islands also have a large number of inter-island migrants who returned to their mother states. Interestingly, the islands also have in-built advantages in controlling the pandemic because individual islands can be isolated and inter-island spread can be controlled. However, the region has particular disadvantages because its small population size causes its data to be overseen in global discussions. The Caribbean is home to only 0.56% of the world population. The number of confirmed COVID-19 cases is comparatively less compared to the U.S which has more than nine million confirmed cases till Oct 15, 2020. There is an inevitable risk of a second wave, as new cases can emerge in the region due to the opening of borders. In addition, there is the problem of illegal migrants in many of these countries who are unwilling to approach the health system once they realize that they need help. Porous borders are an unacknowledged source of worry in this crisis.
WHO, the Pan American Health Organization (PAHO), CARPHA, local health administrations, and other international organizations have developed risk assessment levels that establish COVID-19 information centers in different regions1,2. All CARPHA member states have been advised to increase surveillance mechanisms, including random tests, lockdown measures, increase public awareness, and implement national preparedness plans. PAHO, based in Trinidad and Tobago, Barbados, and Jamaica, is proactively collaborating with global organizations including the WHO and the various Ministries of Health in each of the Caribbean countries to convey information concerning the latest developments, public health policies, lockdown measures, therapeutics, and diagnostic modalities including an overview of rapid testing among vulnerable populations3. PAHO is also collaborating with various Caribbean countries regarding the state of public awareness, the use of masks, maintenance of social distancing, and relaxation of lockdown norms in a phased manner4–6. Regional and international health authorities are assisting with infection control strategies, essential drugs, and the supply and training of essential health care workers. The Caribbean Community (CARICOM) along with its member states has been working with the chief medical officers and local health administrators of various Caribbean countries to establish COVID-19 testing facilities, quarantine zones, and the National Emergency Operations Center for COVID-19 briefings7. In addition to the health sector, the growing COVID-19 pandemic has also affected tourism and businesses in the Caribbean.
To assess the spread of the SARS nCoV-2 virus in the region, we intend to project a short-term forecast in terms of the number of confirmed cases, deaths, and recovered cases in the Caribbean region. Data analysis involves the auto-regressive integrated moving average (ARIMA) model which forecasts future values in a time series based upon its past values, lags, and forecast errors. The predicted forecast will help CARPHA, PAHO, and other regional health organizations in the Caribbean organize resources and better prepare to assess risk in terms of community transmission from imported cases, coordinate regional preparedness with Incident Management Teams (IMT) and prepare travel guidelines for local populations.
The data for the Stata analysis is freely available for academic and research purposes through John Hopkins University GitHub Repository. It is one of the largest online communities for health care professionals, health institutions, and medical researchers. The data is maintained by John Hopkins University in collaboration with the Center for Systems Science and Engineering (CSSE) and technical support from ESRI and John Hopkins University Physics Laboratory8,9. The data include information concerning confirmed, death, and recovered cases of COVID-19 from 13 Caribbean countries, namely Antigua and Barbuda, Bahamas, Barbados, Cuba, Dominica, Dominican Republic, Grenada, Haiti, Jamaica, Saint Kitts and Nevis, Saint Lucia, Saint Vincent and the Grenadines, and Trinidad and Tobago (in alphabetical order). The time frame of the data starts from January 22, 2020, since situation reports from WHO about SARS nCoV-2 were published starting from January 21, 2020. Our study has been forecast until December 31, 2021.
The data consist of time series patterns with information concerning province/state, country, longitude, latitude, and confirmed COVID-19 cases for each respective country. Similar time series data were provided for COVID-19 deaths and recovered cases, respectively, from January 22, 2020, to August 16, 2021. The data was compiled in a Microsoft Excel spreadsheet, merged using a ‘sum’ argument entered as a formula for each specific date from January 22, 2020, and forecast until December 31, 2021. It was converted to .dta format for analysis using Stata 14 SE software ARIMA time series analysis. The analysis can be performed using PSPP software, written in C. It has a conventional command line with a graphical interface. The software uses the GNU Scientific Library and it is freely available for use. To declare the data as time series, the date variable was formatted using the Stata command ‘format’ and ‘%td’ in DDMMYYYY and quarterly format. It also involves the Stata command 'tsset', which declares the data as time series for a while using lead and lag operators. Also, the Stata command ‘tsfill’ was used to fill out the gaps in the time series model if there are any. The stationarity of the time series model was checked using the Stata ‘d’ differencing operator10. The differencing operator generates lags between current and previous values. Second-order differencing was performed using the ‘d2’ operator based on the lag values of the first-order differencing. Graphical representation of first-order and second-order differencing was made using two-way line plots using the Stata command ‘two-way’ and ‘tsline’ as shown in Figure 1. It is also very useful to check the stationarity of the time series using graphical plotting as second-order differencing plots diminish around zero. These plots are essential to check the time series stationarity as they fit numerically on the x and y-axis respectively (Figure 1, Figure 2).
Based on the graphical representation, the ARIMA model can be formed using ACF and PACF graphs using the Stata command ‘ac’ and ‘pac’ using first-order differencing and second-order differencing variables, respectively, as shown in Figure 2. ACF plots involve autocorrelation of prior values in a time series with various lags11,12. MA can be derived from ACF plots graphically which are involved in building ARIMA models for further analysis. In an ACF graph, the lines situated in the shaded region are within acceptable regions (95% CI) while the lines outside the shaded region are lags that are autocorrelated in a series and the values are taken up as MA for ARIMA modeling. ACF plots are based upon Barlett’s formula for MA(q) processes involving pointwise confidence intervals. Similarly, PACF graphs are based upon the selection of the partial autocorrelation in the selected time series. It involves a confidence interval calculated using standard error 1/ (the root of n). The graph may also include residual variance for each lag. The values of AR are derived from PACF graphs which can be used for ARIMA model forecasting. The value of I depends on the differencing of the first order (I=1) or second-order (1 = 2). The models are chosen based upon smaller AIC/BIC values, which indicates a better fitting model. Other features like log-likelihood ratio, p-value significance, and coefficient < 1 and 5% significance can also be checked to compare various models13–16. The survey was analyzed using regular regression and post-stratification methods. It involved predictive analysis and health informatics with the help of Farr Institute wherein complex and adaptive modeling techniques were used for large-scale data mining, statistics, and analysis.
Based on our data, out of the 13 Caribbean countries, the highest number of confirmed cases of COVID-19 is seen in the Dominican Republic, followed by Cuba, Jamaica, Haiti, Trinidad, and Tobago, Bahamas, Barbados, Antigua, and Barbuda, Grenada, St. Lucia, St. Vincent, and the Grenadines, Dominica, St. Kitts and Nevis, in this order17. Similarly, Caribbean countries with the highest number of death cases reported are the Dominican Republic, followed by Cuba, Haiti, Bahamas, Jamaica, Trinidad and Tobago, Barbados, Antigua and Barbuda, in this order. The highest number of recovered patients were from the Dominican Republic, followed by Cuba, Trinidad and Tobago, Jamaica, Barbados, Bahamas, Haiti, St. Lucia, Antigua and Barbuda, Dominica, Grenada, St. Vincent and the Grenadines, in this order. Our study demonstrates the future trend of the COVID-19 crisis in the Caribbean and its impact on health services and local communities.
Our data analysis with autoregressive integrated moving averages (ARIMA) modeling for 13 Caribbean countries predicts during the period Nov 16, 2021 - Nov 30, 2021, 2102437 (95% CI 2073255 - 2131619) confirmed cases, 23284 (95% CI 22971 - 23597) deaths and 817887 (95% CI 817871 - 817903) recovered COVID-19 cases; 2281189 (95% CI 2251323 - 2311055) confirmed cases, 25199 (95% CI 24879 - 25518) deaths and 817989 (95% CI 817971 - 818006) recovered COVID-19 cases during the time frame - December 1, 2021, to December 15, 2021. Similarly, during the period December 16, 2021 - December 31, 2021, 2470272 (95% CI 2438965 - 2501579) confirmed cases, 27220 (95% CI 26886 - 27555) deaths, and 818105 (95% CI 818085 - 818125) recovered COVID-19 cases are projected, respectively. The final ARIMA model chosen for confirmed COVID-19 cases, several deaths, and recovered cases are ARIMA (9,2,4), ARIMA (1,2,1), and ARIMA (1,2,1) respectively. These models were compared with other potential models such as ARIMA (4,2,1), ARIMA (7,2,2), ARIMA (2,1,1), ARIMA (2,1,5), ARIMA (3,1,1), in terms of AIC/BIC (Akaike Information Criterion/Bayesian Information Criterion), the significance of the p-value, coefficient < 1 and 5% significance and logarithmic likelihood ratio. The AIC/BIC values for confirmed cases are reported as 8316.06 and 8385.62, respectively. The best-fitting models in our study have the lowest A1C/BIC values in comparison to other potential models. The p-value is considered significant if it's less than 0.05. The AIC/BIC values for the number of deaths and recovered COVID-19 cases have AIC/BIC values - 3917.15/3934.54 and 13558.08/13571.12, respectively. The current and projected trends are shown in Figure 3 for COVID-19 number of confirmed cases, deaths, and recovered cases, respectively.
The projected ARIMA forecast shows a linear increase for COVID-19 confirmed cases along with an increase in recovered patients in the 13 Caribbean countries. The time series was checked for stationarity through first-order and second-order differencing along with the Dickey-Fuller test. The results were graphically plotted using PACF and ACF to derive AR and MA for forecasting model building (Table 1). Data analysis for the time series model was checked for stationarity as shown in Figure 1 using Stata differencing operators, and graphs were plotted along the axis for the time frame from January 22, 2020, to December 31, 2021. Figure 2 shows the ACF and PACF models for the number of confirmed cases, deaths, and recovered cases of COVID-19 plotted to find the best fitting model based on Bartlett's formula for the MA (q) 95% CI bands. Figure 3 shows current and projected forecast trends for final COVID-19 selected models - confirmed cases ARIMA (9,2,4), number of deaths ARIMA (1,2,1), and recovered cases ARIMA (1,2,1), respectively. Table 1 offers a comparison of various models along with the significance of the p-value and the AIC / BIC values. Table 2 shows the current and projected forecasts for the Caribbean region in terms of the number of confirmed cases, deaths, and recovered cases of COVID-19 (Table 2).
CONFIRMED CASES | FINAL MODEL | |||||
---|---|---|---|---|---|---|
Cons | (2,2,1) | (1,1,8) | (1,2,1) | (4,1,1) | (4,2,1) | (9,2,4) |
L1.ar | 0.07 | 0.99*** | 0.12* | 1.08*** | -0.06 | 0.95*** |
L2.ar | -0.05 | -0.13 | -0.13 | -0.98*** | ||
L3.ar | 0.07 | -0.03 | 0.81*** | |||
L4.ar | -0.02 | -0.11** | -0.80*** | |||
L5.ar | 0.47*** | |||||
L6.ar | 0.15* | |||||
L7.ar | 0.27*** | |||||
L8.ar | -0.06 | |||||
L9.ar | 0.14*** | |||||
L1.ma | -0.56*** | -0.47*** | -0.61*** | -0.55** | -0.43* | -1.49*** |
L2.ma | -0.08* | 1.45*** | ||||
L3.ma | 0.06 | -1.23*** | ||||
L4.ma | -0.07 | 1.10*** | ||||
L5.ma | 0.04 | -0.74*** | ||||
L6.ma | 0.07 | |||||
L7.ma | 0.12** | |||||
L8.ma | -0.13** | |||||
AIC | 8370.52 | 8381.55 | 8369.41 | 8400.96 | 8369.52 | 8316.06 |
BIC | 8392.26 | 8429.40 | 8386.80 | 8431.41 | 8399.95 | 8385.62 |
MORTALITY | FINAL MODEL | |||||
Cons | (1,1,1) | (2,1,1) | (4,2,2) | (1,2,2) | (7,2,2) | (1,2,1) |
L1.ar | 0.99*** | 0.92*** | 0.05 | -0.44 | 1.01*** | -0.07 |
L2.ar | 0.07 | -0.11 | 0.04 | |||
L3.ar | -0.16*** | -0.11* | ||||
L4.ar | -0.10 | -0.01 | ||||
L5.ar | 0.08 | |||||
L6.ar | -0.01 | |||||
L7.ar | -0.07 | |||||
L1.ma | -0.73*** | -0.7*** | -0.85* | -0.34 | -1.82*** | -0.71*** |
L2.ma | 0.21 | -0.28 | 0.85*** | |||
AIC | 3934.36 | 3934.26 | 3916.90 | 3918.85 | 3916.88 | 3917.15 |
BIC | 3951.76 | 3956 | 3951.68 | 3940.59 | 3964.70 | 3934.54 |
RECOVERY | FINAL MODEL | |||||
Cons | (1,1,2) | (3,1,1) | (2,1,5) | (8,1,1) | (2,2,1) | (1,2,1) |
L1.ar | 0.96*** | 0.98 | 1.84 | 0.97 | -0.006 | 0.83* |
L2.ar | 0.002 | -0.86 | 0.001 | -0.004 | -0.006 | |
L3.ar | -0.02 | -0.001 | ||||
L4.ar | -0.002 | |||||
L5.ar | -0.003 | |||||
L6.ar | -0.002 | |||||
L7.ar | -0.0003 | |||||
L8.ar | -0.02 | |||||
L1.ma | -0.98 | -0.99*** | -1.88 | -1.06*** | -0.99*** | -0.99*** |
L2.ma | -0.02 | 0.88 | ||||
L3.ma | -0.003 | |||||
L4.ma | 0.008 | |||||
L5.ma | -0.012 | |||||
AIC | 13570.74 | 13572.5 | 13576.69 | 13583.44 | 13560.07 | 13558.08 |
BIC | 13588.13 | 13594.25 | 13611.48 | 13631.28 | 13577.46 | 13571.12 |
The study was limited to data regarding COVID-19 confirmed, death & recovered cases from 13 Caribbean Countries. We were unable to access information on vulnerable populations or demographic information, and therefore the study is limited to projecting only a general trend about Covid-19 for the Caribbean.
Our study forecasts the likelihood of the pandemic over the next two months. It shows an increasing trend for confirmed and recovered COVID-19 cases and a slowing of the number of deaths. This would reflect the gradual easing of the lockdown restrictions. This is consistent with a study conducted by Sumner et al., 2020, who argue that the economic impact of COVID-19 will be drastic, specifically in the Latin America and the Caribbean (LAC) region, with potential short-term health and financial consequences18. Another study by Simbana-Rivera et al., 2020, performs a comparative analysis of the burden of COVID-19 among Latin American and Caribbean countries following government norms such as social distancing and wearing masks to slow the spread of the virus among local communities19. Our study focuses exclusively on the Caribbean and should help Caribbean countries not only prepare themselves for the upcoming impact of the Covid-19 spread but also reduce its economic impact on the very fragile economies of the smaller islands.
Our approach has been validated by several other studies that have used ARIMA modeling for short-term predictions. A study by Benvenuto et al., 2020, focuses on the application of ARIMA modeling on the COVID-19 epidemic. It adds that various institutions will benefit from this forecast as ARIMA modeling involves incidence descriptive analysis of data along with incidence forecast and overall trend analysis20. Another article by Dehesh et al., 2020, shows the importance of prediction models and analyzing the trend of Covid-19 as a global pandemic for various countries16. Chintalapudi et al., 2020 recommend a data-driven model approach and highlight the significance of government policies based upon forecasting and modeling, which can help to reduce COVID-19 cases in Italy. The study commends preventive measures like regular hand washing, disinfection, wearing masks, travel restrictions, and suspension of public gatherings based upon modeling and forecasts21. Although most Caribbean countries are dependent on tourism as a major economic activity, we recommend a guarded approach to open borders the already weak economies may not be able to sustain a fresh wave of the pandemic.
A study by Jenkins et al. discusses Caribbean trade relations with China in terms of foreign direct investment and its impact on the economy22. The consequences of the COVID-19 outbreak have significantly impacted Caribbean countries, even as China continues to be a prominent supporting partner. Another study by Rodrguez-Morales, MacGregor and, Kanagarajah, 2020 on COVID-19 discusses the importance of carefully assessing the situation based on scientific knowledge sharing and coordinated efforts to revitalize and uplift Caribbean communities amid the COVID-19 crisis23. We agree with them and recommend that as countries gear up to open the borders, they should introduce enhanced screening/ monitoring at entry and exit ports.
The outbreak of COVID-19 is a grave concern in the Caribbean region specifically due to the high risk of transmission among the local community. CARPHA has issued certain guidelines for its member states, including the 13 Caribbean countries based on WHO recommendations as mentioned in our study. Our cumulative projected ARIMA model for COVID-19 positive patients, deaths, and recovered cases in the 13 Caribbean countries offer insight for the government and health organizations operating in the region to assess pandemic preparedness plans and revise risk levels based on current and forecasted trends.
Github: COVID-19 Data Repository by the Center for Systems Science and Engineering (CSSE) at Johns Hopkins University, https://github.com/CSSEGISandData/COVID-19. The data was accessed from Jan 22, 2020 until Aug 16, 20212.
The data is maintained by John Hopkins University in collaboration with the Center for Systems Science and Engineering (CSSE) and technical support from ESRI and John Hopkins University Physics Laboratory.
Data are available under the terms of the Creative Commons Attribution 4.0 International license (CC-BY 4.0).
Debjyoti Talukdar: Investigation, Writing- Original draft preparation, Conceptualization, Methodology, Software, Visualization, Data curation
Dr. Vrijesh Tripathi: Supervision, Software, Validation, Writing- Reviewing and Editing, Visualization
Views | Downloads | |
---|---|---|
F1000Research | - | - |
PubMed Central
Data from PMC are received and updated monthly.
|
- | - |
Is the work clearly and accurately presented and does it cite the current literature?
Partly
Is the study design appropriate and is the work technically sound?
Yes
Are sufficient details of methods and analysis provided to allow replication by others?
Yes
If applicable, is the statistical analysis and its interpretation appropriate?
Partly
Are all the source data underlying the results available to ensure full reproducibility?
Yes
Are the conclusions drawn adequately supported by the results?
Partly
Competing Interests: No competing interests were disclosed.
Reviewer Expertise: Applied Statistics and Statistics of Extremes
Alongside their report, reviewers assign a status to the article:
Invited Reviewers | |
---|---|
1 | |
Version 1 21 Oct 21 |
read |
Provide sufficient details of any financial or non-financial competing interests to enable users to assess whether your comments might lead a reasonable person to question your impartiality. Consider the following examples, but note that this is not an exhaustive list:
Sign up for content alerts and receive a weekly or monthly email with all newly published articles
Already registered? Sign in
The email address should be the one you originally registered with F1000.
You registered with F1000 via Google, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Google account password, please click here.
You registered with F1000 via Facebook, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Facebook account password, please click here.
If your email address is registered with us, we will email you instructions to reset your password.
If you think you should have received this email but it has not arrived, please check your spam filters and/or contact for further assistance.
Comments on this article Comments (0)