Machine learning predictions of COVID-19 second wave end-times in Indian states

Kondapalli, Anvesh Reddy; Koganti, Hanesh; Challagundla, Sai Krishna; Guntaka, Chaitanya Suhaas Reddy; Biswas, Soumyajyoti

doi:10.1007/s12648-021-02195-x

Machine learning predictions of COVID-19 second wave end-times in Indian states

Original Paper
Published: 01 October 2021

Volume 96, pages 2547–2555, (2022)
Cite this article

Download PDF

Indian Journal of Physics Aims and scope Submit manuscript

Machine learning predictions of COVID-19 second wave end-times in Indian states

Download PDF

Anvesh Reddy Kondapalli¹,
Hanesh Koganti¹,
Sai Krishna Challagundla¹,
Chaitanya Suhaas Reddy Guntaka¹ &
…
Soumyajyoti Biswas²

1132 Accesses
1 Citation
1 Altmetric
Explore all metrics

Abstract

The estimate of the remaining time of an ongoing wave of epidemic spreading is a critical issue. Due to the variations of a wide range of parameters in an epidemic, for simple models such as Susceptible-Infected-Removed (SIR) model, it is difficult to estimate such a time scale. On the other hand, multidimensional data with a large set attributes are precisely what one can use in statistical learning algorithms to make predictions. Here we show, how the predictability of the SIR model changes with various parameters using a supervised learning algorithm. We then estimate the condition in which the model gives the least error in predicting the duration of the first wave of the COVID-19 pandemic in different states in India. Finally, we use the SIR model with the above mentioned optimal conditions to generate a training data set and use it in the supervised learning algorithm to estimate the end-time of the ongoing second wave of the pandemic in different states in India.

Comparative Study on Predictive Mathematical Models for Risk Assessment of nCOVID-19 Pandemic in India

Inefficiency of SIR models in forecasting COVID-19 epidemic: a case study of Isfahan

Article Open access 25 February 2021

Shiva Moein, Niloofar Nickaeen, … Yousof Gheisari

Forecasting the long-term trend of COVID-19 epidemic using a dynamic model

Article Open access 03 December 2020

Jichao Sun, Xi Chen, … Yefeng Zheng

1 Introduction

Since the outbreak of the COVID-19 pandemic [1], the estimate of a time-scale for the end of a wave of pandemic outbreak has undoubtedly become an outstanding challenge. Nevertheless, due to the variations of a wide range of parameters, such as the rate of spreading, the contact network of the individuals, various mitigation measures, etc., it is very difficult to make such an estimate [2–11]. However, a multidimensional set of data is often used in statistical learning approaches for making predictions [12, 13]. Indeed, such predictions have been attempted in a wide range of cases, such as financial time series, weather data, medical applications and many other physical systems [14–17]. There have been multiple earlier attempts in using machine learning approaches for predicting epidemic spreading in the context of COVID-19 (see e.g., [21, 22]) as well. However, to have a proper estimate, a large set of training data is needed to be fed to the supervised learning algorithm. This is often a major hurdle to overcome for a pandemic such as the present one, the like of which is not seen in a century.

To address this issue, we first consider a simplified model of epidemic spreading, called the Susceptible-Infected-Removed (SIR) model [18–20] and estimate its predictability using a supervised learning algorithm, by varying various parameters of the SIR model. We then find the condition under which the model is best suited to make ‘predictions’ about the first wave of the COVID-19 pandemic in different states in India. Since in most of these cases, the first and the second waves are separated by a period of low infection rates, the end-time of the first wave can be well defined. Therefore, it is possible to make an error estimate for the ‘prediction’ of the first wave end-time. The optimal condition of the model that ‘predicts’ the end of the first wave can then be used to generate a ‘synthetic’ training data set of substantial size. This ‘synthetic’ training data set can then be used to make predictions for the ongoing second wave. The use of synthetic data for enhancing prediction capability of ML algorithms is a well known technique (see e.g., [23]). By increasing the size of the training data set substantially, this technique enables the ML algorithm to make stable predictions.

Certainly, there are multiple issues in using the first-wave data for the optimization of the training set. Particularly, the two waves are, of course, different in several aspects: effects of vaccinations, changes in the norms of travel restrictions, presence of mutant variants of the virus, etc. One outcome of these variations would be the changes in the maximum values of the daily infection rates between the two waves. As is evident from the data in India [24], the peak height of the second wave was about four times larger than the peak height of the first wave. Therefore, we normalize the data by the peak height, which necessarily assumes that the peak for the second wave has passed, in each of the cases where we make the end-time predictions.

The rest of the paper is organized as follows: First we describe the SIR model and the machine learning methods used for making the predictions in the model (Sect. 2). In Sect. 3, we present the simulation results, describing the variations in predictability of the SIR model under different conditions (testing rate, site dilution). Then in Sec. 4, we use the ML algorithm to make predictions of the end-time of the second wave of the pandemic for some states in India. Finally, we discuss and conclude in Sect. 5.

2 Model and methods

The SIR model is a well studied model for epidemic spreading [18]. Although a simple model, this and its variants have been widely popular for epidemic spreading studies [19, 20]. The model assumes that the total population is divided into three groups - Susceptible: denoting the individuals who can get infected by the virus but are not yet infected, Infected: denoting the individuals who are currently infected and can infect the susceptible population and Removed: denoting the population who were already infected by the virus and do not affect the evolution of the spreading dynamics any more. For the case of COVID-19, it is not entirely conclusive whether an individual can get infected more than once. However, in our study we have assumed it to be the case.

There are several variants of SIR-like models that are applied in the case of COVID-19 spreading, specifically those concerning the imposition of social distancing measures and travel restrictions (see e.g., [25–28], see also [29] for a review on effects of travel restrictions). However, in our case we have considered the simplest version of the SIR model, since the data for the additional states (e.g., exposed, isolated individuals) are not available for comparisons.

We consider the model on a two dimensional square lattice, where each site represents the location of an individual who can be in one of the three states mentioned above. An infected individual can, with certain probability, infect one of their eight neighbors (four nearest neighbors and four diagonal neighbors) if that neighbor is in the susceptible state. An infected individual remains in that state for a given duration of time, during which they can infect other susceptible individuals. Following that duration, the infected individual enters the Removed state, where they no longer participate in the spreading dynamics.

In one time step of the simulation, every individual is selected once, and their states are attempted for a possible update. The updates are done in a parallel updating scheme, such that a given update comes into effect in the following time step. If an individual is in the susceptible state and one of the eight neighbors is infected, then the susceptible individual can be infected with probability p. If an infected individual has remained in that state for \(\tau\) time steps, then that individual is put in the removed state. Here we keep \(\tau =14\) , and the value of p is fixed at a randomly chosen value between 0.3 to 0.8 from a uniform distribution for each realization of the model.

The number of infected individuals at a time t, denoted by I(t), represents what is usually referred to as the ‘active cases’. The time derivative of the susceptible (S(t)) individuals, dS(t)/dt, is essentially the number of new infections in a day (at t). Both of these quantities start from a low value. Initially, these quantities are identical and start from a low value. The initial infection is chosen randomly and uniformly between 10 and 20 for each simulation, independent of the system size. Both of these quantities then increase with time, reach a peak and then eventually decrease to zero. The model does not show multiple waves of infection rates, nor does it account for the effects of vaccination or restriction imposed in interactions.

While it is straightforward to get an exact solution for the mean field version of the model and also to numerically estimate the above mentioned quantities in other topologies (including the square lattice considered here), the actual situation is far more complex, and the available data sets are limited. Particularly, just the absence of tests for the individuals without symptoms and/or access to such medical facilities, would distort the data for the number of infections and other related quantities. Also, the topology of a square lattice is a simplified one and would at the very least include ‘disorder’ in terms of unoccupied sites. We first investigate the effects of these factors in predictability of the SIR model. For the prediction, we use a supervised machine learning algorithm, particularly the Random Forest algorithm. This is an ensemble of decision tress, and the predictions in the model are made using the majority of the predictions of the decision trees. The various attributes that we use for the training of the algorithm are: daily infection, daily recovery and the number of active cases at a particular time. The target variable for the prediction is the remaining time before the daily infection number goes below 5% of the peak value. In the Random-forest, we used 1000 estimators and kept the maximum depth at 15. The results are stable with small variations around these parameters. Following the training of the algorithm with a training set of 200 ensembles (each ensemble represents the full time series of the different quantities mentioned from the start to the end of the spreading dynamics), each having a randomly chosen infection rate and initial infection number, in the way outlined above. Then the trained algorithm is used to make predictions for a different set of 100 ensembles. Of course, there are multiple other factors that are not captured in the model or data; for example the effects of social distancing measures, quarantines, asymptomatic individuals, exposed individuals who were not showing symptoms yet and so on. However, since the relevant data for these parameters are not available for comparisons, we could not use it for this study.

In Fig. 1, a typical time series of the infection rate (normalized by the peak height) is shown. The actual remaining time and the machine learning (ML) predicted remaining times, at every instance, are also shown. The root-mean-squared fluctuations between the actual remaining time and the predicted remaining time at every instance give an estimate for the error in prediction. In the following, we first estimate the error in predictions i.e., the efficiency of the ML predictions under different conditions of variable testing rate and disorder in the SIR model. Then we use the model as training set to make predictions for the end of the second wave of the COVID-19 pandemic in different states in India.

3 Simulation results

Here we describe the simulation results of the SIR model of epidemic spreading and estimate the variations of its predictability with different parameters of the model, using supervised machine learning algorithm.

3.1 ML predictability of SIR model with variable testing

As indicated before, a major source of distortion in the data for the pandemic is the limited testing resources available. This was especially apparent during the first wave of the pandemic (see e.g., [31]). Therefore, it is useful to understand, even for this simplified model, how does incomplete testing and/or variable testing rate affect the measurements in the model so as to affect, in turn, the predictability of the model.

We check this effect in two different ways. First we assume that only a fraction (v) of the total population can get tested i.e., only that fraction of the population have access to the necessary medical facilities. While the underlying SIR dynamics runs with the three possible states (S, I and R) for each individual, the measurements are made, at each time, only on the v fraction of the total population, chosen randomly and kept fixed in time for that realization.

Figure 2 depicts the effect of the various values of v, between 10% (\(v=0.1\)) to 100% (\(v=1\)) testing. While the (apparent) number of daily infection is very sensitive to the value of v, when scaled by this factor (Fig. 2b), the curves fall on top of each other, with a varying degree of fluctuations. The consequent error in the predictions using the ML algorithm, however, only varies weakly (Fig. 2c) with v. This gives an important conclusion that while drastically sub-sampling, the predictability remains almost the same in the model, as long as a macroscopic fraction of the randomly chosen population is tested.

Secondly, it is also known that the rate of testing is not a fixed quantity over the duration of the pandemic. Particularly, it is often dependent on the rate of positive results obtained in daily testing. Therefore, we also look at the variation due to a time dependent value of v. We vary v between a lower bound \(v_{min}\) and an upper bound \(v_{max}=0.5\), linearly dependent on the daily infection rate. Other than the linear dependence of v within this range, it is not allowed to fall below or increase above, the fixed threshold values. The individuals are again randomly selected for testing at each step, but now with a time dependent value of v. Fig. 3 shows the results for this case. In Fig. 3a, the time variation of the daily infections are shown for various values of \(v_{min}\) and \(v_{max}\). In Fig. 3b, the actual data for number of testing and the corresponding daily infections are shown for various states in India, justifying the choice of the linear variation (indicated by the straight line). Nevertheless, there is hardly any systematic variation in the error (hence also the predictability) with \(v_{min}\). This is also an interesting observation that while the time dependent testing rate can introduce fluctuations in the data, ultimately it does not translate to a lower predictability, as long as it is made sure that a macroscopic fraction of the individuals is always tested.

3.2 ML predictability of SIR model with site dilution

As mentioned before, topology of the contact network of the individuals can play a crucial role in the spreading dynamics. So far we have kept that to be very simple fully occupied two dimensional square lattice. But such an orderly arrangement is not realistic. As a simple way to introduce disorder, we remove a fraction q of the sites i.e., there are no individuals occupying that fraction of the sites This modification, of course, introduces a fluctuation that diverges near the critical point \(q_c\approx 0.4\) of site percolation [32] (see also [33] for percolation threshold with longer than nearest neighbor connections). It is generally known that a system with higher disorder is relatively more predictable through machine learning, compared to the systems having less disorder [17]. It is also known that the distribution of population in a city follow a fractal character [34], which will happen here near the percolation threshold. It is, therefore, interesting to study the variation of the predictability when the SIR model is simulated in a site diluted lattice.

Figure 4 shows the simulation results for the site diluted lattice. It is interesting to note that even when the infection curves are scaled by the corresponding maximum values, they do not overlap. Indeed, there is a non-monotonic variation in the duration upto which the dynamics run, with the dilution fraction. The corresponding errors, scaled by the error obtained without ML i.e., just considering the average duration of the training set as the predicted duration for each testing set, show a non-monotonic variation with the dilution fraction. A system size dependence shows that the minimum point of the error tends toward the critical percolation threshold, as the system size is increased. Therefore, we conjecture that the highest disorder in the model i.e., the percolation critical point, is the maximally predictable point as well. This is an interesting observation, since as mentioned before, at the percolation point, the occupied site form a fractal structure. As mentioned before, this mimics the fractal nature of the population distributions in cities [34], although with a different fractal dimension.

In Fig. 5, the locations of infected sites and the corresponding infection times are shown (in color gradient). It is clear that for the dilution fraction around 0.6, the infected locations look like a fractal. It is known that the spatial arrangements of COVID-19 spreading indeed follow a fractal structure [35]. Therefore, it is significant that near the point where the infection spreading is fractal-like, the matching with first-wave prediction is the highest (see below). Finally, the marginally connected sites are infected much later, delaying the decay process of the daily infection curve (see Fig. 4).

4 Application: Predictions of end-time of second wave in some Indian states

So far we have discussed the predictability of the SIR model using supervised machine learning. We have also seen that the predictability depends on the site dilution fraction in the model when simulated on a square lattice. Here we attempt in using the SIR model as a training set and then make predictions for the end-time of the second waves in eight Indian states where the total infection has crossed one million. These states are: Andhra Pradesh (AP), Delhi (DL), Karnataka (KA), Kerala (KL), Maharashtra (MH), Tamil Nadu (TN), Uttar Pradesh (UP) and West Bengal (WB).

In Fig. 6, we see that the first and second waves are more or less distinct in these states–separated by a low daily infection rate. First we use the SIR model with various site dilution fractions and make ‘predictions’ about the end-time of the first wave. Knowing the actual end-time of the first waves in these states, it is possible to estimate the errors in those predictions (Fig. 6(inset)). It is seen that the error is minimum for the dilution fraction 0.55. We therefore use the SIR model with dilution fraction 0.55 to generate a training set (500 sets) and then feed the data for the second wave to make a prediction about the end-time. In doing so, one obvious issue is with the peak height, which are very much different between the first and the second waves and also among the different states. We, therefore, normalize the training as well as the testing data by the corresponding peak heights. One obvious assumption, therefore, is that the peak for the second wave has passed, which is obvious in many of the states and are also indicative in the rest of the states.

We make another set of predictions by using the first waves as the training data. It is remarkable that the two sets of predictions are very close to each other. However, in making the final prediction (Fig. 7, Table 1), we use the training set of the SIR model.

The errors in the remaining time were estimated from the errors of the straight-line fit of the remaining time line. This has two components, the errors in the pre-factor and the errors in the slope. The final errors were calculated by taking the combinations of the extreme values of these two errors, which has resulted in asymmetric error bars in some cases.

Table 1 The predicted end-dates for the second wave in different states and the corresponding errors. The errors are higher for the states where the infection rates are close to the peak (data as of May 20, 2021 [24, 30])

Full size table

5 Conclusions

We have reported the variations in the predictability of the SIR model of epidemic spreading with different parameters of the model, using supervised machine learning algorithm. It is interesting to note that the predictions for the end-time in the model are remarkably stable, even when only a small fraction (10%) of the individuals are tested for the infection. The predictions are also stable when the testing rate vary with time–linearly with the positivity rate of the testing within a given range. However, the predictability changes substantially when a disorder is introduced in terms of site dilution in the model i.e., some positions are not occupied by any individual. Particularly, the relative predictability of the model is the highest (error in prediction is the lowest) when the site dilution fraction approaches the percolation critical point (see Fig. 4). In that case, the underlying lattice structure approaches a fractal and the fluctuations in the cluster size diverges with system size [32]. It is seen before that the predictability using ML approaches increases with the increase in the disorder in the system (see e.g., [17]). Indeed, the fluctuations in the time series of the various attributes used for the ML algorithm have richer characteristics (carrying more information), and consequently, the training of the algorithm is better. Also, it is interesting to note that the spatial distribution of population in cities is fractal in nature [34], although not necessarily of the same fractal dimension as that of the site percolation. Nevertheless, the fluctuations introduced in the daily infection rate and other related quantities due to the delayed spreading of the infections in marginally connection regions, would introduce qualitatively similar effects in any fractal geometry. Therefore, a fractal geometry is perhaps better suited to model the epidemic spreading.

We then use the model and the ML approach to make predictions of the end-time of the ongoing second wave of the COVID-19 pandemic in eight Indian states, where the total number of infections are over one million (see Table 1). In doing so, we first need to overcome the lack of training data for the ML algorithm. We first note that the first and second waves of the pandemic in India are somewhat separated by a relatively low daily infection rate. Therefore, we take the data for the first wave and make ‘predictions’ about its end-time using the SIR model as the training set. As the predictability of the SIR model is already shown to be sensitive to the lattice dilution, we estimate the dilution fraction for which the error in the ‘predictions’ for the first wave is minimum. We then use the SIR model with that dilution fraction to generate the training data set for making predictions about the end-time of the ongoing second wave. This approach assumes that the statistical nature of the fluctuations in the first and second waves would be similar once those are scaled by the respective peak infection rates. This in turn necessarily assumes that the peak infection for the second wave has already past, which indeed seems to be the case (see Fig. 6). In places where the peak has just reached (until May 20, 2021), the errors in predictions are higher. It also does not consider the effects of vaccinations, new mutant variants of the virus and changes in the travel restriction norms. Nevertheless, use of this ‘synthetic’ training data set enables the ML algorithm to make predictions for the end-time, which is otherwise difficult to do due to the lack of training data sets.

It is useful to note here that ML algorithms have been used widely in various aspects of COVID-19, including diagnosis to predictions on infections (see e.g., [36, 37]). It was also used for imposition of social distancing measures and the possible effects of early lifting of such measures (see e.g., [29] for a review). In our case, we do not implement such measures in the simulation due to the lack of data for the corresponding comparisons.

In conclusion, we note that the epidemic spreading in the SIR model on a two dimensional square lattice can be well predicted by supervised machine learning algorithms. The predictability of the model is sensitive to the site dilution fraction of the model and becomes the highest near the percolation critical point. An optimal condition for predictability can be obtained by tuning the site dilution fraction in the model that minimizes the prediction errors for the first wave of the COVID-19 pandemic. The optimized model can then be used to make predictions of the end-times for the ongoing second wave of the pandemic in different states in India with reasonable success.

Table 2 The infection numbers in states where the predicted end-date (when infection was supposed to be 5% of peak) has passed

Full size table

Note added in the revised version: Since the work presented here (in the first submission) represent predictions made using data up to May 20, 2021, several states have now passed the predicted end-dates. In Table 2, therefore, we provide the six states for which the predicted end-date has passed and the value of the infection number on that particular date and what percentage of the peak was reached (predicted end-time was defined as 5% of peak).

References

P Zhou et al. Nature 579 270 (2020)
Article ADS Google Scholar
M Chinazzi et al. Sci. 368 395 (2020)
Article ADS Google Scholar
J Dehning et al. Sci. 369 6500 (2020)
M Kraemer et al. Sci. 368 493 (2020)
Article ADS Google Scholar
Z Ceylan Sci. Total Environ. 729 138817 (2020)
Article ADS Google Scholar
C Anastassopoulou et al. PLoS ONE 15 e0230405 (2020)
Article Google Scholar
T Chakraborty, I Ghosh Chaos Solitons & Fractals 135 109850 (2020)
Article Google Scholar
K Biswas, A Khaleque, P Sen arxiv: 2003:07063
S Khajanchi et al. arxiv: 2005.06286, (2020)
S Chatterjee et al. Indian Journal of Physics (2020) https://doi.org/10.1007/s12648-020-01766-8
Article Google Scholar
S Chatterjee et al. Indian J. Phys. (2020) DOI: https://doi.org/10.1007/s12648-020-01928-8
Article Google Scholar
R A Olshen, C J Stone Classification and regression trees, CRC Press: London (1984)
MATH Google Scholar
T Hastie, R Tibshirani, J Friedman The elements of statistical learning: Data mining, inference and predictions (New York: Springer) (2001)
Book Google Scholar
B Rouet-Leduc et al. Geophys. Res. Lett. 44 9276 (2017)
Article ADS Google Scholar
H Salmenjoki, M J Alava, L Laurson Nat. Commun. 9 5307 (2018)
Article ADS Google Scholar
M van der Baan, C Jutten Geophysics 65 1032 (2000)
Article Google Scholar
S Biswas, D F Castellanos, M Zaiser Sci. Rep. 10 16910 (2020)
Article ADS Google Scholar
W Kermack, A McKendrick Proc. R. Soc. A 115 700 (1927)
ADS Google Scholar
N T J Bailey The mathematical theory of infectious diseases, 2nd edition, Griffin, London (1975)
MATH Google Scholar
D Daley, J Gani Epidemic modeling: An introduction, Cambridge University Press, NY (2005)
MATH Google Scholar
V Chimmula, L Zhang Chaos Solitons & Fractals 135 109864 (2020)
Article Google Scholar
P Bedi, S Dhiman, P Gole, N Gupta, V Jindal SN Computer Science 2 224 (2021)
Article Google Scholar
N Patki, R Wedge, K Veeramachaneni The Synthetic Data Vault, 2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA), pp. 399-410, (2016)
All data used in this work comes from: https://api.covid19india.org/
L C G Rogers arxiv: 2004.12462 (2020)
M A Pires et al. Int. J. Mod. Phys. C 32 2150107 (2021)
Article ADS Google Scholar
N Hoertel Nat. Medicine 26 1417 (2020)
Article Google Scholar
M Pedersen, M Meneghini DOI:https://doi.org/10.13140/RG.2.2.11753.85600(2020)
S Biswas, A Kr Mandal arxiv: 2105.14294 (2021)
The predictions for UP and Delhi already seem to work reasonably well: For Delhi, the peak infection number was 28395 on April 20, 2021. 5% of that means 1420. Delhi had: 1491 cases on 26th May, 1072 cases on 27th May, 1141 cases on 28th May. Prediction for Delhi was 28th May, with error of +/- 2 days. For Uttar Pradesh, the peak infection was 37944 on 24th April, 2021. 5% of that means 1897. UP had: 3179 cases on 27 May, 2276 cases on 28 May, 2014 cases on 29 May. Prediction for UP was 27May, with error -2days to +3 days
M J Binnicker J. Clinical Microbiology 58 e01695 (2020)
Article Google Scholar
D Stauffer and A Aharony Introduction to Percolation Theory, 2nd ed. Taylor & Francis, London, (1994)
MATH Google Scholar
K Malarz, S Galam Phys. Rev. E 71 016125 (2005)
Article ADS Google Scholar
M Batty, P A Longley Fractal cities: a geometry of form and function, London, Academic Press (1994)
MATH Google Scholar
M Abbasi et al. Chaos, Solitons & Fractals 140 110119 (2020)
Article Google Scholar
O Shahid J. Biomed. Inf. 117 103751 (2021)
Article Google Scholar
S Kushwaha et al. J. Indust. Integr. Manag. 5 453 (2020)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, SRM University-AP, Amaravati, Andra Pradesh, 522502, India
Anvesh Reddy Kondapalli, Hanesh Koganti, Sai Krishna Challagundla & Chaitanya Suhaas Reddy Guntaka
Department of Physics, SRM University-AP, Amaravati, Andra Pradesh, 522502, India
Soumyajyoti Biswas

Authors

Anvesh Reddy Kondapalli
View author publications
You can also search for this author in PubMed Google Scholar
Hanesh Koganti
View author publications
You can also search for this author in PubMed Google Scholar
Sai Krishna Challagundla
View author publications
You can also search for this author in PubMed Google Scholar
Chaitanya Suhaas Reddy Guntaka
View author publications
You can also search for this author in PubMed Google Scholar
Soumyajyoti Biswas
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Soumyajyoti Biswas.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kondapalli, A.R., Koganti, H., Challagundla, S.K. et al. Machine learning predictions of COVID-19 second wave end-times in Indian states. Indian J Phys 96, 2547–2555 (2022). https://doi.org/10.1007/s12648-021-02195-x

Download citation

Received: 21 June 2021
Accepted: 26 August 2021
Published: 01 October 2021
Issue Date: July 2022
DOI: https://doi.org/10.1007/s12648-021-02195-x

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Machine learning predictions of COVID-19 second wave end-times in Indian states

Abstract

Similar content being viewed by others

Comparative Study on Predictive Mathematical Models for Risk Assessment of nCOVID-19 Pandemic in India

Inefficiency of SIR models in forecasting COVID-19 epidemic: a case study of Isfahan

Forecasting the long-term trend of COVID-19 epidemic using a dynamic model

1 Introduction

2 Model and methods

3 Simulation results

3.1 ML predictability of SIR model with variable testing

3.2 ML predictability of SIR model with site dilution

4 Application: Predictions of end-time of second wave in some Indian states

5 Conclusions

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Machine learning predictions of COVID-19 second wave end-times in Indian states

Abstract

Similar content being viewed by others

Comparative Study on Predictive Mathematical Models for Risk Assessment of nCOVID-19 Pandemic in India

Inefficiency of SIR models in forecasting COVID-19 epidemic: a case study of Isfahan

Forecasting the long-term trend of COVID-19 epidemic using a dynamic model

1 Introduction

2 Model and methods

3 Simulation results

3.1 ML predictability of SIR model with variable testing

3.2 ML predictability of SIR model with site dilution

4 Application: Predictions of end-time of second wave in some Indian states

5 Conclusions

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation