COVID-19 cases prediction using regression and novel SSM model for non-converged countries

  • Rupali Patil KJ Somaiya College of Engineering, Mumbai - 400077, India (IN)
  • Umang Patel KJ Somaiya College of Engineering, Mumbai - 400077, India (IN)
  • Tushar Sarkar KJ Somaiya College of Engineering, Mumbai - 400077, India (IN)
Keywords: COVID-19, Prediction, SSM, SARIMAX, Linear Regression

Viewed = 1022 time(s)


Anticipating the quantity of new associated or affirmed cases with novel coronavirus ailment 2019 (COVID-19) is critical in the counteraction and control of the COVID-19 flare-up. The new associated cases with COVID-19 information were gathered from 20 January 2020 to 21 July 2020. We filtered out the countries which are converging and used those for training the network. We utilized the SARIMAX, Linear regression model to anticipate new suspected COVID-19 cases for the countries which did not converge yet. We predict the curve of non-converged countries with the help of proposed Statistical SARIMAX model (SSM). We present new information investigation-based forecast results that can assist governments with planning their future activities and help clinical administrations to be more ready for what's to come. Our framework can foresee peak corona cases with an R-Squared value of 0.986 utilizing linear regression and fall of this pandemic at various levels for countries like India, US, and Brazil. We found that considering more countries for training degrades the prediction process as constraints vary from nation to nation. Thus, we expect that the outcomes referenced in this work will help individuals to better understand the possibilities of this pandemic.


Download data is not yet available.


Y.-C. Liu, R.-L. Kuo, and S.-R. Shih, “COVID-19: The first documented coronavirus pandemic in history,” Biomedical Journal, May 2020, doi: 10.1016/

A. S. Ahmar and E. Boj del Val, “The date predicted 200.000 cases of Covid-19 in Spain,” J. Appl. Sci. Eng. Technol. Educ., vol. 2, no. 2, pp. 188–193, Jun. 2020, doi: 10.35877/454ri.asci22102

Qin, L.; Sun, Q.; Wang, Y.; Wu, K.-F.; Chen, M.; Shia, B.-C.; Wu, S.-Y. Prediction of Number of Cases of 2019 Novel Coronavirus (COVID-19) Using Social Media Search Index. Int. J. Environ. Res. Public Health , vol. 17, no. 7, p. 2365, Mar. 2020, doi: 10.3390/ijerph17072365

F. Petropoulos and S. Makridakis, “Forecasting the novel coronavirus COVID-19,” PLoS ONE, vol. 15, no. 3, p. e0231236, Mar. 2020, doi: 10.1371/journal.pone.0231236

Li, Lixiang, Zihang Yang, Zhongkai Dang, Cui Meng, Jingze Huang, Haotian Meng, Deyu Wang et al. "Propagation analysis and prediction of the COVID-19." Infectious Disease Modelling, vol. 5, pp. 282–292, 2020, doi: 10.1016/j.idm.2020.03.002

S. Boccaletti, W. Ditto, G. Mindlin, and A. Atangana, “Modeling and forecasting of epidemic spreading: The case of Covid-19 and beyond,” Chaos, Solitons & Fractals, vol. 135, p. 109794, Jun. 2020, doi: 10.1016/j.chaos.2020.109794

T. Chakraborty and I. Ghosh, “Real-time forecasts and risk assessment of novel coronavirus (COVID-19) cases: A data-driven analysis,” Chaos, Solitons & Fractals, vol. 135, p. 109850, Jun. 2020, doi: 10.1016/j.chaos.2020.109850

M. J. Kane, N. Price, M. Scotch, and P. Rabinowitz, “Comparison of ARIMA and Random Forest time series models for prediction of avian influenza H5N1 outbreaks,” BMC Bioinformatics, vol. 15, no. 1, Aug. 2014, doi: 10.1186/1471-2105-15-276

Z. Liu, P. Magal, O. Seydi, and G. Webb, “A COVID-19 epidemic model with latency period,” Infectious Disease Modelling, vol. 5, pp. 323–337, 2020, doi: 10.1016/j.idm.2020.03.003

F. A. B. Hamzah, C. Lau, H. Nazri, D. V. Ligot, G. Lee, and C. L. Tan. "CoronaTracker: worldwide COVID-19 outbreak data analysis and prediction." WHO Press, Mar. 19, 2020, doi: 10.2471/blt.20.255695

M. Azarafza, M. Azarafza, and H. Akgün, “Clustering method for spread pattern analysis of corona-virus (COVID-19) infection in Iran,” J. Appl. Sci. Eng. Technol. Educ., vol. 3, no. 1, pp. 1–6, Jun. 2020, doi: 10.35877/454ri.asci31109

R. Salgotra, M. Gandomi, and A. H. Gandomi, “Time Series Analysis and Forecast of the COVID-19 Pandemic in India using Genetic Programming,” Chaos, Solitons & Fractals, vol. 138, p. 109945, Sep. 2020, doi: 10.1016/j.chaos.2020.109945

K. Roosa, Y. Lee, R. Luo, A. Kirpich, R. Rothenberg, J. M. Hyman, P. Yan, and G. Chowell. “Real-time forecasts of the COVID-19 epidemic in China from February 5th to February 24th, 2020,” Infectious Disease Modelling, vol. 5, pp. 256–263, 2020, doi: 10.1016/j.idm.2020.02.002

J. N. Dhanwant and V. Ramanathan. "Forecasting COVID 19 growth in India using Susceptible-Infected-Recovered (SIR) model." arXiv preprint arXiv:2004.00696, 2020

G. Pandey, P. Chaudhary, R. Gupta, and S. Pal. "SEIR and Regression Model based COVID-19 outbreak predictions in India." arXiv preprint arXiv:2004.00958, 2020

R. Gupta and S. K. Pal, “Trend Analysis and Forecasting of COVID-19 outbreak in India.” medRxiv, Mar. 30, 2020, doi: 10.1101/2020.03.26.20044511

R. Ranjan, “Estimating the Final Epidemic Size for COVID-19 Outbreak using Improved Epidemiological Models.” medRxiv, Apr. 16, 2020, doi: 10.1101/2020.04.12.20061002

Received 2020-07-31
Accepted 2020-12-02
Published 2021-02-16
How to Cite
R. Patil, U. Patel, and T. Sarkar, “COVID-19 cases prediction using regression and novel SSM model for non-converged countries”, J. Appl. Sci. Eng. Technol. Educ., vol. 3, no. 1, pp. 74-81, Feb. 2021.