COVID-19 pandemic prediction model based on machine learning in selected regions of the Russian Federation

Background. Prediction of the new coronavirus infection (COVID-19) spread is important to take timely measures and initiate systemic preventive and anti-epidemic actions both at the regional and state levels to reduce morbidity and mortality.Objective: to develop a model for short-term forecasting o...

Full description

Saved in:
Bibliographic Details
Main Authors: D. V. Gavrilov, R. V. Abramov, А. V. Kirilkina, А. А. Ivshin, R. E. Novitskiy
Format: Article
Language:Russian
Published: IRBIS LLC 2021-10-01
Series:Фармакоэкономика
Subjects:
Online Access:https://www.pharmacoeconomics.ru/jour/article/view/541
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850050203341553664
author D. V. Gavrilov
R. V. Abramov
А. V. Kirilkina
А. А. Ivshin
R. E. Novitskiy
author_facet D. V. Gavrilov
R. V. Abramov
А. V. Kirilkina
А. А. Ivshin
R. E. Novitskiy
author_sort D. V. Gavrilov
collection DOAJ
description Background. Prediction of the new coronavirus infection (COVID-19) spread is important to take timely measures and initiate systemic preventive and anti-epidemic actions both at the regional and state levels to reduce morbidity and mortality.Objective: to develop a model for short-term forecasting of COVID-19 cases and deaths in the Russian Federation.Material and methods. The data for the model training were collected from the Stopcoronavirus.rf and Johns Hopkins University portals. It included 13 features to assess the infection dynamics and mortality, as well as the rate of morbidity and mortality in different countries and certain regions of the Russian Federation. The model was trained by the CatBoost gradient boosting method and retrained daily with updated data.Results. The forecast model of COVID-19 cases and deaths for the period of up to 14 days was created. The mean absolute percentage error (MAPE) estimate of the model’s accuracy ranged from 2.3% to 24% for 85 regions of the Russian Federation. The advantage of the CatBoost machine learning method over linear regression was shown using the example of the root mean square error (RMSE) value. The model showed less error for regions with a large population than for less populated ones.Conclusion. The model can be used not only to predict the pandemic of the novel coronavirus infection but also to control and assess the spread of diseases from the group of new infections at their emergence, peak incidence, and stabilization period.
format Article
id doaj-art-43e1b999fc5246b6a620fe4f2e5746ce
institution DOAJ
issn 2070-4909
2070-4933
language Russian
publishDate 2021-10-01
publisher IRBIS LLC
record_format Article
series Фармакоэкономика
spelling doaj-art-43e1b999fc5246b6a620fe4f2e5746ce2025-08-20T02:53:31ZrusIRBIS LLCФармакоэкономика2070-49092070-49332021-10-0114334235610.17749/2070-4909/farmakoekonomika.2021.108352COVID-19 pandemic prediction model based on machine learning in selected regions of the Russian FederationD. V. Gavrilov0R. V. Abramov1А. V. Kirilkina2А. А. Ivshin3R. E. Novitskiy4K-SkAI LLCK-SkAI LLCRepublican Infectious Diseases HospitalPetrozavodsk State UniversityK-SkAI LLCBackground. Prediction of the new coronavirus infection (COVID-19) spread is important to take timely measures and initiate systemic preventive and anti-epidemic actions both at the regional and state levels to reduce morbidity and mortality.Objective: to develop a model for short-term forecasting of COVID-19 cases and deaths in the Russian Federation.Material and methods. The data for the model training were collected from the Stopcoronavirus.rf and Johns Hopkins University portals. It included 13 features to assess the infection dynamics and mortality, as well as the rate of morbidity and mortality in different countries and certain regions of the Russian Federation. The model was trained by the CatBoost gradient boosting method and retrained daily with updated data.Results. The forecast model of COVID-19 cases and deaths for the period of up to 14 days was created. The mean absolute percentage error (MAPE) estimate of the model’s accuracy ranged from 2.3% to 24% for 85 regions of the Russian Federation. The advantage of the CatBoost machine learning method over linear regression was shown using the example of the root mean square error (RMSE) value. The model showed less error for regions with a large population than for less populated ones.Conclusion. The model can be used not only to predict the pandemic of the novel coronavirus infection but also to control and assess the spread of diseases from the group of new infections at their emergence, peak incidence, and stabilization period.https://www.pharmacoeconomics.ru/jour/article/view/541artificial intelligencemachine learninggradient boostingepidemiological forecastcovid-19 pandemic
spellingShingle D. V. Gavrilov
R. V. Abramov
А. V. Kirilkina
А. А. Ivshin
R. E. Novitskiy
COVID-19 pandemic prediction model based on machine learning in selected regions of the Russian Federation
Фармакоэкономика
artificial intelligence
machine learning
gradient boosting
epidemiological forecast
covid-19 pandemic
title COVID-19 pandemic prediction model based on machine learning in selected regions of the Russian Federation
title_full COVID-19 pandemic prediction model based on machine learning in selected regions of the Russian Federation
title_fullStr COVID-19 pandemic prediction model based on machine learning in selected regions of the Russian Federation
title_full_unstemmed COVID-19 pandemic prediction model based on machine learning in selected regions of the Russian Federation
title_short COVID-19 pandemic prediction model based on machine learning in selected regions of the Russian Federation
title_sort covid 19 pandemic prediction model based on machine learning in selected regions of the russian federation
topic artificial intelligence
machine learning
gradient boosting
epidemiological forecast
covid-19 pandemic
url https://www.pharmacoeconomics.ru/jour/article/view/541
work_keys_str_mv AT dvgavrilov covid19pandemicpredictionmodelbasedonmachinelearninginselectedregionsoftherussianfederation
AT rvabramov covid19pandemicpredictionmodelbasedonmachinelearninginselectedregionsoftherussianfederation
AT avkirilkina covid19pandemicpredictionmodelbasedonmachinelearninginselectedregionsoftherussianfederation
AT aaivshin covid19pandemicpredictionmodelbasedonmachinelearninginselectedregionsoftherussianfederation
AT renovitskiy covid19pandemicpredictionmodelbasedonmachinelearninginselectedregionsoftherussianfederation