Ganesan, Swathi ORCID: https://orcid.org/0000-0002-6278-2090, Somasiri, Nalinda ORCID: https://orcid.org/0000-0001-6311-2251, Jeyavadhanam, Rebecca and Karthick, Gayathri (2023) Improved Computational Efficiency of Machine Learning Algorithm based on Evaluation Metrics to control the spread of Coronavirus in the UK. In: ICDSTA 2023: 17. International Conference on Data Science Techniques and Applications, 16-17th February, 2023, London.
Text
Covid paper.pdf - Accepted Version Restricted to Repository staff only |
Abstract
The Covid-19 crisis presents a substantial and critical hazard to worldwide health. Since the occurrence of the disease in late January 2020 in the UK, the number of infected people confirmed to acquire the illness has increased tremendously across the country, and the number of individuals affected is undoubtedly considerably high. The purpose of this research is to figure out a predictive machine learning archetypal that could forecast the covid-19 cases within the UK. This study concentrates on the statistical data collected from 31st January 2020 to 31st March 2021 in the United Kingdom. Information on total covid cases registered, new cases encountered on a daily basis, total death registered, and patients’ death per day due to Coronavirus is collected from World Health Organisation (WHO). Data preprocessing is carried out to identify any missing values, outliers, or anomalies in the dataset. The data is split into 8:2 ratio for training and testing purposes to forecast future new Covid cases. Support Vector Machines (SVM), Random Forests, and linear regression algorithms are chosen to study the model performance in the prediction of new Covid 19 cases. From the evaluation metrics such as r-squared value and mean squared error, the statistical performance of the model in predicting the new Covid cases is evaluated. Random Forest outperformed the other two Machine Learning algorithms with a training accuracy of 99.47% and testing accuracy of 98.26% when n=30. The mean square error obtained for Random Forest is 4.05e11, which is lesser compared to the other predictive models used for this study. From the experimental analysis Random Forest algorithm can perform more effectively and efficiently in predicting the new Covid cases, which could help the health sector to take relevant control measures for the spread of the virus.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Status: | Published |
Subjects: | T Technology > T Technology (General) T Technology > T Technology (General) > T201 Patents. Trademarks |
School/Department: | London Campus |
URI: | https://ray.yorksj.ac.uk/id/eprint/7363 |
University Staff: Request a correction | RaY Editors: Update this record