Siddalingappa, Rashmi ORCID: https://orcid.org/0000-0001-9786-8436 and Kanagaraj, Sekar
(2023)
A Novel ML Approach for Computing Missing Sift, Provean, and Mutassessor Scores in Tp53 Mutation Pathogenicity Prediction.
International Journal of Advanced Computer Science and Applications.
Preview |
Text
A_Novel_ML_Approach_for_Computing_Missing_Sift.pdf - Published Version Available under License Creative Commons Attribution. | Preview |
Abstract
Cancer is often caused by missense mutations, where a single nucleotide substitution leads to an amino acid change and affects protein function. This study proposes a novel machine learning (ML) approach to calculate missing values in the tp53 database for three computational methods: SIFT, Provean, and Mutassessor scores. The computed values are compared with those obtained from the imputation method. Using these values, an ML classification model trained on 80,406 samples achieves an accuracy of 85%, while the impute method achieves 75%. The scores and statistics are used to classify samples into five classes: Benign, likely pathogenic, possibly pathogenic, pathogenic, and a variant of uncertain significance. Additionally, a comparative analysis is conducted on 58,444 samples, evaluating six ML techniques. The accuracy obtained by each of these mentioned in mentioned alongside the algorithm: logistic regression (89%), k-nearest neighbor (99%), decision tree (95%), random forest (99.8%), support vector machine with the polynomial kernel (91%), support vector machine with RBF kernel (84%), and deep neural networks (98.2%). These results demonstrate the effectiveness of the proposed ML approach for pathogenicity prediction.
| Item Type: | Article |
|---|---|
| Status: | Published |
| DOI: | 10.14569/ijacsa.2023.01406111 |
| Subjects: | Q Science > Q Science (General) Q Science > Q Science (General) > Q325 Machine learning |
| School/Department: | London Campus |
| URI: | https://ray.yorksj.ac.uk/id/eprint/12876 |
University Staff: Request a correction | RaY Editors: Update this record
Altmetric
Altmetric