Citation
Tong, Hau Lee and Ng, Hu and Arul Ananthan, Harannesh (2024) Predicting Diabetes Mellitus with Machine Learning Techniques. Journal of Engineering Technology and Applied Physics, 6 (1). pp. 91-99. ISSN 26828383
Text
View of Predicting Diabetes Mellitus with Machine Learning Techniques.pdf - Published Version Restricted to Repository staff only Download (1MB) |
Abstract
This study addresses the challenge of accurately identifying diabetes mellitus in individuals. Utilizing accessible online and real-world diagnostic data, we employ machine learning models, including Support Vector Machine, Random Forest, Naïve Bayes, eXtreme Gradient Boosting, and Deep Neural Network, on the PIMA Indian Diabetes and NHANES 1999-2016 datasets. Rigorous data pre-processing steps were conducted, handling null values, outliers, and imbalanced data together with data normalization. Our results reveal that the RF model achieves a 79% accuracy for binary classification on the PIMA Indian Diabetes dataset, using a 60:40 train-test split with BORUTA selected features. Meanwhile, the XGBoost model excels on the NHANES 1999-2016 dataset, achieving 92% accuracy for binary and 91% for multiclass classification respectively.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | Diabetes Mellitus, Machine Learning, Accuracy |
Subjects: | R Medicine > R Medicine (General) > R855-855.5 Medical technology R Medicine > R Medicine (General) > R858-859.7 Computer applications to medicine. Medical informatics |
Divisions: | Faculty of Computing and Informatics (FCI) |
Depositing User: | Mr. MUHAMMAD AZRUL MOSRI |
Date Deposited: | 03 Apr 2024 04:44 |
Last Modified: | 03 Apr 2024 04:44 |
URII: | http://shdl.mmu.edu.my/id/eprint/12371 |
Downloads
Downloads per month over past year
Edit (login required) |