Predicting Diabetes Mellitus with Machine Learning Techniques

Citation

Tong, Hau Lee and Ng, Hu and Arul Ananthan, Harannesh (2024) Predicting Diabetes Mellitus with Machine Learning Techniques. Journal of Engineering Technology and Applied Physics, 6 (1). pp. 91-99. ISSN 26828383

[img] Text
View of Predicting Diabetes Mellitus with Machine Learning Techniques.pdf - Published Version
Restricted to Repository staff only

Download (1MB)

Abstract

This study addresses the challenge of accurately identifying diabetes mellitus in individuals. Utilizing accessible online and real-world diagnostic data, we employ machine learning models, including Support Vector Machine, Random Forest, Naïve Bayes, eXtreme Gradient Boosting, and Deep Neural Network, on the PIMA Indian Diabetes and NHANES 1999-2016 datasets. Rigorous data pre-processing steps were conducted, handling null values, outliers, and imbalanced data together with data normalization. Our results reveal that the RF model achieves a 79% accuracy for binary classification on the PIMA Indian Diabetes dataset, using a 60:40 train-test split with BORUTA selected features. Meanwhile, the XGBoost model excels on the NHANES 1999-2016 dataset, achieving 92% accuracy for binary and 91% for multiclass classification respectively.

Item Type: Article
Uncontrolled Keywords: Diabetes Mellitus, Machine Learning, Accuracy
Subjects: R Medicine > R Medicine (General) > R855-855.5 Medical technology
R Medicine > R Medicine (General) > R858-859.7 Computer applications to medicine. Medical informatics
Divisions: Faculty of Computing and Informatics (FCI)
Depositing User: Mr. MUHAMMAD AZRUL MOSRI
Date Deposited: 03 Apr 2024 04:44
Last Modified: 03 Apr 2024 04:44
URII: http://shdl.mmu.edu.my/id/eprint/12371

Downloads

Downloads per month over past year

View ItemEdit (login required)