Citation
Das, Prosenjit and Sarker, Proshenjit and Tiang, Jun Jiat and Nahid, Abdullah-Al (2025) Dengue Fever Classification Integrating Bird Swarm Algorithm with Gradient Boosting Classifier Along with Feature Selection and SHAP–DiCE Based Interpretability. Applied Sciences, 15 (21). p. 11413. ISSN 2076-3417|
Text
applsci-15-11413-v2.pdf - Published Version Restricted to Repository staff only Download (1MB) |
Abstract
Dengue is a life-threatening disease that is transmitted by mosquitoes. Dengue fever has no proper treatment. Early, proper diagnosis is essential to minimize complications and enhance outcomes in patients. This research uses a clinical and hematological dataset of dengue to assess the effectiveness of the Gradient Boosting (GB) classification model with and without feature selection. It initially employs a standalone GB model, achieving impeccable results for classification, at 100% accuracy, F1-score, precision, and recall. In addition, the Bird Swarm Algorithm (BSA)-based metaheuristic technique is implemented on the GB classifier to execute wrapper-based feature selection so that features are reduced and achieve better results. The BSA-GB model yielded an accuracy of 99.49%, F1-score of 99.62%, recall of 99.24%, and precision of 100%, but it only selected five features in total. An additional test with a five-fold cross-validation was employed for better performance and model evaluation. Folds 1 and 2 showed especially good results. Although fold 2 selected only four features, it still showed high results, compared to fold 1, which selected five features. In this context, fold 2 achieved an accuracy of 99.49%, F1-score of 99.65%, recall of 99.30%, and precision of 100%. Means of hyperparameters were also calculated across folds to make a generalized GB model, which maintained 99.49% of accuracy with just three features, namely, Hemoglobin, WBC Count, and Platelet Count. To enhance transparency, counterfactual explanations were performed to analyze the misclassified cases, which indicated that minimum changes in input features modify the predictions. Also, an evaluation of the SHAP value result designated WBC Count and Platelet Count as the most important features.
| Item Type: | Article |
|---|---|
| Uncontrolled Keywords: | Bird swarm algorithm, dengue, diverse counterfactual explanations, gradient boosting, metaheuristic, SHAP |
| Subjects: | R Medicine > RC Internal medicine > RC71-78.7 Examination. Diagnosis |
| Divisions: | Faculty of Artificial Intelligence & Engineering (FAIE) |
| Depositing User: | Nor Afiqah Mohd Adnan |
| Date Deposited: | 10 Dec 2025 01:34 |
| Last Modified: | 10 Dec 2025 01:34 |
| URII: | http://shdl.mmu.edu.my/id/eprint/15000 |
Downloads
Downloads per month over past year
Edit (login required) |
