Citation
Zhi, Zheng Kang and Sin, Yin Teh and Yong, Samuel Guang Tan and Wei, Chien Ng (2025) Loan Default Prediction Using Machine Learning Algorithms. Journal of Informatics and Web Engineering, 4 (3). pp. 232-244. ISSN 2821-370X
Abstract
Financial institutions constantly face the risk of borrower default, which can result in significant financial losses. Developing an appropriate predictive model for loan default is essential to reduce this risk and minimise those losses. The objective of this study is to identify the most suitable machine learning model for predicting loan default by comparing four models: Random Forest, Decision Tree, Extreme Gradient Boosting (XGBoost), and Light Gradient Boosting Machine (LightGBM). It also examines the key features influencing loan default prediction. The dataset used in this study is sourced from Kaggle and consists of 148,670 rows with 34 features. Because class imbalance is common in loan default prediction, the Synthetic Minority Over-sampling Technique (SMOTE) is applied during model training to enhance predictive performance. Model performance is evaluated using five metrics: accuracy, precision, recall, F1-score, and the area under the receiver operating characteristic curve (ROC AUC). The results indicate that LightGBM performs best among the four models, with the highest accuracy (0.9764), together with a precision of 0.9747 and a recall of 0.9503. Feature importance is analysed using permutation importance, which identifies interest, credit type, interest rate spread, and upfront charges as the four most significant features for loan default. These findings provide useful information for financial institutions, aiding risk assessment and decision-making to mitigate potential losses.
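For readers unfamiliar with the five evaluation metrics named in the abstract, the following is a minimal, standard-library-only sketch of how they are conventionally defined. It is not the authors' code; the function names and the label convention (1 = default, 0 = no default) are assumptions made for illustration.

```python
from typing import Dict, Sequence


def classification_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Accuracy, precision, recall and F1 for binary labels (1 = default, assumed)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # defaults correctly flagged
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # non-defaults wrongly flagged
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # defaults that were missed
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # non-defaults correctly passed

    accuracy = (tp + tn) / len(pairs)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f1": f1_from(precision, recall),
    }


def f1_from(precision: float, recall: float) -> float:
    """F1-score: the harmonic mean of precision and recall."""
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """ROC AUC as the probability that a random positive outranks a random negative.

    Ties count as half a win. This pairwise form is O(n_pos * n_neg), which is
    fine for a toy example but slow for a dataset of the size in the abstract.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("ROC AUC needs at least one positive and one negative label")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


if __name__ == "__main__":
    # Toy, made-up labels and scores; not data from the study.
    y_true = [1, 1, 1, 0, 0, 0, 0, 0]
    scores = [0.9, 0.8, 0.3, 0.6, 0.2, 0.1, 0.4, 0.05]
    y_pred = [1 if s >= 0.5 else 0 for s in scores]  # 0.5 threshold is an assumption
    print(classification_metrics(y_true, y_pred))
    print(roc_auc(y_true, scores))
```

The abstract does not state the F1-score for LightGBM. If the reported precision (0.9747) and recall (0.9503) were measured on the same test set, `f1_from(0.9747, 0.9503)` gives a harmonic mean of roughly 0.962.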
| Item Type: | Article |
|---|---|
| Uncontrolled Keywords: | Financial Inclusion, LightGBM, Loan Default, Machine Learning, XGBoost |
| Subjects: | H Social Sciences > HG Finance > HG3691-3769 Credit. Debt. Loans. Including credit institutions, credit instruments, consumer credit, bankruptcy |
| Divisions: | Others |
| Depositing User: | Nor Afiqah Mohd Adnan |
| Date Deposited: | 11 Nov 2025 01:34 |
| Last Modified: | 11 Nov 2025 01:34 |
| URI: | http://shdl.mmu.edu.my/id/eprint/14854 |
