Comparative Analysis of Machine Learning Algorithms for Health Insurance Pricing


Bau, Yoon Teck and Md Hanif, Shuhail Azri (2024) Comparative Analysis of Machine Learning Algorithms for Health Insurance Pricing. JOIV : International Journal on Informatics Visualization, 8 (1). p. 481. ISSN 2549-9610

2282-6603-1-PB.pdf - Published Version
Restricted to Repository staff only



Insurance is an effective way to guard against potential loss, and risk management is employed primarily to protect against the risk of financial loss. Risk and uncertainty are inevitable parts of life, and the pace of modern life has increased both. Health insurance pricing has emerged as an essential field of study following the coronavirus pandemic. The anticipated outcomes of this study are intended to help an insurance company keep the pricing of its health insurance packages within a profitable range while choosing the most cost-effective course of action. The US Health Insurance dataset was used for this study. The study examines four regression-based machine learning algorithms for health insurance pricing prediction: multiple linear regression, ridge regression, XGBoost regression, and random forest regression. The performance of each implemented model is assessed using four evaluation metrics: MAE, MSE, RMSE, and R2 score. Random forest regression outperforms all other algorithms on all four evaluation metrics. The best-performing algorithm, random forest, is then further enhanced with hyperparameter tuning. Random forest with hyperparameter tuning performs better on three of the four evaluation metrics, the exception being MAE. To gain further insights, data visualizations are also provided to show feature importance and the differences between actual and predicted prices for all data points.
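
The four evaluation metrics named in the abstract can be illustrated with a minimal Python sketch using their standard definitions. This is not the authors' code: the function name regression_metrics and the sample values are hypothetical, chosen only to show how MAE, MSE, RMSE, and R2 score are computed from actual and predicted prices.

```python
import math


def regression_metrics(actual, predicted):
    """Return MAE, MSE, RMSE, and R2 for paired actual/predicted values."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")

    n = len(actual)
    errors = [a - p for a, p in zip(actual, predicted)]

    mae = sum(abs(e) for e in errors) / n   # mean absolute error
    mse = sum(e * e for e in errors) / n    # mean squared error
    rmse = math.sqrt(mse)                   # root mean squared error

    # R2 compares the residual sum of squares with the total sum of
    # squares around the mean of the actual values.
    mean_actual = sum(actual) / n
    ss_res = sum(e * e for e in errors)
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    r2 = 1.0 - ss_res / ss_tot if ss_tot else float("nan")

    return {"MAE": mae, "MSE": mse, "RMSE": rmse, "R2": r2}


# Hypothetical prices, for illustration only (not taken from the dataset).
actual = [100.0, 200.0, 300.0, 400.0]
predicted = [110.0, 190.0, 310.0, 390.0]
scores = regression_metrics(actual, predicted)
print(scores)
```

Because MSE, RMSE, and R2 are built from squared errors, they weight large errors more heavily than MAE does, so a model can improve on those three metrics without improving on MAE.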

Item Type: Article
Uncontrolled Keywords: Health Insurance Pricing; Machine Learning Algorithms; Regression; Multiple Linear Regression; Ridge Regression; XGBoost Regression; Random Forest Regression; MAE; MSE; RMSE; R2 Score; Hyperparameter Tuning
Subjects: Q Science > QA Mathematics > QA71-90 Instruments and machines > QA75-76.95 Calculating machines
Divisions: Faculty of Computing and Informatics (FCI)
Depositing User: Ms Nurul Iqtiani Ahmad
Date Deposited: 03 May 2024 02:52
Last Modified: 03 May 2024 02:52

