Evaluation of Cost Sensitive Learning for Imbalanced Bank Direct Marketing Data

Citation

Khor, Kok Chin and Ng, Keng Hoong (2016) Evaluation of Cost Sensitive Learning for Imbalanced Bank Direct Marketing Data. Indian Journal of Science and Technology, 9 (42). ISSN 0974-6846 (In Press)

[img] Text
136.pdf
Restricted to Repository staff only

Download (570kB)

Abstract

Objectives: The imbalanced bank direct marketing data set utilized in this study is a two-class data mining problem, where a customer may or may not subscribe a product from a bank. Methods/Statistical Analysis: The data set inherited the rare class problem where the classification rate attained for the rare class is low. In this study, we attempted cost sensitive learning to mitigate the problem, and to address that there are various costs involved when misclassification occurs. Three learning algorithms, namely, Naive Bayes (NB), C4.5 and Naive Bayes Tree (NBT) were involved in the cost sensitive learning and their results were empirically evaluated. Findings: The results were also compared with two previous studies that utilized the cost insensitive SVM and over-sampling, respectively. Although cost sensitive learning is claimed able to handle imbalanced data sets, but we noticed that the learning is less effective for the bank direct marketing data set in overall. Cost sensitive learning provides a way of “wrapping” learning algorithms that are not designed to handle imbalanced class distributions. Therefore, it may not work well for certain imbalanced data sets. Over-sampling, on the other hand, worked well for the data set. Improvements/Applications: Over-sampling helped to generalize the decision region of the rare class clearly and subsequently improved the classification result.

Item Type: Article
Uncontrolled Keywords: Bank Direct Marketing, Cost Sensitive Learning, Imbalanced Data Set, Rare Class Problem, Over-Sampling
Subjects: H Social Sciences > HF Commerce > HF5001-6182 Business > HF5410-5417.5 Marketing. Distribution of products
Divisions: Faculty of Computing and Informatics (FCI)
Depositing User: Ms Rosnani Abd Wahab
Date Deposited: 02 Aug 2018 16:15
Last Modified: 02 Aug 2018 16:15
URII: http://shdl.mmu.edu.my/id/eprint/6721

Downloads

Downloads per month over past year

View ItemEdit (login required)