Citation
Amir Suharman, Nur Adlina Marini and Lim, Amy Hui Lan and Goh, Hui Ngo (2025) Sequence-to-pattern analysis for predicting buying decisions on imbalanced clickstream data. Cogent Engineering, 12 (1). ISSN 2331-1916![]() |
Text
Sequence-to-pattern analysis for predicting buying decisions on imbalanced clickstream data.pdf - Published Version Restricted to Repository staff only Download (3MB) |
Abstract
The e-commerce industry has transformed online shopping. To better understand customer behavior and buying decisions, anonymized clickstream data from the SIGIR eCOM 2021 Data Challenge was analyzed. The analysis iterated between data preprocessing and exploratory data analysis (EDA) before predictive modeling. Constraint-based Seq2Pat, which included the Dichotomic Pattern Mining (DPM) technique, was employed to identify common browsing patterns among customers. Using the ratio() method in the SequenceMatcher class of difflib, the obtained patterns were mapped with the patterns from the preprocessed clickstream dataset, and the sequences with the highest similarity score were identified. Preprocessed imbalanced clickstream data with various ratios of buyer and non-buyer groups, namely 4:96, 3:97, 2:98, and 1:99, were prepared by adjusting the thresholds for the similarity score, and their prediction performance was observed. Logistic Regression (LR) achieved high prediction performance across imbalanced clickstream datasets of different ratios, with a ratio of 4:96 performing exceptionally well, with 90.95% average recall and 95.26% average F1-score. Businesses can use this knowledge as a guide to preprocess clickstream data and automate it. Moreover, LR can be used to predict potential buyers. Finally, the results from EDA and prediction support effective marketing campaigns, accurately target potential buyers, and increase the conversion rate.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | e-commerce |
Subjects: | H Social Sciences > HF Commerce > HF5001-6182 Business > HF5546-5548.6 Office management > HF5548.32-.34 Electronic commerce |
Divisions: | Faculty of Computing and Informatics (FCI) |
Depositing User: | Ms Rosnani Abd Wahab |
Date Deposited: | 03 Jun 2025 02:00 |
Last Modified: | 03 Jun 2025 02:00 |
URII: | http://shdl.mmu.edu.my/id/eprint/13911 |
Downloads
Downloads per month over past year
![]() |