Main Menu

Classifying Scam Calls Through Content Analysis With Dynamic Sparsity Top-k Attention Regularization

Citation

Zhi, Brendan Hong Jun and Tee, Connie and Ong, Thian Song and Teoh, Andrew Beng Jin (2025) Classifying Scam Calls Through Content Analysis With Dynamic Sparsity Top-k Attention Regularization. IEEE Access, 13. pp. 111733-111751. ISSN 2169-3536

Text
3.pdf - Published Version
Restricted to Repository staff only
Download (3MB)

Official URL: https://doi.org/10.1109/ACCESS.2025.3582906

Abstract

The rise of scam calls in recent years necessitated effective countermeasures against these fraudulent activities, which cause financial losses and threaten personal security. Although previous research utilizing traditional machine learning techniques has fallen short in today’s technological landscape, this study introduces a novel approach for recognizing scam calls by analyzing their content. By leveraging natural language processing techniques and deep learning methodologies, we propose the D-STAR (Dynamic Sparse Attention with Top-k Regularization) model, a transformer-based architecture designed to enhance scam call content detection. Unlike conventional models, D-STAR integrates Dynamic Sparse Attention (DSA), Top-k selection, and sparsity regularization, optimizing computational efficiency while preserving key scam-related contextual information. Our data set consists of 400 scam and 400 non-scam call transcripts, collected from publicly available sources such as social media, news reports, and discussion forums. To ensure dataset diversity, ChatGPT was utilized only to augment real scam scenarios across different contexts while preserving their core fraudulent structures. The model was evaluated using various hyperparameter configurations and managed to achieve an accuracy of 94%, a recall of 91.67%, and an F1-score of 84.98% in classifying scam call contents, outperforming state-of-the-art baselines such as CNN, LSTM, Decision Tree, Random Forest, and SVM in the scam call detection domain. A knowledge graph-based preprocessing technique was also introduced to enrich scam-related contextual understanding. The proposed approach demonstrates its effectiveness in enhancing scam call classification while maintaining computational efficiency. Future work will focus on real-world validation with telecom providers and further optimizations for real-time deployment.

Item Type:	Article
Uncontrolled Keywords:	Attention mechanism, transformer, natural language processing, scam call detection, transformers
Subjects:	Q Science > QA Mathematics > QA71-90 Instruments and machines
Divisions:	Faculty of Information Science and Technology (FIST)
Depositing User:	Ms Suzilawati Abu Samah
Date Deposited:	29 Jul 2025 00:31
Last Modified:	31 Jul 2025 01:35
URII:	http://shdl.mmu.edu.my/id/eprint/14319

Downloads

Downloads per month over past year

Edit (login required)