Citation
Law, Theng Jia and Ting, Choo Yee and Ng, Hu and Goh, Hui Ngo and Quek, Albert (2025) Improving Embeddings Representation via QED Approach for Enhancing Graduate on Time and Employability Prediction. International Journal of Computational Intelligence Systems, 18 (1). ISSN 1875-6883|
Text
Improving Embeddings Representation via QED Approach for Enhancing Graduate on Time and Employability Prediction.pdf - Published Version Restricted to Repository staff only Download (795kB) |
Abstract
Embeddings have been widely implemented with Large Language Models in various domains to enhance Machine Learning models prediction compared to existing encoding methods. Due to the high-dimensional embeddings generated, researchers have employed data dimensionality reduction algorithms to improve the embeddings representation, but it remains challenging in interpreting the original features. The difficulties become more significant in education, especially in predicting graduate on time and employability. Therefore, this work proposed Quantized Embeddings Reduction to improve the embeddings representation in predicting graduate on time and employability. This algorithm is compared against existing data dimensionality reduction algorithms under three scenarios based on 4007 graduates across 38 variables. Without feature selection, Quantized Embeddings Reduction achieved the highest accuracy in 9 of 12 semesters for on-time graduation prediction and 10 of 12 semesters for employability prediction. With feature selection, it retained dominance, leading accuracy in 8 sesmesters for both tasks which was peaking at 81.3% in employability prediction while achieving unmatched precision with 88.2% compared to others with less than 88.1% in on-time graduation and F1-score with 89.5% compared to others with less than 89.4%. Under feature selection and hyperparameter tuning, the algorithm secured top accuracy in 11 semesters in employability prediction while dominating precision (88.1%). A Friedman test ranked the proposed algorithm as the best-performing algorithm in improving embeddings representation. In future, the scalability of proposed algorithm could be extended to broader domains.
| Item Type: | Article |
|---|---|
| Uncontrolled Keywords: | Machine learning, data dimensionality reduction |
| Subjects: | Q Science > Q Science (General) > Q300-390 Cybernetics |
| Divisions: | Faculty of Computing and Informatics (FCI) |
| Depositing User: | Ms Rosnani Abd Wahab |
| Date Deposited: | 10 Dec 2025 07:48 |
| Last Modified: | 13 Dec 2025 14:58 |
| URII: | http://shdl.mmu.edu.my/id/eprint/15041 |
Downloads
Downloads per month over past year
Edit (login required) |
