Towards enhanced assessment question classification: a study using machine learning, deep learning, and generative AI

Citation

Gani, Mohammed Osman and Ayyasamy, Ramesh Kumar and Alhashmi, Saadat M. and Alam, Khondaker Sajid and Sangodiah, Anbuselvan and Khaleduzzman, Khondaker and Ponnusamy, Chinnasamy (2025) Towards enhanced assessment question classification: a study using machine learning, deep learning, and generative AI. Connection Science, 37 (1). ISSN 0954-0091

Text
4.pdf - Published Version
Restricted to Repository staff only
Download (4MB)

Official URL: https://doi.org/10.1080/09540091.2024.2445249

Abstract

This study benchmarks the performance of machine learning (ML), deep learning (DL), and generative AI (GenAI) models in categorising assessment questions according to Bloom’s Taxonomy. Previous studies have not comprehensively compared these approaches, and GenAI in particular remains unexplored for this task. We therefore examine the effectiveness of various ML models that incorporate domain-specific term weighting and word embeddings. The study also analyses the performance of Recurrent Neural Networks (RNNs) and Convolutional Neural Networks (CNNs) with and without bidirectional connections, as well as an approach that combines RNNs and CNNs. In addition, we fine-tune several transformer-based models and evaluate the GenAI models text-davinci-003, gpt-3.5-turbo, PaLM2, and Gemini Pro in zero-shot classification settings. The results show that ML models outperformed DL models, achieving a best accuracy of 0.871 and a best F1 score of 0.872, and that domain-specific term weighting is superior to word embeddings. Most ML and DL models also performed better than the GenAI models, whose best accuracy was 0.618 and best F1 score was 0.627. These outcomes suggest adopting ML models with domain-specific term weighting as benchmark models in future research.
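
The abstract compares models by accuracy and F1 score. As a rough illustration of how such figures are computed for a multi-class task, the sketch below implements both metrics in plain Python. It is not the authors' evaluation code: the abstract does not state which F1 averaging was used (macro averaging is assumed here), the six labels follow the revised Bloom's Taxonomy as an assumption, and the function names and toy predictions are hypothetical.

```python
from collections import Counter

# Assumed label set (revised Bloom's Taxonomy); the abstract does not list the labels.
BLOOM_LEVELS = ["Remember", "Understand", "Apply", "Analyze", "Evaluate", "Create"]


def accuracy(y_true, y_pred):
    """Fraction of questions whose predicted level equals the true level."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and of equal length")
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def macro_f1(y_true, y_pred, labels=BLOOM_LEVELS):
    """Unweighted mean of per-class F1 scores (macro averaging, an assumption)."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted class gains a false positive
            fn[t] += 1  # true class gains a false negative
    scores = []
    for label in labels:
        denom = 2 * tp[label] + fp[label] + fn[label]
        if denom == 0:
            continue  # label appears in neither truth nor predictions; skipped
        scores.append(2 * tp[label] / denom)
    return sum(scores) / len(scores) if scores else 0.0


# Toy example with made-up predictions, for illustration only.
y_true = ["Remember", "Apply", "Apply", "Create"]
y_pred = ["Remember", "Apply", "Create", "Create"]
print(accuracy(y_true, y_pred))
print(macro_f1(y_true, y_pred))
```

Macro averaging weights every Bloom level equally, which matters when some levels are rare in the question set. A weighted average would instead follow the class distribution and can give a noticeably different score on imbalanced data.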

Item Type:	Article
Uncontrolled Keywords:	Bloom’s taxonomy, assessment question classification, generative AI, term weighting, word embedding
Subjects:	Q Science > QA Mathematics > QA71-90 Instruments and machines
Divisions:	Faculty of Computing and Informatics (FCI)
Depositing User:	Ms Nurul Iqtiani Ahmad
Date Deposited:	17 Feb 2025 07:51
Last Modified:	18 Feb 2025 02:15
URI:	http://shdl.mmu.edu.my/id/eprint/13449
