Feature-Level Analysis and Robust Baselines for EEG-Based Imagined Speech Recognition on the ASU Dataset

Citation

Duhair, Hatem T M and Ibrahim, Masrullizam Mat and Alsayaydeh, Jamil Abedalrahim Jamil and Farid, Mazen and Herawan, Safarudin Gazali (2026) Feature-Level Analysis and Robust Baselines for EEG-Based Imagined Speech Recognition on the ASU Dataset. International Journal of Advanced Computer Science and Applications, 17 (4). ISSN 2158107X

[img] Text
13.pdf - Published Version
Restricted to Repository staff only

Download (1MB)

Abstract

Abstract: Imagined speech decoding from non-invasive electroencephalography remains a challenging problem, especially when moving beyond small vocabularies and optimistic evaluation protocols. This work revisits the Arizona State University (ASU) imagined speech dataset and treats it as a rigorous ten-class benchmark, with a focus on offline, corpus-level analysis rather than real-time deployment. After unifying all recordings into 5 s epochs at 256 Hz, 6,520 trials with 60 EEG channels were preprocessed using bandpass filtering, baseline correction, z-score normalization, and trial-wise ICA for artifact attenuation. On top of this pipeline, a comprehensive feature representation was constructed that combines common spatial patterns, discrete wavelet statistics, time-domain moments, autocorrelation coefficients, power spectral density band powers, and Hjorth parameters into a single 5,120-dimensional vector. A block-wise ablation indicates that autocorrelation, CSP, PSD, and Hjorth features carry most of the discriminative information in this setting, while wavelet and simple statistical descriptors contribute little and can be removed without harming performance. Using only the informative blocks (3,440 features), a multinomial logistic regression classifier reaches about 0.41 accuracy and 0.42 macro F1 on the ten-class task, roughly four times chance level. A multi-layer perceptron and a CNN–LSTM model, trained under the same splits and with class weighting, do not outperform this linear baseline and exhibit stronger overfitting. Within the evaluated protocol, these findings suggest that carefully engineered features capture most of the discriminative structure accessible on this corpus, and that deeper models add complexity without clear benefit. The study provides a transparent baseline and a feature-level analysis that can serve as a reference point for future work on imagined speech recognition and transfer learning across EEG corpora.

Item Type: Article
Uncontrolled Keywords: EEG-based imagined speech, ASU imagined speech dataset, brain–computer interface, feature extraction, logistic regression
Subjects: Q Science > QA Mathematics > QA71-90 Instruments and machines
Divisions: Faculty of Information Science and Technology (FIST)
Depositing User: Ms Suzilawati Abu Samah
Date Deposited: 05 Jun 2026 06:27
Last Modified: 05 Jun 2026 06:27
URII: http://shdl.mmu.edu.my/id/eprint/16039

Downloads

Downloads per month over past year

View ItemEdit (login required)