Enhanced speech recognition in natural language processing

Citation

Chang, Siu Hong and Ng, Kok Why and Haw, Su Cheng and Yoong, Yih Jian (2025) Enhanced speech recognition in natural language processing. Bulletin of Electrical Engineering and Informatics, 14 (6). pp. 4732-4742. ISSN 2089-3191

Full text not available from this repository.

Abstract

Speech recognition is crucial for helping individuals with physical disabilities access digital content. However, current systems have significant flaws that hinder user experience and complicate daily tasks. Environmental disturbances can cause misinterpretation, and existing automatic speech recognition (ASR) systems struggle with comprehending acoustic and linguistic nuances and handling diverse speaking styles and accents. To address these issues, a new model integrates bidirectional encoder representations from transformers (BERT) and transformer features with natural language processing (NLP) capabilities. This model aims to consolidate semantic, linguistic, and acoustic information extracted from the Kaldi speech recognition toolkit and improve accuracy by rescoring the list of N-best hypotheses. The innovative approach leverages advancements in NLP to enhance speech recognition's accuracy and robustness across various scenarios. Evaluations on the LibriSpeech dataset show that integrating BERT, transformer encoder, and generative pretrained transformer 2 for rescoring N-best hypotheses significantly improves transcription accuracy. The proposed model achieves a word error rate (WER) of 17.98%, outperforming other models. This development paves the way for advancements in speech recognition technology, offering better user experiences in real-world applications.

Item Type: Article
Uncontrolled Keywords: Speech recognition
Subjects: T Technology > TK Electrical engineering. Electronics Nuclear engineering > TK7800-8360 Electronics > TK7885-7895 Computer engineering. Computer hardware
Divisions: Faculty of Computing and Informatics (FCI)
Depositing User: Ms Rosnani Abd Wahab
Date Deposited: 22 Dec 2025 04:08
Last Modified: 22 Dec 2025 04:08
URII: http://shdl.mmu.edu.my/id/eprint/15098

Downloads

Downloads per month over past year

View ItemEdit (login required)