Recurrent neural network-based speech recognition using MATLAB


James, Praveen Edward and Kit, Mun Hou and Vaithilingam, Chockalingam Aravind and Tan, Alan Wee Chiat (2020) Recurrent neural network-based speech recognition using MATLAB. International Journal of Intelligent Enterprise, 7 (1,2,3). pp. 56-66. ISSN 1745-3232

Full text not available from this repository.


The purpose of this paper is to design an efficient speech recognition system based on a recurrent neural network (RNN) with long short-term memory (LSTM), implemented in software. The design process for this spoken sentence recognition system covers speech acquisition, pre-processing, feature extraction, training and pattern recognition. The network has five layers: an input layer, a fully connected layer, a hidden LSTM layer, a SoftMax layer and a sequential output layer. A vocabulary of 80 words, which make up 20 sentences, is used. The depth of the hidden layer is set to 20, 42 and 60 in turn, and the accuracy of each resulting system is determined. The results show that the maximum accuracy of 89% is achieved when the depth of the hidden layer is 42. Since the depth of the hidden layer is fixed for a given task, increased performance can be achieved by increasing the number of hidden layers.
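
To make the layer stack named in the abstract concrete, here is a minimal forward-pass sketch in plain Python. It is not the authors' MATLAB implementation. The wiring (input, LSTM, fully connected, softmax, class output) is the usual sequence-classification arrangement and is an assumption, since the abstract lists the layers without stating how they are connected. The class name `TinyLSTMClassifier`, the 13 features per frame and the random untrained weights are likewise illustrative assumptions. Only the hidden depth of 42 and the 20 sentence classes come from the abstract.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(W, v):
    """Multiply matrix W (list of rows) by vector v."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]


def rand_matrix(rows, cols, rng):
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


class TinyLSTMClassifier:
    """Hypothetical, untrained sketch: input -> LSTM -> fully connected -> softmax."""

    GATES = ("i", "f", "g", "o")  # input, forget, candidate, output

    def __init__(self, num_features, hidden_units, num_classes, seed=0):
        rng = random.Random(seed)
        self.hidden_units = hidden_units
        # One weight matrix per gate, applied to the concatenation [x_t, h_{t-1}].
        self.W = {g: rand_matrix(hidden_units, num_features + hidden_units, rng)
                  for g in self.GATES}
        self.b = {g: [0.0] * hidden_units for g in self.GATES}
        # Fully connected layer mapping the last hidden state to class scores.
        self.W_fc = rand_matrix(num_classes, hidden_units, rng)
        self.b_fc = [0.0] * num_classes

    def forward(self, frames):
        """frames: list of feature vectors, one per time step. Returns class probabilities."""
        h = [0.0] * self.hidden_units  # hidden state
        c = [0.0] * self.hidden_units  # cell state
        for x in frames:
            z = list(x) + h
            pre = {g: [a + b for a, b in zip(matvec(self.W[g], z), self.b[g])]
                   for g in self.GATES}
            i = [sigmoid(v) for v in pre["i"]]
            f = [sigmoid(v) for v in pre["f"]]
            g = [math.tanh(v) for v in pre["g"]]
            o = [sigmoid(v) for v in pre["o"]]
            c = [fk * ck + ik * gk for fk, ck, ik, gk in zip(f, c, i, g)]
            h = [ok * math.tanh(ck) for ok, ck in zip(o, c)]
        logits = [a + b for a, b in zip(matvec(self.W_fc, h), self.b_fc)]
        # Numerically stable softmax over the class scores.
        m = max(logits)
        exps = [math.exp(v - m) for v in logits]
        total = sum(exps)
        return [e / total for e in exps]


def predict(model, frames):
    """Index of the most probable class (here: one of the sentences)."""
    probs = model.forward(frames)
    return max(range(len(probs)), key=probs.__getitem__)


# Usage: 42 hidden units and 20 sentence classes as in the abstract;
# 13 features per frame and the random input are assumptions.
demo_rng = random.Random(1)
demo_frames = [[demo_rng.uniform(-1.0, 1.0) for _ in range(13)] for _ in range(30)]
demo_model = TinyLSTMClassifier(num_features=13, hidden_units=42, num_classes=20)
demo_class = predict(demo_model, demo_frames)
```

Because the weights are random, the predicted class carries no meaning. The sketch only shows how a variable-length sequence of feature frames is reduced to one probability per sentence, and where the hidden depth (20, 42 or 60 in the study) enters the model.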

Item Type: Article
Uncontrolled Keywords: Automatic speech recognition, feature extraction, pre-processing, recurrent neural network, RNN, long short-term memory
Subjects: T Technology > TK Electrical engineering. Electronics. Nuclear engineering > TK7800-8360 Electronics > TK7885-7895 Computer engineering. Computer hardware
Divisions: Faculty of Engineering (FOE)
Depositing User: Ms Suzilawati Abu Samah
Date Deposited: 29 Dec 2020 06:07
Last Modified: 29 Dec 2020 06:07

