A preliminary speech analysis for recognizing emotion

Citation

Abd. Razak, Aishah and Zainal Abidin, Mohd Izani and Komiya, Ryoichi (2003) A preliminary speech analysis for recognizing emotion. In: Proceedings. Student Conference Research and Development, 2003. SCORED 2003. IEEE Xplore, 49 -54. ISBN 0-7803-8173-4

[img] Text
01459662.pdf - Published Version
Restricted to Repository staff only

Download (941kB)

Abstract

Some speech analysis to extract emotion from voice is discussed. An emotional Malay and English voice database has been developed, consisting six basic emotions namely happiness, sadness, disgust, fear, anger and surprise. As the target is content independent emotion recognition, 4 short sentences that have the most natural meaning is adopted for the illustration and analysis. A study on speech prosody is done to identify the emotional features of voice. Variation on the sample's energy, duration, and pitch for different emotions is compared. Spectrogram analysis is done on some samples to observe the effect of formant. It is found that duration, average energy and pitch can provide some indication of emotional content of a speech, but it is not enough to correctly represent the emotions. Even though there are slightly different pattern for English and Malay samples, it is still reasonable to assume that there are standard acoustic configurations in expressing particular emotions.

Item Type: Book Section
Uncontrolled Keywords: Speech processing
Subjects: T Technology > TK Electrical engineering. Electronics Nuclear engineering > TK7800-8360 Electronics
Divisions: Faculty of Information Science and Technology (FIST)
Depositing User: Ms Suzilawati Abu Samah
Date Deposited: 24 Dec 2013 01:55
Last Modified: 19 Aug 2021 00:16
URII: http://shdl.mmu.edu.my/id/eprint/4653

Downloads

Downloads per month over past year

View ItemEdit (login required)