Please use this identifier to cite or link to this item:
|Title:||Integrating articulatory based features with auditory based features for robust stressed speech recognition||Authors:||Tin, L.N.
Nonlinear FFT spectrum
Robust speech recognition
|Issue Date:||2005||Citation:||Tin, L.N.,Haizhou, L.,Ye, W. (2005). Integrating articulatory based features with auditory based features for robust stressed speech recognition. 2005 Fifth International Conference on Information, Communications and Signal Processing 2005 : 1334-1338. ScholarBank@NUS Repository.||Abstract:||Intra-speaker variations due to perceptually induced stress or emotion adversely affect speech recognition system performance. In this paper, we combine auditory based (Mel Frequency Cepstral Coefficients and Linear Predictive Cepstral Coefficients) features and articulatory based (voicedness) features for robust speech recognition. Voicedness features are derived using linear and Teager Energy Operator (TEO) based nonlinear Fast Fourier Transform (FFT) spectra. Nonlinear properties are analyzed in both the time and frequency domains. In addition, we investigate the sensitivity of all these FFT spectra to stress and observe the performance of individual FFT spectra. The system is tested using stressed speech data from the Speech Under Simulated and Actual Stress (SUSAS) database. The results show that articulatory based features help to improve the system performance. Furthermore, significant performance improvement has been observed when using the FFT spectrum which is less sensitive to stress. © 2005 IEEE.||Source Title:||2005 Fifth International Conference on Information, Communications and Signal Processing||URI:||http://scholarbank.nus.edu.sg/handle/10635/41474||ISBN:||0780392833|
|Appears in Collections:||Staff Publications|
Show full item record
Files in This Item:
There are no files associated with this item.
checked on Apr 19, 2019
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.