Please use this identifier to cite or link to this item:
https://scholarbank.nus.edu.sg/handle/10635/43044
Title: | Stress classification using subband based features | Authors: | Nwe, T.L. Foo, S.W. De Silva, L.C. |
Keywords: | Log Frequency Power Coefficients Nonlinear frequency domain LFPC feature Nonlinear time domain LFPC features Stress classification |
Issue Date: | 2003 | Citation: | Nwe, T.L.,Foo, S.W.,De Silva, L.C. (2003). Stress classification using subband based features. IEICE Transactions on Information and Systems E86-D (3) : 565-573. ScholarBank@NUS Repository. | Abstract: | On research to determine reliable acoustic indicators for the type of stress present in speech, the majority of systems have concentrated on the statistics extracted from pitch contour, energy contour, wavelet based subband features and Teager-Energy-Operator (TEO) based feature parameters. These systems work mostly on pair-wise distinction between stress and neutral speech. Their performance decreases substantially when tested in multi-style detection among many stress categories. In this paper, a novel system is proposed using linear short time Log Frequency Power Coefficients (LFPC) and TEO based nonlinear LFPC features in both time and frequency domain. Five-state Hidden Markov Model (HMM) with continuous Gaussian mixture distribution is used. The stress classification ability of the system is tested using data from the SUSAS (Speech Under Simulated and Actual Stress) database to categorize five stress conditions individually. It is found that the performance of linear acoustic features LFPC is better than that of nonlinear TEO based LFPC feature parameters. Results show that with linear acoustic feature LFPC, average accuracy of 84% and the best accuracy of 95% can be achieved in the classification of the five categories. Results of test of the system under different signal-to-noise conditions show that the performance of the system does not degrade drastically with increase in noise. It is also observed that classification using nonlinear frequency domain LFPC features gives relatively higher accuracy than that using nonlinear time domain LFPC features. | Source Title: | IEICE Transactions on Information and Systems | URI: | http://scholarbank.nus.edu.sg/handle/10635/43044 | ISSN: | 09168532 |
Appears in Collections: | Staff Publications |
Show full item record
Files in This Item:
There are no files associated with this item.
Google ScholarTM
Check
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.