An SNR-incremental stochastic matching algorithm for noisy speech recognition

Please use this identifier to cite or link to this item: https://doi.org/10.1109/89.966089

DC Field	Value
dc.title	An SNR-incremental stochastic matching algorithm for noisy speech recognition
dc.contributor.author	Huang, C.-S.
dc.contributor.author	Wang, H.-C.
dc.contributor.author	Lee, C.-H.
dc.date.accessioned	2013-07-04T07:36:43Z
dc.date.available	2013-07-04T07:36:43Z
dc.date.issued	2001
dc.identifier.citation	Huang, C.-S., Wang, H.-C., Lee, C.-H. (2001). An SNR-incremental stochastic matching algorithm for noisy speech recognition. IEEE Transactions on Speech and Audio Processing 9 (8) : 866-873. ScholarBank@NUS Repository. https://doi.org/10.1109/89.966089
dc.identifier.issn	10636676
dc.identifier.uri	http://scholarbank.nus.edu.sg/handle/10635/39221
dc.description.abstract	In this paper, a signal-to-noise ratio (SNR)-incremental stochastic matching (SISM) algorithm is proposed for robust speech recognition in noisy environments. The SISM algorithm is an extension of Sankar and Lee's stochastic matching (SM) for dealing with the distortion due to additive noise. We address two issues concerning the original maximum likelihood-based SM techniques. One concern is that the initial condition of the expectation-maximization (EM) algorithm has to be set carefully if the mismatch between training and testing is large. The other is that the performance is often limited by the newly adapted model in noise compensation instead of reaching the higher level of accuracy often obtained in clean environments. Our proposed SISM algorithm attempts to improve the initial condition and to relax the performance bound. First, the SISM algorithm provides a good initial condition by making use of a set of environment-matched models. Second, it operates recursively: the reference model in each recursion is changed along the direction of increasing SNR in order to push the generation performance to that obtained at higher SNR levels. Experimental results show that the SISM algorithm provides further improvement after the best environment-matched performance has been reached, and can therefore obtain additional discriminative power by using speech models with higher SNR instead of a retraining process.
dc.description.uri	http://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1109/89.966089
dc.source	Scopus
dc.subject	Expectation-maximization (EM) algorithm
dc.subject	Robust speech recognition
dc.subject	Stochastic matching
dc.type	Article
dc.contributor.department	COMPUTER SCIENCE
dc.description.doi	10.1109/89.966089
dc.description.sourcetitle	IEEE Transactions on Speech and Audio Processing
dc.description.volume	9
dc.description.issue	8
dc.description.page	866-873
dc.description.coden	IESPE
dc.identifier.isiut	000172284600010
Appears in Collections:	Staff Publications

Files in This Item:

There are no files associated with this item.