Please use this identifier to cite or link to this item:
https://scholarbank.nus.edu.sg/handle/10635/77990
Title: | A two-stage speaker adaptation approach for subspace gaussian mixture model based nonnative speech recognition | Authors: | Li, B. Sim, K.C. |
Keywords: | Nonnative Speech Recognition Speaker Adaptation Subspace Gaussian Mixture Model |
Issue Date: | 2012 | Citation: | Li, B.,Sim, K.C. (2012). A two-stage speaker adaptation approach for subspace gaussian mixture model based nonnative speech recognition. 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012 2 : 1770-1773. ScholarBank@NUS Repository. | Abstract: | Nonnative speech recognition is becoming more and more important as many speech applications are deployed world wide. Meanwhile, due to the large population of nonnative speakers, speaker adaptation remains the most practical way for providing high performance speech services. Subspace Gaussian Mixture Model (SGMM) has recently been shown to yield superior performance on various native speech recognition tasks. In this paper, we investigated different speaker adaptation techniques of SGMM for nonnative speech recognition. A two-stage direct model adaptation approach has been proposed based on the analysis of SGMM model parameter functionalities. Our initial experiments have also verified that the proposed approach is much more effective than the traditional feature-space Maximum Likelihood Linear Regression(MLLR) on SGMM based nonnative speaker adaptation tasks. | Source Title: | 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012 | URI: | http://scholarbank.nus.edu.sg/handle/10635/77990 | ISBN: | 9781622767595 |
Appears in Collections: | Staff Publications |
Show full item record
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.