Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/77990
Title: A two-stage speaker adaptation approach for subspace gaussian mixture model based nonnative speech recognition
Authors: Li, B.
Sim, K.C. 
Keywords: Nonnative Speech Recognition
Speaker Adaptation
Subspace Gaussian Mixture Model
Issue Date: 2012
Citation: Li, B.,Sim, K.C. (2012). A two-stage speaker adaptation approach for subspace gaussian mixture model based nonnative speech recognition. 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012 2 : 1770-1773. ScholarBank@NUS Repository.
Abstract: Nonnative speech recognition is becoming more and more important as many speech applications are deployed world wide. Meanwhile, due to the large population of nonnative speakers, speaker adaptation remains the most practical way for providing high performance speech services. Subspace Gaussian Mixture Model (SGMM) has recently been shown to yield superior performance on various native speech recognition tasks. In this paper, we investigated different speaker adaptation techniques of SGMM for nonnative speech recognition. A two-stage direct model adaptation approach has been proposed based on the analysis of SGMM model parameter functionalities. Our initial experiments have also verified that the proposed approach is much more effective than the traditional feature-space Maximum Likelihood Linear Regression(MLLR) on SGMM based nonnative speaker adaptation tasks.
Source Title: 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
URI: http://scholarbank.nus.edu.sg/handle/10635/77990
ISBN: 9781622767595
Appears in Collections:Staff Publications

Show full item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.