Please use this identifier to cite or link to this item:
https://doi.org/10.1109/ICASSP.2012.6288972
DC Field | Value | |
---|---|---|
dc.title | An investigation of tied-mixture GMM based triphone state clustering | |
dc.contributor.author | Wang, G. | |
dc.contributor.author | Sim, K.C. | |
dc.date.accessioned | 2013-07-04T08:18:14Z | |
dc.date.available | 2013-07-04T08:18:14Z | |
dc.date.issued | 2012 | |
dc.identifier.citation | Wang, G.,Sim, K.C. (2012). An investigation of tied-mixture GMM based triphone state clustering. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings : 4717-4720. ScholarBank@NUS Repository. <a href="https://doi.org/10.1109/ICASSP.2012.6288972" target="_blank">https://doi.org/10.1109/ICASSP.2012.6288972</a> | |
dc.identifier.isbn | 9781467300469 | |
dc.identifier.issn | 15206149 | |
dc.identifier.uri | http://scholarbank.nus.edu.sg/handle/10635/41040 | |
dc.description.abstract | Parameter tying is a crucial scheme for robust context dependent acoustic modeling since it takes a major role in balancing the desired model complexity and the amount of data available. In this paper, a modified decision tree state clustering scheme based on tied-mixture Gaussian Mixture Model (GMM) is proposed. Instead of using a single Gaussian untied triphone system, a tied-mixture GMM triphone system is adopted as a better acoustic model for state clustering. Meanwhile, the proposed scheme allows easy incorporation of discriminative training during clustering. Experimental results show that for a varying number of state clusters, the proposed approach consistently outperforms the standard single Gaussian based state tying. The best WER performance has a 10.5% relative improvement over the conventional decision tree clustering and the proposed scheme achieves its best performance using a much smaller number of state clusters. Moreover, detailed analyses reveal that the proposed GMM clustering has a better state distribution which leads to 1) better frame-state alignments 2) better phonetic question selections. These two factors may make the proposed approach superior for clustering. © 2012 IEEE. | |
dc.description.uri | http://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1109/ICASSP.2012.6288972 | |
dc.source | Scopus | |
dc.subject | context dependent modeling | |
dc.subject | phonetic decision tree | |
dc.subject | state clustering | |
dc.subject | Tied-mixture | |
dc.type | Conference Paper | |
dc.contributor.department | COMPUTER SCIENCE | |
dc.description.doi | 10.1109/ICASSP.2012.6288972 | |
dc.description.sourcetitle | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings | |
dc.description.page | 4717-4720 | |
dc.description.coden | IPROD | |
dc.identifier.isiut | NOT_IN_WOS | |
Appears in Collections: | Staff Publications |
Show simple item record
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.