Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/135844
Title: FACTORIZED HIDDEN LAYER ADAPTATION FOR DEEP NEURAL NETWORK BASED ACOUSTIC MODELING
Authors: LAHIRU THILINA SAMARAKOON
Keywords: Automatic Speech Recognition, Deep Neural Network, Factorized Hidden Layer Adaptation
Issue Date: 9-Dec-2016
Citation: LAHIRU THILINA SAMARAKOON (2016-12-09). FACTORIZED HIDDEN LAYER ADAPTATION FOR DEEP NEURAL NETWORK BASED ACOUSTIC MODELING. ScholarBank@NUS Repository.
Abstract: The automatic speech recognition (ASR) technology have become widely popular since the deep neural networks (DNNs) have been integrated into ASR systems. However, DNNs are susceptible to performance degradations due to the mismatches between the training and testing conditions. Therefore, adaptation techniques are used to reduce this mismatch. This thesis proposes two methods to adapt the DNN acoustic models for ASR. Namely, factorized hidden layer (FHL) and subspace learning hidden unit contributions (LHUC). FHL aims at modeling speaker dependent (SD) hidden layers by representing an SD affine transformation as a linear combination of bases. The combination weights can be initialized using speaker representations like i-vectors and then reliably refined in an unsupervised adaptation fashion. The subspace LHUC improves the LHUC method by constructing a subspace to estimate the SD parameters. Experiments have shown that the both methods improve the ASR performance significantly over baselines and other state-of-the-art DNN adaptation approaches.
URI: http://scholarbank.nus.edu.sg/handle/10635/135844
Appears in Collections:Ph.D Theses (Open)

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
SamarakoonLT.pdf4.9 MBAdobe PDF

OPEN

NoneView/Download

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.