Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/53706
DC FieldValue
dc.titleNoise-Robust Speech Recognition Using Deep Neural Network
dc.contributor.authorLI BO
dc.date.accessioned2014-05-31T18:02:52Z
dc.date.available2014-05-31T18:02:52Z
dc.date.issued2014-01-06
dc.identifier.citationLI BO (2014-01-06). Noise-Robust Speech Recognition Using Deep Neural Network. ScholarBank@NUS Repository.
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/53706
dc.description.abstractThis thesis addresses the noise robustness of the recently developed Deep Neural Networks (DNNs) based speech recognition systems. Five techniques have been proposed. Firstly, a Mean Variance Normalization technique was developed to integrate noise statistics using Vector Taylor Series. Secondly, a Deep Split Temporal Context system was proposed to separately model sub-contexts of a long temporal span of speech. Thirdly, we revisited the missing feature theory and developed a DNN-based spectral masking system. Fourthly, an Ideal Hidden-activation Mask (IHM) was proposed to remove noise-prone latent detectors. Lastly, a noise code technique was developed to simulate IHM with reduced computational costs. Improved noise robustness has been obtained using the proposed techniques on benchmark tasks, Aurora-2 and Aurora-4. The spectral masking approach successfully ranks first in the literature on both tasks at the time of writing and is one of the most promising noise-robust techniques for DNNs.
dc.language.isoen
dc.subjectSpeech Recognition, Deep Neural Network, Noise Robustness
dc.typeThesis
dc.contributor.departmentCOMPUTER SCIENCE
dc.contributor.supervisorSIM KHE CHAI
dc.description.degreePh.D
dc.description.degreeconferredDOCTOR OF PHILOSOPHY
dc.identifier.isiutNOT_IN_WOS
Appears in Collections:Ph.D Theses (Open)

Show simple item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
LiB.pdf1.89 MBAdobe PDF

OPEN

NoneView/Download

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.