Please use this identifier to cite or link to this item: https://doi.org/10.1109/JSTSP.2020.3038054
Title: Parameter tuning-free missing-feature reconstruction for robust sound recognition
Authors: LIU QI 
JIBIN WU 
Keywords: automatic speech recognition (ASR)
deep neural networks (DNNs)
environmental sound classification
matrix factorization
Missing-feature reconstruction
Issue Date: 9-Jan-2021
Publisher: IEEE Journal of Selected Topics in Signal Processing
Citation: LIU QI, JIBIN WU (2021-01-09). Parameter tuning-free missing-feature reconstruction for robust sound recognition 15 (1) : 78 - 89. ScholarBank@NUS Repository. https://doi.org/10.1109/JSTSP.2020.3038054
Abstract: With the advent of the deep neural network, automatic speech recognition (ASR) has seen significant improvements in recent years. However, ASR performance degrades rapidly when the acoustic environment, such as communication channels or noise backgrounds, differ from those of training data. In the missing feature approach to speech processing, the unreliable feature components are identified and reconstructed to overcome signal degradation and the mismatch of the acoustic environment. To reduce the model dependency, we investigate the matrix completion technique in missing feature reconstruction tasks. However, most of the matrix completion techniques require a priori tuning parameters, e.g., target rank, which is hard to determine in practice. In this work, we propose a matrix completion method based on matrix factorization for the missing-feature reconstruction task, that does not require model training nor parameter tuning. Experiments show superior feature reconstruction performance and computational efficiency in both speech recognition and environmental sound classification tasks. © 2007-2012 IEEE.
URI: https://scholarbank.nus.edu.sg/handle/10635/187065
ISBN: 19324553
DOI: 10.1109/JSTSP.2020.3038054
Appears in Collections:Elements
Staff Publications

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
FINAL VERSION.pdf671.67 kBAdobe PDF

OPEN

Post-printView/Download

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.