Please use this identifier to cite or link to this item:
https://doi.org/10.1109/JSTSP.2020.3038054
Title: | Parameter tuning-free missing-feature reconstruction for robust sound recognition | Authors: | LIU QI JIBIN WU |
Keywords: | automatic speech recognition (ASR) deep neural networks (DNNs) environmental sound classification matrix factorization Missing-feature reconstruction |
Issue Date: | 9-Jan-2021 | Publisher: | IEEE Journal of Selected Topics in Signal Processing | Citation: | LIU QI, JIBIN WU (2021-01-09). Parameter tuning-free missing-feature reconstruction for robust sound recognition 15 (1) : 78 - 89. ScholarBank@NUS Repository. https://doi.org/10.1109/JSTSP.2020.3038054 | Abstract: | With the advent of the deep neural network, automatic speech recognition (ASR) has seen significant improvements in recent years. However, ASR performance degrades rapidly when the acoustic environment, such as communication channels or noise backgrounds, differ from those of training data. In the missing feature approach to speech processing, the unreliable feature components are identified and reconstructed to overcome signal degradation and the mismatch of the acoustic environment. To reduce the model dependency, we investigate the matrix completion technique in missing feature reconstruction tasks. However, most of the matrix completion techniques require a priori tuning parameters, e.g., target rank, which is hard to determine in practice. In this work, we propose a matrix completion method based on matrix factorization for the missing-feature reconstruction task, that does not require model training nor parameter tuning. Experiments show superior feature reconstruction performance and computational efficiency in both speech recognition and environmental sound classification tasks. © 2007-2012 IEEE. | URI: | https://scholarbank.nus.edu.sg/handle/10635/187065 | ISBN: | 19324553 | DOI: | 10.1109/JSTSP.2020.3038054 |
Appears in Collections: | Elements Staff Publications |
Show full item record
Files in This Item:
File | Description | Size | Format | Access Settings | Version | |
---|---|---|---|---|---|---|
FINAL VERSION.pdf | 671.67 kB | Adobe PDF | OPEN | Post-print | View/Download |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.