Please use this identifier to cite or link to this item: https://doi.org/10.21437/Interspeech.2017-303
DC FieldValue
dc.titleMulti-stage DNN training for automatic recognition of dysarthric speech
dc.contributor.authorYilmaz E.
dc.contributor.authorMario Ganzeboom
dc.contributor.authorCatia Cucchiarini
dc.contributor.authorHelmer Strik
dc.date.accessioned2018-08-02T05:10:34Z
dc.date.available2018-08-02T05:10:34Z
dc.date.issued2017-08-01
dc.identifier.citationYilmaz E., Mario Ganzeboom, Catia Cucchiarini, Helmer Strik (2017-08-01). Multi-stage DNN training for automatic recognition of dysarthric speech. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2017-August : 2685-2689. ScholarBank@NUS Repository. https://doi.org/10.21437/Interspeech.2017-303
dc.identifier.issn2308457X
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/145522
dc.description.abstractIncorporating automatic speech recognition (ASR) in individualized speech training applications is becoming more viable thanks to the improved generalization capabilities of neural network-based acoustic models. The main problem in developing applications for dysarthric speech is the relative in-domain data scarcity. Collecting representative amounts of dysarthric speech data is difficult due to rigorous ethical and medical permission requirements, problems in accessing patients who are generally vulnerable and often subject to altering health conditions and, last but not least, the high variability in speech resulting from different pathological conditions. Developing such applications is even more challenging for languages which in general have fewer resources, fewer speakers and, consequently, also fewer patients than English, as in the case of a mid-sized language like Dutch. In this paper, we investigate a multi-stage deep neural network (DNN) training scheme aimed at obtaining better modeling of dysarthric speech by using only a small amount of in-domain training data. The results show that the system employing the proposed training scheme considerably improves the recognition of Dutch dysarthric speech compared to a baseline system with single-stage training only on a large amount of normal speech or a small amount of in-domain data.
dc.language.isoen
dc.publisherInternational Speech Communication Association
dc.subjectAutomatic speech recognition, Deep neural networks, Dysarthria, Pathological speech
dc.typeConference Paper
dc.contributor.departmentELECTRICAL & COMPUTER ENGINEERING
dc.description.doi10.21437/Interspeech.2017-303
dc.description.sourcetitleProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
dc.description.volume2017-August
dc.description.page2685-2689
dc.published.statePublished
dc.grant.idNWO 314-99-101 (CHASING)
dc.grant.fundingagencyNederlandse Organisatie voor Wetenschappelijk Onderzoek
Appears in Collections:Elements
Staff Publications

Show simple item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
Interspeech2017_2.pdfPreprint version96.46 kBAdobe PDF

OPEN

Pre-printView/Download

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.