Please use this identifier to cite or link to this item: https://doi.org/10.21437/Interspeech.2017-301
DC FieldValue
dc.titleLongitudinal speaker clustering and verification corpus with code-switching Frisian-Dutch speech
dc.contributor.authorYilmaz E.
dc.contributor.authorJelske Dijkstra
dc.contributor.authorHans Van de Velde
dc.contributor.authorFrederik Kampstra
dc.contributor.authorJouke Algra
dc.contributor.authorHenk van den Heuvel
dc.contributor.authorDavid van Leeuwen
dc.date.accessioned2018-08-02T05:14:52Z
dc.date.available2018-08-02T05:14:52Z
dc.date.issued2017-08-01
dc.identifier.citationYilmaz E., Jelske Dijkstra, Hans Van de Velde, Frederik Kampstra, Jouke Algra, Henk van den Heuvel, David van Leeuwen (2017-08-01). Longitudinal speaker clustering and verification corpus with code-switching Frisian-Dutch speech. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2017-August : 37-41. ScholarBank@NUS Repository. https://doi.org/10.21437/Interspeech.2017-301
dc.identifier.issn2308457X
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/145523
dc.description.abstractIn this paper, we present a new longitudinal and bilingual broadcast database designed for speaker clustering and text-independent verification research. The broadcast data is extracted from the archives of Omrop Fryslân which is the regional broadcaster in the province of Fryslân, located in the north of the Netherlands. Two speaker verification tasks are provided in a standard enrollment-test setting with language consistent trials. The first task contains target trials from all speakers available appearing in at least two different programs, while the second task contains target trials from a subgroup of speakers appearing in programs recorded in multiple years. The second task is designed to investigate the effects of ageing on the accuracy of speaker verification systems. This database also contains unlabeled spoken segments from different radio programs for speaker clustering research. We provide the output of an existing speaker diarization system for baseline verification experiments. Finally, we present the baseline speaker verification results using the Kaldi GMM- and DNN-UBM speaker verification system. This database will be an extension to the recently presented open source Frisian data collection and it is publicly available for research purposes.
dc.language.isoen
dc.publisherInternational Speech Communication Association
dc.subjectAgeing effects, Bilingual data, Speaker clustering, Speaker diarization, Speaker verification
dc.typeConference Paper
dc.contributor.departmentELECTRICAL & COMPUTER ENGINEERING
dc.description.doi10.21437/Interspeech.2017-301
dc.description.sourcetitleProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
dc.description.volume2017-August
dc.description.page37-41
dc.published.statePublished
dc.grant.idNWO Project 314-99-119 (Frisian Audio Mining Enterprise)
dc.grant.fundingagencyNederlandse Organisatie voor Wetenschappelijk Onderzoek
Appears in Collections:Staff Publications
Elements

Show simple item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
Interspeech2017_3.pdfPreprint version316.64 kBAdobe PDF

OPEN

Pre-printView/Download

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.