Longitudinal speaker clustering and verification corpus with code-switching Frisian-Dutch speech

Please use this identifier to cite or link to this item: https://doi.org/10.21437/Interspeech.2017-301

DC Field	Value
dc.title	Longitudinal speaker clustering and verification corpus with code-switching Frisian-Dutch speech
dc.contributor.author	Yilmaz E.
dc.contributor.author	Jelske Dijkstra
dc.contributor.author	Hans Van de Velde
dc.contributor.author	Frederik Kampstra
dc.contributor.author	Jouke Algra
dc.contributor.author	Henk van den Heuvel
dc.contributor.author	David van Leeuwen
dc.date.accessioned	2018-08-02T05:14:52Z
dc.date.available	2018-08-02T05:14:52Z
dc.date.issued	2017-08-01
dc.identifier.citation	Yilmaz E., Jelske Dijkstra, Hans Van de Velde, Frederik Kampstra, Jouke Algra, Henk van den Heuvel, David van Leeuwen (2017-08-01). Longitudinal speaker clustering and verification corpus with code-switching Frisian-Dutch speech. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2017-August : 37-41. ScholarBank@NUS Repository. https://doi.org/10.21437/Interspeech.2017-301
dc.identifier.issn	2308457X
dc.identifier.uri	http://scholarbank.nus.edu.sg/handle/10635/145523
dc.description.abstract	In this paper, we present a new longitudinal and bilingual broadcast database designed for speaker clustering and text-independent verification research. The broadcast data is extracted from the archives of Omrop Fryslân which is the regional broadcaster in the province of Fryslân, located in the north of the Netherlands. Two speaker verification tasks are provided in a standard enrollment-test setting with language consistent trials. The first task contains target trials from all speakers available appearing in at least two different programs, while the second task contains target trials from a subgroup of speakers appearing in programs recorded in multiple years. The second task is designed to investigate the effects of ageing on the accuracy of speaker verification systems. This database also contains unlabeled spoken segments from different radio programs for speaker clustering research. We provide the output of an existing speaker diarization system for baseline verification experiments. Finally, we present the baseline speaker verification results using the Kaldi GMM- and DNN-UBM speaker verification system. This database will be an extension to the recently presented open source Frisian data collection and it is publicly available for research purposes.
dc.language.iso	en
dc.publisher	International Speech Communication Association
dc.subject	Ageing effects, Bilingual data, Speaker clustering, Speaker diarization, Speaker verification
dc.type	Conference Paper
dc.contributor.department	ELECTRICAL & COMPUTER ENGINEERING
dc.description.doi	10.21437/Interspeech.2017-301
dc.description.sourcetitle	Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
dc.description.volume	2017-August
dc.description.page	37-41
dc.published.state	Published
dc.grant.id	NWO Project 314-99-119 (Frisian Audio Mining Enterprise)
dc.grant.fundingagency	Nederlandse Organisatie voor Wetenschappelijk Onderzoek
Appears in Collections:	Staff Publications Elements

Show simple item record

Files in This Item:

File	Description	Size	Format	Access Settings	Version
Interspeech2017_3.pdf	Preprint version	316.64 kB	Adobe PDF	OPEN	Pre-print	View/Download

Google Scholar^TM

Check

Files in This Item:

Google ScholarTM

Altmetric

Google Scholar^TM