Longitudinal speaker clustering and verification corpus with code-switching Frisian-Dutch speech | ScholarBank@NUS

Please use this identifier to cite or link to this item: https://doi.org/10.21437/Interspeech.2017-301

Title:	Longitudinal speaker clustering and verification corpus with code-switching Frisian-Dutch speech
Authors:	Yilmaz E. Jelske Dijkstra Hans Van de Velde Frederik Kampstra Jouke Algra Henk van den Heuvel David van Leeuwen
Keywords:	Ageing effects, Bilingual data, Speaker clustering, Speaker diarization, Speaker verification
Issue Date:	1-Aug-2017
Publisher:	International Speech Communication Association
Citation:	Yilmaz E., Jelske Dijkstra, Hans Van de Velde, Frederik Kampstra, Jouke Algra, Henk van den Heuvel, David van Leeuwen (2017-08-01). Longitudinal speaker clustering and verification corpus with code-switching Frisian-Dutch speech. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2017-August : 37-41. ScholarBank@NUS Repository. https://doi.org/10.21437/Interspeech.2017-301
Abstract:	In this paper, we present a new longitudinal and bilingual broadcast database designed for speaker clustering and text-independent verification research. The broadcast data is extracted from the archives of Omrop Fryslân which is the regional broadcaster in the province of Fryslân, located in the north of the Netherlands. Two speaker verification tasks are provided in a standard enrollment-test setting with language consistent trials. The first task contains target trials from all speakers available appearing in at least two different programs, while the second task contains target trials from a subgroup of speakers appearing in programs recorded in multiple years. The second task is designed to investigate the effects of ageing on the accuracy of speaker verification systems. This database also contains unlabeled spoken segments from different radio programs for speaker clustering research. We provide the output of an existing speaker diarization system for baseline verification experiments. Finally, we present the baseline speaker verification results using the Kaldi GMM- and DNN-UBM speaker verification system. This database will be an extension to the recently presented open source Frisian data collection and it is publicly available for research purposes.
Source Title:	Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
URI:	http://scholarbank.nus.edu.sg/handle/10635/145523
ISSN:	2308457X
DOI:	10.21437/Interspeech.2017-301
Appears in Collections:	Staff Publications Elements

Show full item record

Files in This Item:

File	Description	Size	Format	Access Settings	Version
Interspeech2017_3.pdf	Preprint version	316.64 kB	Adobe PDF	OPEN	Pre-print	View/Download

Google Scholar^TM

Check

Altmetric

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.