Please use this identifier to cite or link to this item: https://doi.org/10.1117/12.631645
Title: Singing voice detection for karaoke application
Authors: Shenoy, A.
Wu, Y.
Wang, Y. 
Keywords: Inverse comb filtering
Karaoke
Key
Lyrics
Rhythm
Singing voice
Tonic
Vocal segmentation
Issue Date: 2005
Source: Shenoy, A., Wu, Y., Wang, Y. (2005). Singing voice detection for karaoke application. Proceedings of SPIE - The International Society for Optical Engineering 5960 (2) : 752-762. ScholarBank@NUS Repository. https://doi.org/10.1117/12.631645
Abstract: We present a framework to detect the regions of singing voice in musical audio signals. This work is oriented towards the development of a robust transcriber of lyrics for karaoke applications. The technique leverages on a combination of low-level audio features and higher level musical knowledge of rhythm and tonality. Musical knowledge of the key is used to create a song-specific filterbank to attenuate the presence of the pitched musical instruments. This is followed by subband processing of the audio to detect the musical octaves in which the vocals are present. Text processing is employed to approximate the duration of the sung passages using freely available lyrics. This is used to obtain a dynamic threshold for vocal/ non-vocal segmentation. This pairing of audio and text processing helps create a more accurate system. Experimental evaluation on a small database of popular songs shows the validity of the proposed approach. Holistic and per-component evaluation of the system is conducted and various improvements are discussed.
Source Title: Proceedings of SPIE - The International Society for Optical Engineering
URI: http://scholarbank.nus.edu.sg/handle/10635/41900
ISSN: 0277786X
DOI: 10.1117/12.631645
Appears in Collections:Staff Publications

Show full item record
Files in This Item:
There are no files associated with this item.

SCOPUSTM   
Citations

5
checked on Dec 6, 2017

WEB OF SCIENCETM
Citations

2
checked on Nov 18, 2017

Page view(s)

80
checked on Dec 10, 2017

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.