Please use this identifier to cite or link to this item: https://doi.org/10.1007/s11042-007-0174-z
Title: A cross-modal approach for karaoke artifacts correction
Authors: Yan, W.-Q.
Kankanhalli, M.S. 
Keywords: Adaptive sampling
Artifacts handling
Karaoke
Issue Date: 2008
Source: Yan, W.-Q., Kankanhalli, M.S. (2008). A cross-modal approach for karaoke artifacts correction. Multimedia Tools and Applications 39 (3): 413-439. ScholarBank@NUS Repository. https://doi.org/10.1007/s11042-007-0174-z
Abstract: Karaoke singing is a popular form of entertainment in several parts of the world. Since this genre of performance attracts amateurs, the singing often has artifacts related to scale, tempo, and synchrony. We have developed an approach to correct these artifacts using information from cross-modal multimedia streams. We first perform adaptive sampling on the user's rendition and then use the original singer's rendition, as well as the video caption highlighting information, to correct the pitch, tempo, and loudness. A method of analogies is employed to perform this correction. The basic idea is to manipulate the user's rendition to make it as similar as possible to the original singing. A pre-processing step that removes noise due to feedback and huffing also helps improve the quality of the user's audio. The results described in the paper show the effectiveness of this multimedia approach. © 2007 Springer Science+Business Media, LLC.
Source Title: Multimedia Tools and Applications
URI: http://scholarbank.nus.edu.sg/handle/10635/39255
ISSN: 1380-7501
DOI: 10.1007/s11042-007-0174-z
Appears in Collections: Staff Publications

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.