Please use this identifier to cite or link to this item:
|Title:||A cross-modal approach for karaoke artifacts correction||Authors:||Yan, W.-Q.
|Issue Date:||2008||Citation:||Yan, W.-Q., Kankanhalli, M.S. (2008). A cross-modal approach for karaoke artifacts correction. Multimedia Tools and Applications 39 (3) : 413-439. ScholarBank@NUS Repository. https://doi.org/10.1007/s11042-007-0174-z||Abstract:||Karaoke singing is a popular form of entertainment in several parts of the world. Since this genre of performance attracts amateurs, the singing often has artifacts related to scale, tempo, and synchrony. We have developed an approach to correct these artifacts using cross-modal multimedia streams information. We first perform adaptive sampling on the user's rendition and then use the original singer's rendition as well as the video caption highlighting information in order to correct the pitch, tempo and the loudness. A method of analogies has been employed to perform this correction. The basic idea is to manipulate the user's rendition in a manner to make it as similar as possible to the original singing. A pre-processing step of noise removal due to feedback and huffing also helps improve the quality of the user's audio. The results are described in the paper which shows the effectiveness of this multimedia approach. © 2007 Springer Science+Business Media, LLC.||Source Title:||Multimedia Tools and Applications||URI:||http://scholarbank.nus.edu.sg/handle/10635/39255||ISSN:||13807501||DOI:||10.1007/s11042-007-0174-z|
|Appears in Collections:||Staff Publications|
Show full item record
Files in This Item:
There are no files associated with this item.
checked on Mar 4, 2021
checked on Mar 2, 2021
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.