|Title:||Educational violin transcription by fusing multimedia streams|
|Authors:||Wang, Y. |
|Source:||Wang, Y., Zhang, B., Schleusing, O. (2007). Educational violin transcription by fusing multimedia streams. Proceedings of the ACM International Multimedia Conference and Exhibition: 57-66. ScholarBank@NUS Repository. https://doi.org/10.1145/1290144.1290154|
|Abstract:||Computer-assisted violin tutoring requires accurate violin transcription. For pitched non-percussive (PNP) sounds such as from the violin, note segmentation is a much more difficult task than pitch detection. This issue is accentuated when the audio is recorded during an instrument practice session at home which is acoustically inferior to a professional recording studio. This paper presents a new approach to the problem by using the correlation between different media streams for e-learning applications. We design a capture mechanism to record one audio and two video streams simultaneously, and exploit the relationships among them for enhanced transcription. State-of-the-art audio methods for note segmentation and pitch estimation are implemented as the audio-only baseline. Two web-cameras are employed to track the right hand (bowing) and the left hand's four fingers (fingering) on the fingerboard, respectively. The audio and visual information is then fused in the feature space. Our new approach is evaluated with an audio-visual violin music database containing 16 complete music pieces of different styles with 2157 notes in total. Experimental results show that our multimodal approach achieves a 10% increase in true positives, and an 8% reduction in false positives of overall transcription performance in comparison with the audio-only baseline. Copyright 2007 ACM.|
|Source Title:||Proceedings of the ACM International Multimedia Conference and Exhibition|
|Appears in Collections:||Staff Publications|