Please use this identifier to cite or link to this item:
https://scholarbank.nus.edu.sg/handle/10635/146436
Title: | Speech recognition for acoustic-assisted video coding and animation | Authors: | Chen Homer H. Chou Wu Haskell Barry G. Chen Tsuhan |
Issue Date: | 1995 | Publisher: | Society of Photo-Optical Instrumentation Engineers, Bellingham, WA, United States | Citation: | Chen Homer H., Chou Wu, Haskell Barry G., Chen Tsuhan (1995). Speech recognition for acoustic-assisted video coding and animation. Proceedings of SPIE - The International Society for Optical Engineering 2501 (1/-) : 274-283. ScholarBank@NUS Repository. | Abstract: | In this paper, we discuss issues related to analysis and synthesis of facial images using speech information. An approach to speaker independent acoustic-assisted image coding and animation is studied. A perceptually based sliding window encoder is proposed. It utilizes the high rate (or oversampled) acoustic viseme sequence from the audio domain for image domain viseme interpolation and smoothing. The image domain visemes in our approach are dynamically constructed from a set of basic visemes. The look-ahead and look-back moving interpolations in the proposed approach provide an effective way to compensate the mismatch between auditory and visual perceptions. | Source Title: | Proceedings of SPIE - The International Society for Optical Engineering | URI: | http://scholarbank.nus.edu.sg/handle/10635/146436 | ISBN: | 891418587 | ISSN: | 0277786X |
Appears in Collections: | Staff Publications |
Show full item record
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.