Speech recognition for acoustic-assisted video coding and animation

Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/146436

Title:	Speech recognition for acoustic-assisted video coding and animation
Authors:	Chen Homer H., Chou Wu, Haskell Barry G., Chen Tsuhan
Issue Date:	1995
Publisher:	Society of Photo-Optical Instrumentation Engineers, Bellingham, WA, United States
Citation:	Chen Homer H., Chou Wu, Haskell Barry G., Chen Tsuhan (1995). Speech recognition for acoustic-assisted video coding and animation. Proceedings of SPIE - The International Society for Optical Engineering 2501 (1/-) : 274-283. ScholarBank@NUS Repository.
Abstract:	In this paper, we discuss issues related to the analysis and synthesis of facial images using speech information. An approach to speaker-independent acoustic-assisted image coding and animation is studied. A perceptually based sliding window encoder is proposed. It utilizes the high-rate (or oversampled) acoustic viseme sequence from the audio domain for image domain viseme interpolation and smoothing. The image domain visemes in our approach are dynamically constructed from a set of basic visemes. The look-ahead and look-back moving interpolations in the proposed approach provide an effective way to compensate for the mismatch between auditory and visual perceptions.
Source Title:	Proceedings of SPIE - The International Society for Optical Engineering
URI:	http://scholarbank.nus.edu.sg/handle/10635/146436
ISBN:	891418587
ISSN:	0277-786X
Appears in Collections:	Staff Publications

There are no files associated with this item.
