Cross-modal prediction in audio-visual communication

Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/146426

DC Field	Value
dc.title	Cross-modal prediction in audio-visual communication
dc.contributor.author	Rao Ram R.
dc.contributor.author	Chen Tsuhan
dc.date.accessioned	2018-08-21T05:13:33Z
dc.date.available	2018-08-21T05:13:33Z
dc.date.issued	1996
dc.identifier.citation	Rao Ram R., Chen Tsuhan (1996). Cross-modal prediction in audio-visual communication. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings 4 : 2056-2059. ScholarBank@NUS Repository.
dc.identifier.issn	07367791
dc.identifier.uri	http://scholarbank.nus.edu.sg/handle/10635/146426
dc.description.abstract	In this paper, we present a novel means for predicting the shape of a person's mouth from the corresponding speech signal and explore applications of this prediction to video coding. The prediction is accomplished by modeling the probability distribution of the audio-visual features by a Gaussian mixture density. The optimal estimate for the visual features given the acoustic features can then be computed using this probability distribution. The ability to predict a person's mouth shape from the corresponding audio leads to a number of interesting joint audio-video coding strategies. In the cross-modal predictive coding system described in this paper, a model-based video coder compares measured visual parameters with predicted visual parameters, and sends the difference between the two to the receiver. Since the decoder also receives the acoustic data, it can form the prediction and then reconstruct the original parameters by adding the transmitted error signal.
dc.publisher	IEEE, Piscataway, NJ, United States
dc.source	Scopus
dc.type	Conference Paper
dc.contributor.department	OFFICE OF THE PROVOST
dc.contributor.department	DEPARTMENT OF COMPUTER SCIENCE
dc.description.sourcetitle	ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
dc.description.volume	4
dc.description.page	2056-2059
dc.description.coden	IPROD
dc.published.state	published
Appears in Collections:	Staff Publications
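
The prediction and coding steps described in the abstract can be illustrated with a minimal sketch. It assumes that the "optimal estimate" is the minimum mean-squared-error estimate, i.e. the conditional mean of the visual features given the acoustic features under a joint Gaussian mixture; the record does not state this, nor how the features are extracted or how the mixture is trained. All function names, the parameter layout (acoustic dimensions first, visual dimensions last) and the toy numbers are hypothetical and are not taken from the paper.

```python
import numpy as np


def gaussian_pdf(x, mean, cov):
    """Density of a multivariate Gaussian evaluated at the vector x."""
    d = x.shape[0]
    diff = x - mean
    norm = np.sqrt((2.0 * np.pi) ** d * np.linalg.det(cov))
    return float(np.exp(-0.5 * diff @ np.linalg.solve(cov, diff)) / norm)


def predict_visual(a, weights, means, covs, n_audio):
    """Conditional mean E[v | a] under a joint Gaussian mixture over [a, v].

    Assumed layout (hypothetical): each joint mean and covariance lists the
    n_audio acoustic dimensions first, followed by the visual dimensions.
    """
    resp = []        # unnormalised responsibility of each component for a
    cond_means = []  # per-component conditional mean of v given a
    for w, mu, cov in zip(weights, means, covs):
        mu_a, mu_v = mu[:n_audio], mu[n_audio:]
        cov_aa = cov[:n_audio, :n_audio]
        cov_va = cov[n_audio:, :n_audio]
        resp.append(w * gaussian_pdf(a, mu_a, cov_aa))
        cond_means.append(mu_v + cov_va @ np.linalg.solve(cov_aa, a - mu_a))
    resp = np.array(resp)
    resp = resp / resp.sum()
    return resp @ np.array(cond_means)


def encode(measured_v, a, model):
    """Encoder side: transmit only the prediction error for the visual parameters."""
    return measured_v - predict_visual(a, *model)


def decode(residual, a, model):
    """Decoder side: rebuild the visual parameters from the audio plus the error."""
    return predict_visual(a, *model) + residual


# Toy model: one component, one acoustic and one visual dimension.
toy_model = (
    [1.0],
    [np.array([0.0, 0.0])],
    [np.array([[1.0, 0.5], [0.5, 1.0]])],
    1,
)
audio = np.array([2.0])
mouth = np.array([1.3])
error = encode(mouth, audio, toy_model)
restored = decode(error, audio, toy_model)
```

Because the encoder and the decoder form the same prediction from the same acoustic data, adding the transmitted error to the prediction recovers the measured parameters; the closer the prediction, the smaller the error signal that has to be sent.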
