Hidden Markov model based visual speech recognition | ScholarBank@NUS

Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/14759

Title:	Hidden Markov model based visual speech recognition
Authors:	DONG LIANG
Keywords:	Hidden Markov Model, visual speech recognition, viseme, separable distance, adaptive boosting, level building
Issue Date:	2-Jun-2005
Citation:	DONG LIANG (2005-06-02). Hidden Markov model based visual speech recognition. ScholarBank@NUS Repository.
Abstract:	Speech recognition can be made more accurate if visual speech information such as the movement of the lips is taken into consideration. In this thesis, studies on visual speech processing are presented. Classifiers based on Hidden Markov Model (HMM) are first explored for modeling and identifying the basic visual speech elements. Considering that the temporal features of visual speech elements may be confusable and sensitive to their contexts, three novel training strategies, referred to as two-channel training strategy, Maximum Separable Distance training strategy and HMM Adaptive Boosting strategy, are proposed to improve the discriminative power and robustness of an HMM classifier. Following that, approaches for recognizing words, phrases and connected-digit units in visual speech are presented with exploration of level building method and Viterbi searching algorithm. The thesis also covers the studies of 3D lip tracking and visual speech mapping between different speakers. These approaches may extend the applicability of a visual speech processing system to unfavorable conditions.
URI:	http://scholarbank.nus.edu.sg/handle/10635/14759
Appears in Collections:	Ph.D Theses (Open)

Show full item record

Files in This Item:

File	Description	Size	Format	Access Settings	Version
Hidden Markov Model Based Visual Speech Recognition.pdf		1.75 MB	Adobe PDF	OPEN	None	View/Download

Google Scholar^TM

Check

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.