Recognition of video text through temporal integration

Please use this identifier to cite or link to this item: https://doi.org/10.1109/ICDAR.2013.122

DC Field	Value
dc.title	Recognition of video text through temporal integration
dc.contributor.author	Phan, T.Q.
dc.contributor.author	Shivakumara, P.
dc.contributor.author	Lu, T.
dc.contributor.author	Tan, C.L.
dc.date.accessioned	2014-07-04T03:14:54Z
dc.date.available	2014-07-04T03:14:54Z
dc.date.issued	2013
dc.identifier.citation	Phan, T.Q., Shivakumara, P., Lu, T., Tan, C.L. (2013). Recognition of video text through temporal integration. Proceedings of the International Conference on Document Analysis and Recognition, ICDAR : 589-593. ScholarBank@NUS Repository. https://doi.org/10.1109/ICDAR.2013.122
dc.identifier.issn	15205363
dc.identifier.uri	http://scholarbank.nus.edu.sg/handle/10635/78315
dc.description.abstract	This paper presents a method for temporal integration, which can be used to improve the recognition accuracy of video texts. Given a word detected in a video frame, we use a combination of Stroke Width Transform and SIFT (Scale Invariant Feature Transform) to track it both backward and forward in time. The text instances within the word's frame span are then extracted and aligned at pixel level. In the second step, we integrate these instances into a text probability map. By thresholding this map, we obtain an initial binarization of the word. In the final step, the shapes of the characters are refined using the intensity values. This helps to preserve the distinctive character features (e.g., sharp edges and holes), which are useful for OCR engines to distinguish between the different character classes. Experiments on English and German videos show that the proposed method outperforms existing ones in terms of recognition accuracy. © 2013 IEEE.
dc.description.uri	http://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1109/ICDAR.2013.122
dc.source	Scopus
dc.subject	multiple frame integration
dc.subject	SIFT
dc.subject	Stroke Width Transform
dc.subject	temporal integration
dc.subject	text binarization
dc.subject	text enhancement
dc.subject	text probability
dc.subject	text tracking
dc.subject	video text recognition
dc.type	Conference Paper
dc.contributor.department	COMPUTER SCIENCE
dc.description.doi	10.1109/ICDAR.2013.122
dc.description.sourcetitle	Proceedings of the International Conference on Document Analysis and Recognition, ICDAR
dc.description.page	589-593
dc.identifier.isiut	000343489100113
Appears in Collections:	Staff Publications

Show simple item record

Files in This Item:

There are no files associated with this item.

Google Scholar^TM

Check

Files in This Item:

Google ScholarTM

Altmetric

Google Scholar^TM