A novel mutual nearest neighbor based symmetry for text frame classification in video

Please use this identifier to cite or link to this item: https://doi.org/10.1016/j.patcog.2011.02.008

DC Field	Value
dc.title	A novel mutual nearest neighbor based symmetry for text frame classification in video
dc.contributor.author	Shivakumara, P.
dc.contributor.author	Dutta, A.
dc.contributor.author	Quy Phan, T.
dc.contributor.author	Lim Tan, C.
dc.contributor.author	Pal, U.
dc.date.accessioned	2013-07-04T07:42:05Z
dc.date.available	2013-07-04T07:42:05Z
dc.date.issued	2011
dc.identifier.citation	Shivakumara, P., Dutta, A., Quy Phan, T., Lim Tan, C., Pal, U. (2011). A novel mutual nearest neighbor based symmetry for text frame classification in video. Pattern Recognition 44 (8) : 1671-1683. ScholarBank@NUS Repository. https://doi.org/10.1016/j.patcog.2011.02.008
dc.identifier.issn	00313203
dc.identifier.uri	http://scholarbank.nus.edu.sg/handle/10635/39459
dc.description.abstract	In the field of multimedia retrieval in video, text frame classification is essential for text detection, event detection, event boundary detection, etc. We propose a new text frame classification method that introduces a combination of wavelet and median moment with k-means clustering to select probable text blocks among 16 equally sized blocks of a video frame. The same feature combination is used with a new MaxMin clustering at the pixel level to choose probable dominant text pixels in the selected probable text blocks. For the probable text pixels, a so-called mutual nearest neighbor based symmetry is explored with a four-quadrant formation centered at the centroid of the probable dominant text pixels to determine whether a block is a true text block. If a frame produces at least one true text block, it is considered a text frame; otherwise, it is a non-text frame. Experimental results on different text and non-text datasets, including two public datasets and our own data, show that the proposed method gives promising results in terms of recall and precision at the block and frame levels. Further, we also show how existing text detection methods tend to misclassify non-text frames as text frames in terms of recall and precision at both the block and frame levels. © 2011 Elsevier Ltd. All rights reserved.
dc.description.uri	http://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1016/j.patcog.2011.02.008
dc.source	Scopus
dc.subject	Frame classification
dc.subject	Mutual nearest neighbor
dc.subject	Text block location
dc.subject	Video image
dc.subject	Wavelet-median moments
dc.type	Article
dc.contributor.department	COMPUTER SCIENCE
dc.description.doi	10.1016/j.patcog.2011.02.008
dc.description.sourcetitle	Pattern Recognition
dc.description.volume	44
dc.description.issue	8
dc.description.page	1671-1683
dc.description.coden	PTNRA
dc.identifier.isiut	000290054200009
Appears in Collections:	Staff Publications
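
The abstract refers to mutual nearest neighbors and to a frame-level rule (a frame is a text frame if at least one block is a true text block). The sketch below illustrates only those two general ideas and is not the authors' implementation: the function names, the use of Euclidean distance, and the sample points are assumptions made for illustration, and the paper's wavelet-median features, clustering steps, and four-quadrant symmetry test are not reproduced here.

```python
from math import dist  # Euclidean distance, available in Python 3.8+


def nearest_neighbor(points, i):
    """Index of the point closest to points[i], excluding i itself."""
    return min(
        (j for j in range(len(points)) if j != i),
        key=lambda j: dist(points[i], points[j]),
    )


def mutual_nearest_pairs(points):
    """Return sorted (i, j) pairs, i < j, where each point is the other's nearest neighbor."""
    if len(points) < 2:
        return []
    nn = [nearest_neighbor(points, i) for i in range(len(points))]
    return sorted((i, j) for i, j in enumerate(nn) if i < j and nn[j] == i)


def is_text_frame(block_is_text):
    """Frame-level rule from the abstract: text frame if any block is a true text block."""
    return any(block_is_text)


# Hypothetical pixel coordinates, chosen only to demonstrate the pairing.
sample_points = [(0, 0), (1, 0), (5, 5), (5, 6), (10, 0)]
pairs = mutual_nearest_pairs(sample_points)
print(pairs)
print(is_text_frame([False] * 15 + [True]))
```

In this sample, the point at index 4 has a nearest neighbor (index 2) that does not choose it back, so it forms no mutual pair. How such pairs feed the symmetry decision is described in the paper itself, not in this record.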

Files in This Item:

There are no files associated with this item.