Please use this identifier to cite or link to this item:
https://doi.org/10.1016/j.patcog.2011.02.008
DC Field | Value | |
---|---|---|
dc.title | A novel mutual nearest neighbor based symmetry for text frame classification in video | |
dc.contributor.author | Shivakumara, P. | |
dc.contributor.author | Dutta, A. | |
dc.contributor.author | Quy Phan, T. | |
dc.contributor.author | Lim Tan, C. | |
dc.contributor.author | Pal, U. | |
dc.date.accessioned | 2013-07-04T07:42:05Z | |
dc.date.available | 2013-07-04T07:42:05Z | |
dc.date.issued | 2011 | |
dc.identifier.citation | Shivakumara, P., Dutta, A., Quy Phan, T., Lim Tan, C., Pal, U. (2011). A novel mutual nearest neighbor based symmetry for text frame classification in video. Pattern Recognition 44 (8) : 1671-1683. ScholarBank@NUS Repository. https://doi.org/10.1016/j.patcog.2011.02.008 | |
dc.identifier.issn | 00313203 | |
dc.identifier.uri | http://scholarbank.nus.edu.sg/handle/10635/39459 | |
dc.description.abstract | In the field of multimedia retrieval in video, text frame classification is essential for text detection, event detection, event boundary detection, etc. We propose a new text frame classification method that combines wavelet and median moment features with k-means clustering to select probable text blocks among 16 equally sized blocks of a video frame. The same feature combination is used with a new MaxMin clustering at the pixel level to choose probable dominant text pixels in the selected probable text blocks. For the probable text pixels, a so-called mutual nearest neighbor based symmetry is explored with a four-quadrant formation centered at the centroid of the probable dominant text pixels to determine whether a block is a true text block. If a frame produces at least one true text block, it is considered a text frame; otherwise, it is a non-text frame. Experimental results on different text and non-text datasets, including two public datasets and our own data, show that the proposed method gives promising results in terms of recall and precision at the block and frame levels. Further, we also show how existing text detection methods tend to misclassify non-text frames as text frames, in terms of recall and precision at both the block and frame levels. © 2011 Elsevier Ltd. All rights reserved. | |
dc.description.uri | http://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1016/j.patcog.2011.02.008 | |
dc.source | Scopus | |
dc.subject | Frame classification | |
dc.subject | Mutual nearest neighbor | |
dc.subject | Text block location | |
dc.subject | Video image | |
dc.subject | Wavelet-median moments | |
dc.type | Article | |
dc.contributor.department | COMPUTER SCIENCE | |
dc.description.doi | 10.1016/j.patcog.2011.02.008 | |
dc.description.sourcetitle | Pattern Recognition | |
dc.description.volume | 44 | |
dc.description.issue | 8 | |
dc.description.page | 1671-1683 | |
dc.description.coden | PTNRA | |
dc.identifier.isiut | 000290054200009 | |
Appears in Collections: | Staff Publications |