Please use this identifier to cite or link to this item:
https://doi.org/10.1109/ICDAR.2013.122
DC Field | Value | |
---|---|---|
dc.title | Recognition of video text through temporal integration | |
dc.contributor.author | Phan, T.Q. | |
dc.contributor.author | Shivakumara, P. | |
dc.contributor.author | Lu, T. | |
dc.contributor.author | Tan, C.L. | |
dc.date.accessioned | 2014-07-04T03:14:54Z | |
dc.date.available | 2014-07-04T03:14:54Z | |
dc.date.issued | 2013 | |
dc.identifier.citation | Phan, T.Q., Shivakumara, P., Lu, T., Tan, C.L. (2013). Recognition of video text through temporal integration. Proceedings of the International Conference on Document Analysis and Recognition, ICDAR : 589-593. ScholarBank@NUS Repository. https://doi.org/10.1109/ICDAR.2013.122 | |
dc.identifier.issn | 15205363 | |
dc.identifier.uri | http://scholarbank.nus.edu.sg/handle/10635/78315 | |
dc.description.abstract | This paper presents a method for temporal integration, which can be used to improve the recognition accuracy of video texts. Given a word detected in a video frame, we use a combination of Stroke Width Transform and SIFT (Scale Invariant Feature Transform) to track it both backward and forward in time. The text instances within the word's frame span are then extracted and aligned at pixel level. In the second step, we integrate these instances into a text probability map. By thresholding this map, we obtain an initial binarization of the word. In the final step, the shapes of the characters are refined using the intensity values. This helps to preserve the distinctive character features (e.g., sharp edges and holes), which are useful for OCR engines to distinguish between the different character classes. Experiments on English and German videos show that the proposed method outperforms existing ones in terms of recognition accuracy. © 2013 IEEE. | |
dc.description.uri | http://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1109/ICDAR.2013.122 | |
dc.source | Scopus | |
dc.subject | multiple frame integration | |
dc.subject | SIFT | |
dc.subject | Stroke Width Transform | |
dc.subject | temporal integration | |
dc.subject | text binarization | |
dc.subject | text enhancement | |
dc.subject | text probability | |
dc.subject | text tracking | |
dc.subject | video text recognition | |
dc.type | Conference Paper | |
dc.contributor.department | COMPUTER SCIENCE | |
dc.description.doi | 10.1109/ICDAR.2013.122 | |
dc.description.sourcetitle | Proceedings of the International Conference on Document Analysis and Recognition, ICDAR | |
dc.description.page | 589-593 | |
dc.identifier.isiut | 000343489100113 | |
Appears in Collections: | Staff Publications |
Show simple item record
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.