Please use this identifier to cite or link to this item:
https://doi.org/10.1145/1873951.1873967
Title: | A generic framework for event detection in various video domains | Authors: | Zhang, T. Xu, C. Zhu, G. Liu, S. Lu, H. |
Keywords: | broadcast video event detection internet multiple instance learning semi-supervised learning web-casting text |
Issue Date: | 2010 | Citation: | Zhang, T.,Xu, C.,Zhu, G.,Liu, S.,Lu, H. (2010). A generic framework for event detection in various video domains. MM'10 - Proceedings of the ACM Multimedia 2010 International Conference : 103-112. ScholarBank@NUS Repository. https://doi.org/10.1145/1873951.1873967 | Abstract: | Event detection is essential for the extensively studied video analysis and understanding area. Although various approaches have been proposed for event detection, there is a lack of a generic event detection framework that can be applied to various video domains (e.g. sports, news, movies, surveillance). In this paper, we present a generic event detection approach based on semi-supervised learning and Internet vision. Concretely, a Graph-based Semi-Supervised Multiple Instance Learning (GSSMIL) algorithm is proposed to jointly explore small-scale expert labeled videos and large-scale unlabeled videos to train the event models to detect video event boundaries. The expert labeled videos are obtained from the analysis and alignment of well-structured video related text (e.g. movie scripts, web-casting text, close caption). The unlabeled data are obtained by querying related events from the video search engine (e.g. YouTube) in order to give more distributive information for event modeling. A critical issue of GSSMIL in constructing a graph is the weight assignment, where the weight of an edge specifies the similarity between two data points. To tackle this problem, we propose a novel Multiple Instance Learning Induced Similarity (MILIS) measure by learning instance sensitive classifiers. We perform the thorough experiments in three popular video domains: movies, sports and news. The results compared with the state-of-the-arts are promising and demonstrate our proposed approach is performance-effective. © 2010 ACM. | Source Title: | MM'10 - Proceedings of the ACM Multimedia 2010 International Conference | URI: | http://scholarbank.nus.edu.sg/handle/10635/68823 | ISBN: | 9781605589336 | DOI: | 10.1145/1873951.1873967 |
Appears in Collections: | Staff Publications |
Show full item record
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.