Please use this identifier to cite or link to this item:
Title: Mental Visual Indexing: Towards Fast Video Browsing
Authors: Richang Hong
Jun He
Hanwang Zhang 
Tat-Seng Chua 
Keywords: Mental search
Query intent
Temporal model
Video browsing
Issue Date: 15-Oct-2016
Publisher: Association for Computing Machinery, Inc
Citation: Richang Hong, Jun He, Hanwang Zhang, Tat-Seng Chua (2016-10-15). Mental Visual Indexing: Towards Fast Video Browsing. ACM Multimedia Conference 2016 : 621-625. ScholarBank@NUS Repository.
Abstract: Video browsing describes an interactive process where users want to find a target shot in a long video. Therefore, it is crucial for a video browsing system to be fast and accurate with minimum user effort. In sharp contrast to traditional Relevance Feedback (RF), we propose a novel paradigm for fast video browsing dubbed Mental Visual Indexing (MVI). At each interactive round, the user only needs to select one of the displayed shots that is most visually similar to her mental target and then the user's choice will further tailor the search to the target. The search model update given a user feedback only requires vector inner products, which makes MVI highly responsive. MVI is underpinned by a sequence model in terms of Recurrent Neural Network (RNN), which is trained by automatically generated shot sequences from a rigorous Bayesian framework, which simulates user feedback process. Experimental results on three 3-hour movies conducted by real users demonstrate the effectiveness of the proposed approach. © 2016 ACM.
Source Title: ACM Multimedia Conference 2016
ISBN: 9781450336031
DOI: 10.1145/2964284.2967296
Appears in Collections:Staff Publications

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
Mental Visual Indexing.pdf1.81 MBAdobe PDF




checked on Jan 16, 2021

Page view(s)

checked on Jan 15, 2021


checked on Jan 15, 2021

Google ScholarTM



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.