Please use this identifier to cite or link to this item: https://doi.org/10.1109/TCSVT.2016.2602832
DC Field: Value
dc.title: Encoded Semantic Tree for Automatic User Profiling Applied to Personalized Video Summarization
dc.contributor.author: Yin, Yifang
dc.contributor.author: Thapliya, Roshan
dc.contributor.author: Zimmermann, Roger
dc.date.accessioned: 2021-09-20T07:48:34Z
dc.date.available: 2021-09-20T07:48:34Z
dc.date.issued: 2018-01-01
dc.identifier.citation: Yin, Yifang, Thapliya, Roshan, Zimmermann, Roger (2018-01-01). Encoded Semantic Tree for Automatic User Profiling Applied to Personalized Video Summarization. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY 28 (1) : 181-192. ScholarBank@NUS Repository. https://doi.org/10.1109/TCSVT.2016.2602832
dc.identifier.issn: 10518215
dc.identifier.issn: 15582205
dc.identifier.uri: https://scholarbank.nus.edu.sg/handle/10635/200728
dc.description.abstract: We propose an innovative method of automatic video summary generation with personal adaptations. User interests are mined from their personal image collections. To reduce the semantic gap, we propose to extract visual representations based on a novel semantic tree (SeTree). A SeTree is a hierarchy that captures the conceptual relationships between the visual scenes in a codebook. This idea builds upon the observation that such semantic connections among the elements have been overlooked in previous work. To construct the SeTree, we adopt a normalized graph cut clustering algorithm that conjunctively exploits visual features, textual information, and social user-image connections. Using this technique, we obtain an 8.1% improvement in normalized discounted cumulative gain for personalized video segment ranking compared with existing methods. Furthermore, to promote the interesting parts of a video, we extract a space-time saliency map and estimate the attractiveness of segments by kernel fitting and matching. A linear function combines the two factors, based on which the playback rate of the video is adapted to generate the summary. We play the less important segments in a fast-forward mode to keep users updated with the context. Subjective experiments showed that our proposed video summarization approach outperforms state-of-the-art techniques by 6.2%.
dc.language.iso: en
dc.publisher: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
dc.source: Elements
dc.subject: Science & Technology
dc.subject: Technology
dc.subject: Engineering, Electrical & Electronic
dc.subject: Engineering
dc.subject: Semantic modeling
dc.subject: user profiling
dc.subject: video summarization
dc.subject: visual attention
dc.subject: FRAMEWORK
dc.type: Article
dc.date.updated: 2021-09-19T15:33:10Z
dc.contributor.department: CHEMICAL & BIOMOLECULAR ENGINEERING
dc.description.doi: 10.1109/TCSVT.2016.2602832
dc.description.sourcetitle: IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY
dc.description.volume: 28
dc.description.issue: 1
dc.description.page: 181-192
dc.published.state: Published
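The abstract describes a linear function that combines two per-segment scores (personal interest and space-time saliency), with the result driving an adapted playback rate so that less important segments are fast-forwarded. A minimal sketch of that idea follows; the weight `alpha`, the rate bounds, and all function names are assumptions for illustration, not details taken from the paper.

```python
# Hypothetical sketch of the importance-to-playback-rate scheme outlined in
# the abstract. The weight alpha and the rate range [normal, fast] are
# assumed values, not parameters reported by the authors.

def segment_importance(interest, saliency, alpha=0.5):
    """Linear combination of the two factors, each assumed to lie in [0, 1]."""
    return alpha * interest + (1.0 - alpha) * saliency

def playback_rate(importance, normal=1.0, fast=4.0):
    """Map importance in [0, 1] to a playback rate: highly important segments
    play at normal speed, less important ones are fast-forwarded."""
    return fast - (fast - normal) * importance

# Two example segments: one interesting and salient, one neither.
rates = [playback_rate(segment_importance(i, s))
         for i, s in [(0.9, 0.8), (0.1, 0.2)]]
```

In this sketch the important segment receives a rate near normal speed while the unimportant one is played several times faster, matching the abstract's fast-forward behavior for low-importance context.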
Appears in Collections: Staff Publications; Elements

Files in This Item:
main.pdf | 2.1 MB | Adobe PDF | Access: CLOSED

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.