Multi-video summary and skim generation of sensor-rich videos in geo-space

Please use this identifier to cite or link to this item: https://doi.org/10.1145/2155555.2155565

DC Field	Value
dc.title	Multi-video summary and skim generation of sensor-rich videos in geo-space
dc.contributor.author	Zhang, Y.
dc.contributor.author	Wang, G.
dc.contributor.author	Seo, B.
dc.contributor.author	Zimmermann, R.
dc.date.accessioned	2013-07-15T05:25:54Z
dc.date.available	2013-07-15T05:25:54Z
dc.date.issued	2012
dc.identifier.citation	Zhang, Y.,Wang, G.,Seo, B.,Zimmermann, R. (2012). Multi-video summary and skim generation of sensor-rich videos in geo-space. MMSys'12 - Proceedings of the 3rd Multimedia Systems Conference : 53-64. ScholarBank@NUS Repository. <a href="https://doi.org/10.1145/2155555.2155565" target="_blank">https://doi.org/10.1145/2155555.2155565</a>
dc.identifier.isbn	9781450311311
dc.identifier.uri	http://scholarbank.nus.edu.sg/handle/10635/42921
dc.description.abstract	User-generated videos have become increasingly popular in recent years. Due to advances in camera technology it is now very easy and convenient to record videos with mobile devices, such as smartphones. Here we consider an application where users collect and share a large set of videos that are related to a geographic area, say a city. Such a repository can be a great source of information for prospective tourists when they plan to visit a city and would like to get a preview of its main areas. The challenge that we address is how to automatically create a preview video summary from a large set of source videos. The main features of our technique are that it is fully automatic and leverages meta-data sensor information which is acquired in conjunction with videos. The meta-data is collected from GPS and compass sensors and is used to describe the viewable scenes of the videos. Our method then proceeds in three steps through the analysis of the sensor data. First, we generate a single video summary. Shot boundaries are detected based on different motion types of camera movements and key frames are extracted related to motion patterns. Second, we build video skims for popular places (i.e., hotspots) aiming to provide maximal coverage of hotspot areas with minimal redundancy (per-spot multi-video summary). Finally, the individual hotspot skims are linked together to generate a pleasant video tour that visits all the popular places (multi-spot multi-video summary). © 2012 ACM.
dc.description.uri	http://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1145/2155555.2155565
dc.source	Scopus
dc.subject	geo-tagging
dc.subject	key frame extraction
dc.subject	sensor data mining
dc.subject	video skim
dc.subject	video summarization
dc.type	Conference Paper
dc.contributor.department	INFORMATION SYSTEMS
dc.contributor.department	COMPUTER SCIENCE
dc.description.doi	10.1145/2155555.2155565
dc.description.sourcetitle	MMSys'12 - Proceedings of the 3rd Multimedia Systems Conference
dc.description.page	53-64
dc.identifier.isiut	NOT_IN_WOS
Appears in Collections:	Staff Publications

Show simple item record

Files in This Item:

There are no files associated with this item.

Google Scholar^TM

Check

Files in This Item:

Google ScholarTM

Altmetric

Google Scholar^TM