Please use this identifier to cite or link to this item:
https://doi.org/10.1145/2155555.2155565
DC Field | Value | |
---|---|---|
dc.title | Multi-video summary and skim generation of sensor-rich videos in geo-space | |
dc.contributor.author | Zhang, Y. | |
dc.contributor.author | Wang, G. | |
dc.contributor.author | Seo, B. | |
dc.contributor.author | Zimmermann, R. | |
dc.date.accessioned | 2013-07-15T05:25:54Z | |
dc.date.available | 2013-07-15T05:25:54Z | |
dc.date.issued | 2012 | |
dc.identifier.citation | Zhang, Y.,Wang, G.,Seo, B.,Zimmermann, R. (2012). Multi-video summary and skim generation of sensor-rich videos in geo-space. MMSys'12 - Proceedings of the 3rd Multimedia Systems Conference : 53-64. ScholarBank@NUS Repository. <a href="https://doi.org/10.1145/2155555.2155565" target="_blank">https://doi.org/10.1145/2155555.2155565</a> | |
dc.identifier.isbn | 9781450311311 | |
dc.identifier.uri | http://scholarbank.nus.edu.sg/handle/10635/42921 | |
dc.description.abstract | User-generated videos have become increasingly popular in recent years. Due to advances in camera technology it is now very easy and convenient to record videos with mobile devices, such as smartphones. Here we consider an application where users collect and share a large set of videos that are related to a geographic area, say a city. Such a repository can be a great source of information for prospective tourists when they plan to visit a city and would like to get a preview of its main areas. The challenge that we address is how to automatically create a preview video summary from a large set of source videos. The main features of our technique are that it is fully automatic and leverages meta-data sensor information which is acquired in conjunction with videos. The meta-data is collected from GPS and compass sensors and is used to describe the viewable scenes of the videos. Our method then proceeds in three steps through the analysis of the sensor data. First, we generate a single video summary. Shot boundaries are detected based on different motion types of camera movements and key frames are extracted related to motion patterns. Second, we build video skims for popular places (i.e., hotspots) aiming to provide maximal coverage of hotspot areas with minimal redundancy (per-spot multi-video summary). Finally, the individual hotspot skims are linked together to generate a pleasant video tour that visits all the popular places (multi-spot multi-video summary). © 2012 ACM. | |
dc.description.uri | http://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1145/2155555.2155565 | |
dc.source | Scopus | |
dc.subject | geo-tagging | |
dc.subject | key frame extraction | |
dc.subject | sensor data mining | |
dc.subject | video skim | |
dc.subject | video summarization | |
dc.type | Conference Paper | |
dc.contributor.department | INFORMATION SYSTEMS | |
dc.contributor.department | COMPUTER SCIENCE | |
dc.description.doi | 10.1145/2155555.2155565 | |
dc.description.sourcetitle | MMSys'12 - Proceedings of the 3rd Multimedia Systems Conference | |
dc.description.page | 53-64 | |
dc.identifier.isiut | NOT_IN_WOS | |
Appears in Collections: | Staff Publications |
Show simple item record
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.