Please use this identifier to cite or link to this item:
https://doi.org/10.1007/978-3-642-33712-3_51
DC Field | Value |
---|---|
dc.title | Spatio-temporal phrases for activity recognition |
dc.contributor.author | Zhang Y. |
dc.contributor.author | Liu X. |
dc.contributor.author | Chang M.-C. |
dc.contributor.author | Ge W. |
dc.contributor.author | Chen T. |
dc.date.accessioned | 2018-08-21T04:57:41Z |
dc.date.available | 2018-08-21T04:57:41Z |
dc.date.issued | 2012 |
dc.identifier.citation | Zhang Y., Liu X., Chang M.-C., Ge W., Chen T. (2012). Spatio-temporal phrases for activity recognition. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 7574 LNCS (PART 3) : 707-721. ScholarBank@NUS Repository. https://doi.org/10.1007/978-3-642-33712-3_51 |
dc.identifier.isbn | 9783642337116 |
dc.identifier.issn | 03029743 |
dc.identifier.uri | http://scholarbank.nus.edu.sg/handle/10635/146128 |
dc.description.abstract | The local feature based approaches have become popular for activity recognition. A local feature captures the local movement and appearance of a local region in a video, and thus can be ambiguous; e.g., it cannot tell whether a movement is from a person's hand or foot, when the camera is far away from the person. To better distinguish different types of activities, people have proposed using the combination of local features to encode the relationships of local movements. Due to the computation limit, previous work only creates a combination from neighboring features in space and/or time. In this paper, we propose an approach that efficiently identifies both local and long-range motion interactions; taking the "push" activity as an example, our approach can capture the combination of the hand movement of one person and the foot response of another person, the local features of which are both spatially and temporally far away from each other. Our computational complexity is in linear time to the number of local features in a video. The extensive experiments show that our approach is generically effective for recognizing a wide variety of activities and activities spanning a long term, compared to a number of state-of-the-art methods. |
dc.source | Scopus |
dc.subject | Activity Recognition |
dc.subject | Spatio-Temporal Phrases |
dc.type | Conference Paper |
dc.contributor.department | OFFICE OF THE PROVOST |
dc.contributor.department | DEPARTMENT OF COMPUTER SCIENCE |
dc.description.doi | 10.1007/978-3-642-33712-3_51 |
dc.description.sourcetitle | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
dc.description.volume | 7574 LNCS |
dc.description.issue | PART 3 |
dc.description.page | 707-721 |
dc.published.state | published |
Appears in Collections: | Staff Publications |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.