Please use this identifier to cite or link to this item: https://doi.org/10.1109/TPAMI.2014.2359435
DC FieldValue
dc.title3D reasoning from blocks to stability
dc.contributor.authorJia Z.
dc.contributor.authorGallagher A.C.
dc.contributor.authorSaxena A.
dc.contributor.authorChen T.
dc.date.accessioned2018-08-21T04:25:52Z
dc.date.available2018-08-21T04:25:52Z
dc.date.issued2015
dc.identifier.citationJia Z., Gallagher A.C., Saxena A., Chen T. (2015). 3D reasoning from blocks to stability. IEEE Transactions on Pattern Analysis and Machine Intelligence 37 (5) : 905-918. ScholarBank@NUS Repository. https://doi.org/10.1109/TPAMI.2014.2359435
dc.identifier.issn01628828
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/146077
dc.description.abstractObjects occupy physical space and obey physical laws. To truly understand a scene, we must reason about the space that objects in it occupy, and how each objects is supported stably by each other. In other words, we seek to understand which objects would, if moved, cause other objects to fall. This 3D volumetric reasoning is important for many scene understanding tasks, ranging from segmentation of objects to perception of a rich 3D, physically well-founded, interpretations of the scene. In this paper, we propose a new algorithm to parse a single RGB-D image with 3D block units while jointly reasoning about the segments, volumes, supporting relationships, and object stability. Our algorithm is based on the intuition that a good 3D representation of the scene is one that fits the depth data well, and is a stable, self-supporting arrangement of objects (i.e., one that does not topple). We design an energy function for representing the quality of the block representation based on these properties. Our algorithm fits 3D blocks to the depth values corresponding to image segments, and iteratively optimizes the energy function. Our proposed algorithm is the first to consider stability of objects in complex arrangements for reasoning about the underlying structure of the scene. Experimental results show that our stability-reasoning framework improves RGB-D segmentation and scene volumetric representation.
dc.publisherIEEE Computer Society
dc.sourceScopus
dc.subjectcomputer vision
dc.subjectscene understanding
dc.subjectSegmentation
dc.typeArticle
dc.contributor.departmentOFFICE OF THE PROVOST
dc.contributor.departmentDEPARTMENT OF COMPUTER SCIENCE
dc.description.doi10.1109/TPAMI.2014.2359435
dc.description.sourcetitleIEEE Transactions on Pattern Analysis and Machine Intelligence
dc.description.volume37
dc.description.issue5
dc.description.page905-918
dc.description.codenITPID
dc.published.statepublished
Appears in Collections:Staff Publications

Show simple item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.