Please use this identifier to cite or link to this item: https://doi.org/10.1109/TPAMI.2011.232
DC Field: Value
dc.title: Toward holistic scene understanding: Feedback enabled cascaded classification models
dc.contributor.author: Li C.
dc.contributor.author: Kowdle A.
dc.contributor.author: Saxena A.
dc.contributor.author: Chen T.
dc.date.accessioned: 2018-08-21T04:58:28Z
dc.date.available: 2018-08-21T04:58:28Z
dc.date.issued: 2012
dc.identifier.citation: Li C., Kowdle A., Saxena A., Chen T. (2012). Toward holistic scene understanding: Feedback enabled cascaded classification models. IEEE Transactions on Pattern Analysis and Machine Intelligence, 34(7): 1394-1408. ScholarBank@NUS Repository. https://doi.org/10.1109/TPAMI.2011.232
dc.identifier.issn: 0162-8828
dc.identifier.uri: http://scholarbank.nus.edu.sg/handle/10635/146139
dc.description.abstract: Scene understanding comprises many related subtasks, such as scene categorization, depth estimation, and object detection. Each of these subtasks is often notoriously hard, and state-of-the-art classifiers already exist for many of them. These classifiers operate on the same raw image and provide correlated outputs. It is desirable to have an algorithm that can capture such correlation without requiring any changes to the inner workings of any classifier. We propose Feedback Enabled Cascaded Classification Models (FE-CCM), an approach that jointly optimizes all the subtasks while requiring only a black-box interface to the original classifier for each subtask. We use a two-layer cascade of classifiers, which are repeated instantiations of the original ones, with the output of the first layer fed into the second layer as input. Our training method involves a feedback step that allows later classifiers to provide earlier classifiers with information about which error modes to focus on. We show that our method significantly improves performance on all the subtasks in the domain of scene understanding, where we consider depth estimation, scene categorization, event categorization, object detection, geometric labeling, and saliency detection. Our method also improves performance in two robotic applications: an object-grasping robot and an object-finding robot.
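The cascade described in the abstract can be made concrete. Below is a minimal sketch of the two-layer FE-CCM idea, assuming scikit-learn-style black-box classifiers: LogisticRegression stands in for each subtask's original classifier, the first layer's probabilistic outputs are stacked onto the raw features before reaching the second layer, and the feedback step is approximated by a simple instance-reweighting heuristic. The function names (train_fe_ccm, predict_fe_ccm) and the reweighting rule are illustrative assumptions, not the paper's exact joint-optimization procedure.

import numpy as np
from sklearn.linear_model import LogisticRegression

def train_fe_ccm(X, Y_by_task, n_feedback_rounds=3):
    # Two-layer cascade: every subtask's first-layer output is appended to
    # the raw features and fed to a second-layer instantiation of the same
    # kind of classifier, so correlated subtasks can inform one another.
    tasks = list(Y_by_task)
    weights = {t: np.ones(len(X)) for t in tasks}  # feedback reweighting
    for _ in range(n_feedback_rounds):
        # Layer 1: independent "black box" classifiers, one per subtask.
        layer1 = {t: LogisticRegression(max_iter=1000)
                      .fit(X, Y_by_task[t], sample_weight=weights[t])
                  for t in tasks}
        # Stack all first-layer probabilistic outputs onto the raw features.
        Z = np.hstack([X] + [layer1[t].predict_proba(X) for t in tasks])
        # Layer 2: repeated instantiations that now see correlated outputs.
        layer2 = {t: LogisticRegression(max_iter=1000).fit(Z, Y_by_task[t])
                  for t in tasks}
        # Feedback (simplified heuristic): upweight the training instances
        # that the second layer still gets wrong, so earlier classifiers
        # focus on the error modes that matter downstream.
        for t in tasks:
            wrong = layer2[t].predict(Z) != Y_by_task[t]
            weights[t] = 1.0 + wrong.astype(float)
    return layer1, layer2

def predict_fe_ccm(layer1, layer2, X, task):
    # Rebuild the stacked features with the trained first layer, then
    # query the second-layer classifier for the requested subtask.
    Z = np.hstack([X] + [layer1[t].predict_proba(X) for t in layer1])
    return layer2[task].predict(Z)

With Y_by_task holding one label array per subtask for the same images, predict_fe_ccm(layer1, layer2, X_test, "scene") would return scene-category predictions informed by every other subtask's first-layer output, mirroring how the paper's second-layer classifiers exploit correlation across subtasks.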
dc.source: Scopus
dc.subject: classification
dc.subject: machine learning
dc.subject: robotics
dc.subject: Scene understanding
dc.type: Article
dc.contributor.department: OFFICE OF THE PROVOST
dc.contributor.department: DEPARTMENT OF COMPUTER SCIENCE
dc.description.doi: 10.1109/TPAMI.2011.232
dc.description.sourcetitle: IEEE Transactions on Pattern Analysis and Machine Intelligence
dc.description.volume: 34
dc.description.issue: 7
dc.description.page: 1394-1408
dc.description.coden: ITPID
dc.published.state: published
Appears in Collections: Staff Publications

Files in This Item:
There are no files associated with this item.

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.