Please use this identifier to cite or link to this item: https://doi.org/10.1109/CVPR.2013.428
DC FieldValue
dc.titleIncorporating structural alternatives and sharing into hierarchy for multiclass object recognition and detection
dc.contributor.authorWang, X.
dc.contributor.authorLin, L.
dc.contributor.authorHuang, L.
dc.contributor.authorYan, S.
dc.date.accessioned2014-06-19T03:13:43Z
dc.date.available2014-06-19T03:13:43Z
dc.date.issued2013
dc.identifier.citationWang, X., Lin, L., Huang, L., Yan, S. (2013). Incorporating structural alternatives and sharing into hierarchy for multiclass object recognition and detection. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition : 3334-3341. ScholarBank@NUS Repository. https://doi.org/10.1109/CVPR.2013.428
dc.identifier.issn10636919
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/70577
dc.description.abstractThis paper proposes a reconfigurable model to recognize and detect multiclass (or multiview) objects with large variation in appearance. Compared with well acknowledged hierarchical models, we study two advanced capabilities in hierarchy for object modeling: (i)''switch'' variables(i.e. or-nodes) for specifying alternative compositions, and (ii) making local classifiers (i.e. leaf-nodes) shared among different classes. These capabilities enable us to account well for structural variabilities while preserving the model compact. Our model, in the form of an And-Or Graph, comprises four layers: a batch of leaf-nodes with collaborative edges in bottom for localizing object parts, the or-nodes over bottom to activate their children leaf-nodes, the and-nodes to classify objects as a whole, one root-node on the top for switching multiclass classification, which is also an or-node. For model training, we present an EM-type algorithm, namely dynamical structural optimization (DSO), to iteratively determine the structural configuration, (e.g., leaf-node generation associated with their parent or-nodes and shared across other classes), along with optimizing multi-layer parameters. The proposed method is valid on challenging databases, e.g., PASCAL VOC 2007 and UIUC-People, and it achieves state-of-the-arts performance. © 2013 IEEE.
dc.description.urihttp://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1109/CVPR.2013.428
dc.sourceScopus
dc.subjectand-or graph
dc.subjectmodel sharing
dc.subjectobject recognition
dc.subjectstructure learning
dc.typeConference Paper
dc.contributor.departmentELECTRICAL & COMPUTER ENGINEERING
dc.description.doi10.1109/CVPR.2013.428
dc.description.sourcetitleProceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
dc.description.page3334-3341
dc.description.codenPIVRE
dc.identifier.isiut000331094303052
Appears in Collections:Staff Publications

Show simple item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.