Improving bottom-up saliency detection by looking into neighbors

Please use this identifier to cite or link to this item: https://doi.org/10.1109/TCSVT.2013.2248495

DC Field	Value
dc.title	Improving bottom-up saliency detection by looking into neighbors
dc.contributor.author	Lang, C.
dc.contributor.author	Feng, J.
dc.contributor.author	Liu, G.
dc.contributor.author	Tang, J.
dc.contributor.author	Yan, S.
dc.contributor.author	Luo, J.
dc.date.accessioned	2014-10-07T04:30:16Z
dc.date.available	2014-10-07T04:30:16Z
dc.date.issued	2013
dc.identifier.citation	Lang, C., Feng, J., Liu, G., Tang, J., Yan, S., Luo, J. (2013). Improving bottom-up saliency detection by looking into neighbors. IEEE Transactions on Circuits and Systems for Video Technology 23 (6) : 1016-1028. ScholarBank@NUS Repository. https://doi.org/10.1109/TCSVT.2013.2248495
dc.identifier.issn	10518215
dc.identifier.uri	http://scholarbank.nus.edu.sg/handle/10635/82513
dc.description.abstract	Bottom-up saliency detection aims to detect salient areas within natural images usually without learning from labeled images. Typically, the saliency map of an image is inferred by only using the information within this image (referred to as the 'current image'). While efficient, such single-image-based methods may fail to obtain reliable results, because the information within a single image may be insufficient for defining saliency. In this paper, we investigate how saliency detection can benefit from the nearest neighbor structure in the image space. First, we show that existing methods can be improved by extending them to include the visual neighborhood information. This verifies the significance of the neighbors. Next, a solution of multitask sparsity pursuit is proposed to integrate the current image and its neighbors to collaboratively detect saliency. The integration is done by first representing each image as a feature matrix, and then seeking the consistently sparse elements from the joint decompositions of multiple matrices into pairs of low-rank and sparse matrices. The computational procedure is formulated as a constrained nuclear norm and ℓ2,1-norm minimization problem, which is convex and can be solved efficiently with the augmented Lagrange multiplier method. Besides the nearest neighbor structure in the visual feature space, the proposed model can also be generalized to handle multiple visual features. Extensive experiments have clearly validated its superiority over other state-of-the-art methods. © 1991-2012 IEEE.
dc.description.uri	http://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1109/TCSVT.2013.2248495
dc.source	Scopus
dc.subject	Multimodal modeling
dc.subject	multitask learning
dc.subject	saliency detection
dc.subject	sparsity and low-rankness
dc.subject	visual attention
dc.type	Article
dc.contributor.department	ELECTRICAL & COMPUTER ENGINEERING
dc.description.doi	10.1109/TCSVT.2013.2248495
dc.description.sourcetitle	IEEE Transactions on Circuits and Systems for Video Technology
dc.description.volume	23
dc.description.issue	6
dc.description.page	1016-1028
dc.description.coden	ITCTE
dc.identifier.isiut	000321275400008
Appears in Collections:	Staff Publications

Show simple item record

Files in This Item:

There are no files associated with this item.

Google Scholar^TM

Check

Files in This Item:

Google ScholarTM

Altmetric

Google Scholar^TM