Title: Weakly supervised photo cropping
Authors: Zhang, L.
Song, M.
Yang, Y.
Zhao, Q. 
Zhao, C.
Sebe, N.
Keywords: Bayesian network
image-level semantics
photo cropping
weakly supervised
Issue Date: Jan-2014
Citation: Zhang, L., Song, M., Yang, Y., Zhao, Q., Zhao, C., Sebe, N. (2014-01). Weakly supervised photo cropping. IEEE Transactions on Multimedia 16 (1) : 94-107. ScholarBank@NUS Repository.
Abstract: Photo cropping is widely used in the printing industry, photography, and cinematography. Conventional photo cropping methods suffer from three drawbacks: 1) the semantics used to describe photo aesthetics are determined by the experience of model designers and by specific data sets, 2) image global configurations, an essential cue for capturing photo aesthetics, are not well preserved in the cropped photo, and 3) multi-channel visual features from an image region contribute differently to human aesthetics, but state-of-the-art photo cropping methods cannot weight them automatically. Owing to recent progress in the image retrieval community, image-level semantics, i.e., photo labels obtained without much human supervision, can be acquired efficiently and effectively. Thus, we propose weakly supervised photo cropping, in which a manifold embedding algorithm is developed to incorporate image-level semantics and image global configurations into graphlets, i.e., small-sized connected subgraphs. After manifold embedding, a Bayesian Network (BN) is proposed. It incorporates the test photo into the framework derived from the multi-channel post-embedding graphlets of the training data, the importance of which is determined automatically. Based on the BN, photo cropping can be cast as searching for the candidate cropped photo that maximally preserves graphlets from the training photos, and the optimal cropping parameter is inferred by Gibbs sampling. Subjective evaluations demonstrate that: 1) our approach outperforms several representative photo cropping methods, including our previous cropping model that is guided by semantics-free graphlets, and 2) the visualized graphlets explicitly capture photo semantics and global spatial configurations. © 1999-2012 IEEE.
Source Title: IEEE Transactions on Multimedia
ISSN: 1520-9210
DOI: 10.1109/TMM.2013.2286817
Appears in Collections: Staff Publications

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.