中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Actively Learning Human Gaze Shifting Paths for Semantics-Aware Photo Cropping

文献类型:期刊论文

作者Zhang, Luming1; Gao, Yue2; Ji, Rongrong3; Xia, Yingjie4; Dai, Qionghai2; Li, Xuelong5
刊名ieee transactions on image processing
出版日期2014-05-01
卷号23期号:5页码:2235-2245
关键词Photo cropping semantics active graphlet path aesthetics
ISSN号1070-986x
英文摘要photo cropping is a widely used tool in printing industry, photography, and cinematography. conventional cropping models suffer from the following three challenges. first, the deemphasized role of semantic contents that are many times more important than low-level features in photo aesthetics. second, the absence of a sequential ordering in the existing models. in contrast, humans look at semantically important regions sequentially when viewing a photo. third, the difficulty of leveraging inputs from multiple users. experience from multiple users is particularly critical in cropping as photo assessment is quite a subjective task. to address these challenges, this paper proposes semantics-aware photo cropping, which crops a photo by simulating the process of humans sequentially perceiving semantically important regions of a photo. we first project the local features (graphlets in this paper) onto the semantic space, which is constructed based on the category information of the training photos. an efficient learning algorithm is then derived to sequentially select semantically representative graphlets of a photo, and the selecting process can be interpreted by a path, which simulates humans actively perceiving semantics in a photo. furthermore, we learn a prior distribution of such active graphlet paths from training photos that are marked as aesthetically pleasing by multiple users. the learned priors enforce the corresponding active graphlet path of a test photo to be maximally similar to those from the training photos. experimental results show that: 1) the active graphlet path accurately predicts human gaze shifting, and thus is more indicative for photo aesthetics than conventional saliency maps and 2) the cropped photos produced by our approach outperform its competitors in both qualitative and quantitative comparisons.
WOS标题词science & technology ; technology
类目[WOS]computer science, artificial intelligence ; engineering, electrical & electronic
研究领域[WOS]computer science ; engineering
关键词[WOS]object retrieval ; recognition ; classification ; manifold
收录类别SCI ; EI
语种英语
WOS记录号WOS:000334677900003
公开日期2015-03-18
源URL[http://ir.opt.ac.cn/handle/181661/22375]  
专题西安光学精密机械研究所_光学影像学习与分析中心
作者单位1.Natl Univ Singapore, Sch Comp, Singapore 117548, Singapore
2.Tsinghua Univ, Tsinghua Natl Lab Informat Sci & Technol, Dept Automat, Beijing 100084, Peoples R China
3.Xiamen Univ, Sch Informat Sci & Engn, Dept Cognit Sci, Xiamen 361000, Peoples R China
4.Hangzhou Normal Univ, Hangzhou Inst Serv Engn, Hangzhou, Zhejiang, Peoples R China
5.Chinese Acad Sci, Xian Inst Opt & Precis Mech, Ctr Opt IMagery Anal & Learning OPTIMAL, State Key Lab Transient Opt & Photon, Xian 710119, Peoples R China
推荐引用方式
GB/T 7714
Zhang, Luming,Gao, Yue,Ji, Rongrong,et al. Actively Learning Human Gaze Shifting Paths for Semantics-Aware Photo Cropping[J]. ieee transactions on image processing,2014,23(5):2235-2245.
APA Zhang, Luming,Gao, Yue,Ji, Rongrong,Xia, Yingjie,Dai, Qionghai,&Li, Xuelong.(2014).Actively Learning Human Gaze Shifting Paths for Semantics-Aware Photo Cropping.ieee transactions on image processing,23(5),2235-2245.
MLA Zhang, Luming,et al."Actively Learning Human Gaze Shifting Paths for Semantics-Aware Photo Cropping".ieee transactions on image processing 23.5(2014):2235-2245.

入库方式: OAI收割

来源:西安光学精密机械研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。