Chinese Academy of Sciences Institutional Repositories Grid
Fine-grained Visual Categorization by Localizing Object Parts with Single Image

Document type: Journal article

Authors: Zheng, Xiangtao (1); Qi, Lei (2); Ren, Yutao (3); Lu, Xiaoqiang (4)
Journal: IEEE Transactions on Multimedia
Keywords: Fine-grained visual categorization; Part localization; Part relationship; Spectral clustering; Dropout learning
ISSN: 1520-9210; 1941-0077
DOI: 10.1109/TMM.2020.2993960
Ownership ranking: 1
English abstract

Fine-grained visual categorization (FGVC) refers to assigning fine-grained labels to images that belong to the same base category. Because of the high inter-class similarity, it is challenging to distinguish fine-grained images of different subcategories. Recently, researchers have proposed first localizing key object parts within images and then finding discriminative clues on those parts. To localize object parts, existing methods train detectors for different kinds of object parts. However, because the same kind of object part often varies greatly in appearance across images, existing methods have two shortcomings: 1) training part detectors for object parts with diverse appearance is laborious; 2) discriminative parts with unusual appearance may be missed by the trained part detectors. To localize the key object parts efficiently and accurately, this paper proposes a novel FGVC method. Its main novelty is that it localizes the key object parts within each image using only that single image, and hence avoids the influence of part diversity across images. The proposed method consists of two key steps. First, it localizes the key parts in each image independently: potential object parts are identified and then merged to generate the final representative object parts. Second, two kinds of features are extracted to describe simultaneously the discriminative clues within each part and the relationships between object parts. In addition, a part-based dropout learning technique is adopted to further boost classification performance. Comparison experiments show that the proposed method achieves comparable or better performance than state-of-the-art methods.
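The merging step mentioned in the abstract (potential parts identified within one image, then merged into a few representative parts) can be illustrated roughly as follows. This is a minimal sketch and not the authors' method: it assumes candidate parts are axis-aligned boxes, uses intersection-over-union (IoU) as the affinity between candidates, and groups them with a simple overlap threshold as a stand-in for the spectral clustering named in the keywords. The names `iou`, `merge_candidate_parts`, and `threshold` are hypothetical.

```python
from typing import Dict, List, Tuple

# Assumed representation of a candidate part: (x1, y1, x2, y2).
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def merge_candidate_parts(boxes: List[Box], threshold: float = 0.5) -> List[Box]:
    """Group overlapping candidates and average each group into one box.

    Stand-in for a clustering step: candidates whose IoU reaches
    `threshold` are linked, and each connected group becomes one part.
    """
    parent = list(range(len(boxes)))

    def find(i: int) -> int:
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(len(boxes)):
        for j in range(i + 1, len(boxes)):
            if iou(boxes[i], boxes[j]) >= threshold:
                parent[find(j)] = find(i)

    groups: Dict[int, List[Box]] = {}
    for i, box in enumerate(boxes):
        groups.setdefault(find(i), []).append(box)

    merged: List[Box] = []
    for members in groups.values():
        n = len(members)
        # Representative part: coordinate-wise mean of the group.
        merged.append(tuple(sum(b[k] for b in members) / n for k in range(4)))
    return merged


if __name__ == "__main__":
    candidates = [(0, 0, 10, 10), (1, 1, 11, 11), (50, 50, 60, 60)]
    print(merge_candidate_parts(candidates))
```

Because every candidate comes from the same image, no detector trained across images is involved, which is the property the abstract emphasizes. A real system would presumably compute affinities from learned features instead of box overlap alone.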

Language: English
Publisher: Institute of Electrical and Electronics Engineers Inc.
Source URL: [http://ir.opt.ac.cn/handle/181661/93458]
Collection: Xi'an Institute of Optics and Precision Mechanics / Center for Optical Imagery Analysis and Learning
Author affiliations:
1. Xi'an Institute of Optics and Precision Mechanics, Chinese Academy of Sciences, Xi'an, China (e-mail: xiangtaoz@gmail.com);
2. Xi'an Institute of Optics and Precision Mechanics, Chinese Academy of Sciences, Xi'an, China (e-mail: 553054612@qq.com);
3. Xi'an Institute of Optics and Precision Mechanics, Chinese Academy of Sciences, Xi'an, China (e-mail: taoyao0204@163.com);
4. OPTical IMagery Analysis and Learning, Chinese Academy of Sciences, Xi'an, China 710119 (e-mail: luxq666666@gmail.com)
Recommended citation
GB/T 7714
Zheng, Xiangtao, Qi, Lei, Ren, Yutao, et al. Fine-grained Visual Categorization by Localizing Object Parts with Single Image[J]. IEEE Transactions on Multimedia.
APA Zheng, Xiangtao, Qi, Lei, Ren, Yutao, & Lu, Xiaoqiang.
MLA Zheng, Xiangtao, et al. "Fine-grained Visual Categorization by Localizing Object Parts with Single Image". IEEE Transactions on Multimedia

Ingest method: OAI harvesting

Source: Xi'an Institute of Optics and Precision Mechanics

