Chinese Academy of Sciences Institutional Repositories Grid: Learning representative and discriminative image representation by deep appearance and spatial coding

Chinese Academy of Sciences Institutional Repositories Grid

Learning representative and discriminative image representation by deep appearance and spatial coding

Document type: Journal article


Authors	Bingyuan Liu; Jing Liu; Hanqing Lu
Journal	Computer Vision and Image Understanding
Publication date	2015
Volume	136; Issue: 1; Pages: 23-31
Keywords	Image Classification; Deep Learning; Structured Sparsity
Abstract	How to build a suitable image representation remains a critical problem in computer vision. Traditional Bag-of-Feature (BoF) based models build image representation through a pipeline of local feature extraction, feature coding and spatial pooling. However, three major shortcomings hinder performance, i.e., the limitation of hand-designed features, the discrimination loss in local appearance coding and the lack of spatial information. To overcome the above limitations, in this paper, we propose a generalized BoF-based framework, which is hierarchically learned by exploring recently developed deep learning methods. First, with raw images as input, we densely extract local patches and learn local features with a stacked Independent Subspace Analysis network. The learned features are then transformed to appearance codes by sparse Restricted Boltzmann Machines. Second, we perform spatial max-pooling on a set of over-complete spatial regions, which is generated by covering various spatial distributions, to incorporate more flexible spatial information. Third, a structured sparse Auto-encoder is proposed to explore the region representations into the image-level signature. To learn the proposed hierarchy, we layer-wise pre-train the network in an unsupervised manner, followed by supervised fine-tuning with image labels. Extensive experiments on different benchmarks, i.e., UIUC-Sports, Caltech-101, Caltech-256, Scene-15 and MIT Indoor-67, demonstrate the effectiveness of our proposed model.
Source URL	[http://ir.ia.ac.cn/handle/173211/13436]
Collection	Institute of Automation_National Laboratory of Pattern Recognition_Image and Video Analysis Team
Corresponding author	Jing Liu
Recommended citation (GB/T 7714)	Bingyuan Liu, Jing Liu, Hanqing Lu. Learning representative and discriminative image representation by deep appearance and spatial coding[J]. Computer Vision and Image Understanding, 2015, 136(1): 23-31.
APA	Bingyuan Liu, Jing Liu, & Hanqing Lu. (2015). Learning representative and discriminative image representation by deep appearance and spatial coding. Computer Vision and Image Understanding, 136(1), 23-31.
MLA	Bingyuan Liu, et al. "Learning representative and discriminative image representation by deep appearance and spatial coding". Computer Vision and Image Understanding 136.1 (2015): 23-31.

Ingest method: OAI harvesting

Source: Institute of Automation

Unless otherwise stated, all content in this system is protected by copyright, with all rights reserved.