中国科学院机构知识库网格系统: Multi-View 3D Object Retrieval With Deep Embedding Network

Multi-View 3D Object Retrieval With Deep Embedding Network

文献类型：期刊论文


作者	Guo, Haiyun1,2 ; Wang, Jinqiao1,2 ; Gao, Yue 3; Li, Jianqiang 4; Lu, Hanqing1,2
刊名	IEEE TRANSACTIONS ON IMAGE PROCESSING
出版日期	2016-12-01
卷号	25 期号:12 页码:5526-5537
关键词	Convolutional Neural Network Multi-view 3d Object Retrieval Triplet Loss
DOI	10.1109/TIP.2016.2609814
文献子类	Article
英文摘要	In multi-view 3D object retrieval, each object is characterized by a group of 2D images captured from different views. Rather than using hand-crafted features, in this paper, we take advantage of the strong discriminative power of convolutional neural network to learn an effective 3D object representation tailored for this retrieval task. Specifically, we propose a deep embedding network jointly supervised by classification loss and triplet loss to map the high-dimensional image space into a low-dimensional feature space, where the Euclidean distance of features directly corresponds to the semantic similarity of images. By effectively reducing the intra-class variations while increasing the inter-class ones of the input images, the network guarantees that similar images are closer than dissimilar ones in the learned feature space. Besides, we investigate the effectiveness of deep features extracted from different layers of the embedding network extensively and find that an efficient 3D object representation should be a tradeoff between global semantic information and discriminative local characteristics. Then, with the set of deep features extracted from different views, we can generate a comprehensive description for each 3D object and formulate the multi-view 3D object retrieval as a set-to-set matching problem. Extensive experiments on SHREC'15 data set demonstrate the superiority of our proposed method over the previous state-of-the-art approaches with over 12% performance improvement.
WOS关键词	MODEL RETRIEVAL ; VISUAL SIMILARITY ; DISTANCE ; DESCRIPTOR
WOS研究方向	Computer Science ; Engineering
语种	英语
WOS记录号	WOS:000388205100002
资助机构	863 Program(2014AA015104) ; National Natural Science Foundation of China(61273034 ; 61332016)
源URL	[http://ir.ia.ac.cn/handle/173211/13354]
专题	自动化研究所_模式识别国家重点实验室_图像与视频分析团队
作者单位	1.Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China 2.Univ Chinese Acad Sci, Beijing 100190, Peoples R China 3.Tsinghua Univ, Sch Software, Tsinghua Natl Lab Informat Sci & Technol TNList, Key Lab Informat Syst Secur,Minist Educ, Beijing 100084, Peoples R China 4.Beijing Univ Technol, Sch Software Engn, Beijing Engn Res Ctr IoT Software & Syst, Beijing 100083, Peoples R China
推荐引用方式 GB/T 7714	Guo, Haiyun,Wang, Jinqiao,Gao, Yue,et al. Multi-View 3D Object Retrieval With Deep Embedding Network[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING,2016,25(12):5526-5537.
APA	Guo, Haiyun,Wang, Jinqiao,Gao, Yue,Li, Jianqiang,&Lu, Hanqing.(2016).Multi-View 3D Object Retrieval With Deep Embedding Network.IEEE TRANSACTIONS ON IMAGE PROCESSING,25(12),5526-5537.
MLA	Guo, Haiyun,et al."Multi-View 3D Object Retrieval With Deep Embedding Network".IEEE TRANSACTIONS ON IMAGE PROCESSING 25.12(2016):5526-5537.

入库方式： OAI收割

来源：自动化研究所

下载0

Multi-View 3D Object Retrieval With Deep Embedding Network

其他版本