中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Generating Descriptive Visual Words and Visual Phrases for Large-Scale Image Applications

文献类型:期刊论文

作者Zhang, Shiliang3; Tian, Qi4; Hua, Gang1; Huang, Qingming2; Gao, Wen3
刊名IEEE TRANSACTIONS ON IMAGE PROCESSING
出版日期2011-09-01
卷号20期号:9页码:2664-2677
关键词Image retrieval image search re-ranking object recognition visual phrase visual word
ISSN号1057-7149
DOI10.1109/TIP.2011.2128333
英文摘要Bag-of-visual Words (BoWs) representation has been applied for various problems in the fields of multimedia and computer vision. The basic idea is to represent images as visual documents composed of repeatable and distinctive visual elements, which are comparable to the text words. Notwithstanding its great success and wide adoption, visual vocabulary created from single-image local descriptors is often shown to be not as effective as desired. In this paper, descriptive visual words (DVWs) and descriptive visual phrases (DVPs) are proposed as the visual correspondences to text words and phrases, where visual phrases refer to the frequently co-occurring visual word pairs. Since images are the carriers of visual objects and scenes, a descriptive visual element set can be composed by the visual words and their combinations which are effective in representing certain visual objects or scenes. Based on this idea, a general framework is proposed for generating DVWs and DVPs for image applications. In a large-scale image database containing 1506 object and scene categories, the visual words and visual word pairs descriptive to certain objects or scenes are identified and collected as the DVWs and DVPs. Experiments show that the DVWs and DVPs are informative and descriptive and, thus, are more comparable with the text words than the classic visual words. We apply the identified DVWs and DVPs in several applications including large-scale near-duplicated image retrieval, image search re-ranking, and object recognition. The combination of DVW and DVP performs better than the state of the art in large-scale near-duplicated image retrieval in terms of accuracy, efficiency and memory consumption. The proposed image search re-ranking algorithm: DWPRank outperforms the state-of-the-art algorithm by 12.4% in mean average precision and about 11 times faster in efficiency.
资助项目Microsoft Research Asia (MSRA) ; National Science Foundation[IIS 1052581] ; National Natural Science Foundation of China[61025011] ; National Natural Science Foundation of China[60833006] ; National Basic Research Program of China (973 Program)[2009CB320906] ; Beijing Natural Science Foundation[4092042] ; Google Faculty Research Award ; FXPAL Faculty Research Award
WOS研究方向Computer Science ; Engineering
语种英语
WOS记录号WOS:000294132800022
出版者IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
源URL[http://119.78.100.204/handle/2XEOYT63/12647]  
专题中国科学院计算技术研究所期刊论文_英文
通讯作者Zhang, Shiliang
作者单位1.IBM Corp, TJ Watson Res Ctr, Armonk, NY 10532 USA
2.Chinese Acad Sci, Grad Univ, Beijing 100080, Peoples R China
3.Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China
4.Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX 78249 USA
推荐引用方式
GB/T 7714
Zhang, Shiliang,Tian, Qi,Hua, Gang,et al. Generating Descriptive Visual Words and Visual Phrases for Large-Scale Image Applications[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING,2011,20(9):2664-2677.
APA Zhang, Shiliang,Tian, Qi,Hua, Gang,Huang, Qingming,&Gao, Wen.(2011).Generating Descriptive Visual Words and Visual Phrases for Large-Scale Image Applications.IEEE TRANSACTIONS ON IMAGE PROCESSING,20(9),2664-2677.
MLA Zhang, Shiliang,et al."Generating Descriptive Visual Words and Visual Phrases for Large-Scale Image Applications".IEEE TRANSACTIONS ON IMAGE PROCESSING 20.9(2011):2664-2677.

入库方式: OAI收割

来源:计算技术研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。