Weakly Supervised Multimodal Kernel for Categorizing Aerial Photographs
文献类型:期刊论文
作者 | Xia, Yingjie1; Zhang, Luming1; Liu, Zhenguang2; Nie, Liqiang3; Li, Xuelong4 |
刊名 | ieee transactions on image processing
![]() |
出版日期 | 2017-08-01 |
卷号 | 26期号:8页码:3748-3758 |
关键词 | Multimodal categorization aerial photograph image kernel weakly-supervised |
ISSN号 | 1057-7149 |
产权排序 | 4 |
通讯作者 | zhang, luming (zglumg@gmail.com) |
英文摘要 | accurately distinguishing aerial photographs from different categories is a promising technique in computer vision. it can facilitate a series of applications, such as video surveillance and vehicle navigation. in this paper, a new image kernel is proposed for effectively recognizing aerial photographs. the key is to encode high-level semantic cues into local image patches in a weakly supervised way, and integrate multimodal visual features using a newly developed hashing algorithm. the flowchart can be elaborated as follows. given an aerial photo, we first extract a number of graphlets to describe its topological structure. for each graphlet, we utilize color and texture to capture its appearance, and a weakly supervised algorithm to capture its semantics. thereafter, aerial photo categorization can be naturally formulated as graphlet-to-graphlet matching. as the number of graphlets from each aerial photo is huge, to accelerate matching, we present a hashing algorithm to seamlessly fuze the multiple visual features into binary codes. finally, an image kernel is calculated by fast matching the binary codes corresponding to each graphlet. and a multi-class svm is learned for aerial photo categorization. we demonstrate the advantage of our proposed model by comparing it with state-of-the-art image descriptors. moreover, an in-depth study of the descriptiveness of the hash-based graphlet is presented. |
WOS标题词 | science & technology ; technology |
类目[WOS] | computer science, artificial intelligence ; engineering, electrical & electronic |
研究领域[WOS] | computer science ; engineering |
关键词[WOS] | object recognition ; image categories ; face recognition ; histograms ; reranking |
收录类别 | SCI ; EI |
语种 | 英语 |
WOS记录号 | WOS:000403819200008 |
源URL | [http://ir.opt.ac.cn/handle/181661/29036] ![]() |
专题 | 西安光学精密机械研究所_光学影像学习与分析中心 |
作者单位 | 1.Zhejiang Univ, Coll Comp Sci, Hangzhou 310027, Peoples R China 2.Natl Univ Singapore, Sch Comp, Singapore 119077, Singapore 3.Shandong Univ, Sch Comp Sci & Technol, Jinan 250100, Peoples R China 4.Chinese Acad Sci, Xian Inst Opt & Precis Mech, State Key Lab Transient Opt & Photon, Ctr OPT IMagery Anal & Learning, Xian 710119, Peoples R China |
推荐引用方式 GB/T 7714 | Xia, Yingjie,Zhang, Luming,Liu, Zhenguang,et al. Weakly Supervised Multimodal Kernel for Categorizing Aerial Photographs[J]. ieee transactions on image processing,2017,26(8):3748-3758. |
APA | Xia, Yingjie,Zhang, Luming,Liu, Zhenguang,Nie, Liqiang,&Li, Xuelong.(2017).Weakly Supervised Multimodal Kernel for Categorizing Aerial Photographs.ieee transactions on image processing,26(8),3748-3758. |
MLA | Xia, Yingjie,et al."Weakly Supervised Multimodal Kernel for Categorizing Aerial Photographs".ieee transactions on image processing 26.8(2017):3748-3758. |
入库方式: OAI收割
来源:西安光学精密机械研究所
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。