中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Hybrid Learning Framework for Large-Scale Web Image Annotation and Localization

文献类型:会议论文

作者Yong Li; Jing Liu; Yuhang Wang; Bingyuan Liu; Jun Fu; Yunze Gao; Hui Wu; Hang Song; Peng Ying; Hanqing Lu
出版日期2015
会议日期September 8-11, 2015
会议地点Toulouse, France
关键词Hybrid Learning Svm Fast R-cnn Annotation Concept Localization
英文摘要In this paper, we describe the details of our participation in the ImageCLEF 2015 Scalable Image Annotation task. The task is to annotate and localize different concepts depicted in images. We propose a hybrid learning framework to solve the scalable annotation task, in which the supervised methods given limited annotated images and the searchbased solutions on the whole dataset are explored jointly. We adopt a two-stage solution to first annotate images with possible concepts and then localize the concepts in the images. For the first stage, we adopt the classification model to get the class-predictions of each image. To overcome the overfitting problem of the trained classifier with limited labelled data, we use a search-based approach to annotate an image by mining the textual information of its similar neighbors, which are similar on both visual appearance and semantics. We combine the results of classification and the search-based solution to obtain the annotations of each image. For the second stage, we train a concept localization model based on the architecture of Fast R-CNN, and output the top-k predicted regions for each concept obtained in the first stage. Meanwhile, localization by search is adopted, which works well for the concepts without obvious objects. The final result is achieved by combing the two kinds of localization results. The submitted runs of our team achieved the second place among the different teams. This shows the outperformance of the proposed hybrid two-stage learning framework for the scalable annotation task.
会议录CEUR Workshop Proceedings 1391
源URL[http://ir.ia.ac.cn/handle/173211/11768]  
专题自动化研究所_模式识别国家重点实验室_图像与视频分析团队
通讯作者Jing Liu
推荐引用方式
GB/T 7714
Yong Li,Jing Liu,Yuhang Wang,et al. Hybrid Learning Framework for Large-Scale Web Image Annotation and Localization[C]. 见:. Toulouse, France. September 8-11, 2015.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。