中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Exploring Intrinsic Discrimination and Consistency for Weakly Supervised Object Localization

文献类型:期刊论文

作者Changwei Wang3,4,5,6; Rongtao Xu3,4; Shibiao Xu1; Weiliang Meng3,4; Ruisheng Wang2; Xiaopeng Zhang3,4
刊名IEEE TRANSACTIONS ON IMAGE PROCESSING
出版日期2024
卷号33期号:0页码:1045 - 1058
关键词Weakly supervised object localization intrinsic discrimination and consistency deep metric learning geometric transformation consistency
ISSN号1057-7149
DOI10.1109/TIP.2024.3356174
英文摘要

Weakly supervised object localization (WSOL) is a challenging and promising task that aims to localize objects solely based on the supervision of image category labels. In the absence of annotated bounding boxes, WSOL methods must employ the intrinsic properties of the image classification task pipeline to generate object localizations. In this work, we propose a WSOL method for exploring the Intrinsic Discrimination and Consistency in the image classification task pipeline, and call it as IDC. First, we develop a Triplet Metrics Based Foreground Modeling (TMFM) framework to directly predict object foreground regions using intrinsic discrimination. Unlike Class Activation Map (CAM) based methods that also rely on intrinsic discrimination, our TMFM framework alleviates the problem of only focusing on the most discriminative parts by optimizing foreground and background regions synergistically. Second, we design a Dual Geometric Transformation Consistency Constraints (DGTC2) training strategy to introduce additional supervision and regularization constraints for WSOL by leveraging intrinsic geometric transformation consistency. The proposed pixel-wise and object-wise consistency constraint losses cost-effectively provide spontaneous supervision for WSOL. Extensive experiments show that our IDC method achieves significant and consistent performance gains compared to existing state-of-the-art WSOL approaches. Code is available at: https://github.com/vignywang/IDC.

URL标识查看原文
语种英语
源URL[http://ir.ia.ac.cn/handle/173211/56663]  
专题多模态人工智能系统全国重点实验室
模式识别国家重点实验室_三维可视计算
通讯作者Shibiao Xu
作者单位1.School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing 100876, China
2.Department of Geomatics Engineering, University of Calgary, Calgary, AB T2N 1N4, Canada.
3.School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing 100049, China
4.State Key Laboratory of Multimodal Artificial Intelligence Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
5.Shandong Provincial Key Laboratory of Computer Networks, Shandong Fundamental Research Center for Computer Science, Jinan 250014, China
6.Key Laboratory of Computing Power Network and Information Security, Ministry of Education, Shandong Computer Science Center (National Supercomputer Center in Jinan), Qilu University of Technology (Shandong Academy of Sciences), Jinan 250014, China
推荐引用方式
GB/T 7714
Changwei Wang,Rongtao Xu,Shibiao Xu,et al. Exploring Intrinsic Discrimination and Consistency for Weakly Supervised Object Localization[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING,2024,33(0):1045 - 1058.
APA Changwei Wang,Rongtao Xu,Shibiao Xu,Weiliang Meng,Ruisheng Wang,&Xiaopeng Zhang.(2024).Exploring Intrinsic Discrimination and Consistency for Weakly Supervised Object Localization.IEEE TRANSACTIONS ON IMAGE PROCESSING,33(0),1045 - 1058.
MLA Changwei Wang,et al."Exploring Intrinsic Discrimination and Consistency for Weakly Supervised Object Localization".IEEE TRANSACTIONS ON IMAGE PROCESSING 33.0(2024):1045 - 1058.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。