Exploring Intrinsic Discrimination and Consistency for Weakly Supervised Object Localization
文献类型:期刊论文
作者 | Changwei Wang3,4,5,6![]() ![]() ![]() ![]() ![]() |
刊名 | IEEE TRANSACTIONS ON IMAGE PROCESSING
![]() |
出版日期 | 2024 |
卷号 | 33期号:0页码:1045 - 1058 |
关键词 | Weakly supervised object localization intrinsic discrimination and consistency deep metric learning geometric transformation consistency |
ISSN号 | 1057-7149 |
DOI | 10.1109/TIP.2024.3356174 |
英文摘要 | Weakly supervised object localization (WSOL) is a challenging and promising task that aims to localize objects solely based on the supervision of image category labels. In the absence of annotated bounding boxes, WSOL methods must employ the intrinsic properties of the image classification task pipeline to generate object localizations. In this work, we propose a WSOL method for exploring the Intrinsic Discrimination and Consistency in the image classification task pipeline, and call it as IDC. First, we develop a Triplet Metrics Based Foreground Modeling (TMFM) framework to directly predict object foreground regions using intrinsic discrimination. Unlike Class Activation Map (CAM) based methods that also rely on intrinsic discrimination, our TMFM framework alleviates the problem of only focusing on the most discriminative parts by optimizing foreground and background regions synergistically. Second, we design a Dual Geometric Transformation Consistency Constraints (DGTC2) training strategy to introduce additional supervision and regularization constraints for WSOL by leveraging intrinsic geometric transformation consistency. The proposed pixel-wise and object-wise consistency constraint losses cost-effectively provide spontaneous supervision for WSOL. Extensive experiments show that our IDC method achieves significant and consistent performance gains compared to existing state-of-the-art WSOL approaches. Code is available at: https://github.com/vignywang/IDC. |
URL标识 | 查看原文 |
语种 | 英语 |
源URL | [http://ir.ia.ac.cn/handle/173211/56663] ![]() |
专题 | 多模态人工智能系统全国重点实验室 模式识别国家重点实验室_三维可视计算 |
通讯作者 | Shibiao Xu |
作者单位 | 1.School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing 100876, China 2.Department of Geomatics Engineering, University of Calgary, Calgary, AB T2N 1N4, Canada. 3.School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing 100049, China 4.State Key Laboratory of Multimodal Artificial Intelligence Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China 5.Shandong Provincial Key Laboratory of Computer Networks, Shandong Fundamental Research Center for Computer Science, Jinan 250014, China 6.Key Laboratory of Computing Power Network and Information Security, Ministry of Education, Shandong Computer Science Center (National Supercomputer Center in Jinan), Qilu University of Technology (Shandong Academy of Sciences), Jinan 250014, China |
推荐引用方式 GB/T 7714 | Changwei Wang,Rongtao Xu,Shibiao Xu,et al. Exploring Intrinsic Discrimination and Consistency for Weakly Supervised Object Localization[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING,2024,33(0):1045 - 1058. |
APA | Changwei Wang,Rongtao Xu,Shibiao Xu,Weiliang Meng,Ruisheng Wang,&Xiaopeng Zhang.(2024).Exploring Intrinsic Discrimination and Consistency for Weakly Supervised Object Localization.IEEE TRANSACTIONS ON IMAGE PROCESSING,33(0),1045 - 1058. |
MLA | Changwei Wang,et al."Exploring Intrinsic Discrimination and Consistency for Weakly Supervised Object Localization".IEEE TRANSACTIONS ON IMAGE PROCESSING 33.0(2024):1045 - 1058. |
入库方式: OAI收割
来源:自动化研究所
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。