Wave-Like Class Activation Map With Representation Fusion for Weakly-Supervised Semantic Segmentation
Document type: Journal article
Authors | Xu RT (许镕涛)2,3 |
Journal | IEEE Transactions on Multimedia |
Publication date | 2023 |
Pages | 1-13 |
Abstract | The Class Activation Map (CAM) is widely used to generate pseudo-labels for Weakly Supervised Semantic Segmentation (WSSS), but it does not adequately model foreground-independent information, which makes it prone to false-positive pixels. In this paper, we propose a Wave-like Class Activation Map (WaveCAM) from the perspective of representation fusion and dynamic representation aggregation to alleviate this problem. Specifically, our WaveCAM includes foreground-aware representation modeling, which enhances the perception of foreground information; foreground-independent representation modeling, which enhances the perception of foreground-independent information; and a representation adaptive fusion module that fuses the two representations. Both representations are expressed as wave functions with amplitude and phase to dynamically aggregate representations and extract semantic information after initialization, and they are fused through the adaptive fusion module to obtain an output containing rich semantic information. Extensive experiments on the PASCAL VOC 2012 and MS COCO 2014 datasets validate that our WaveCAM can be easily embedded into multi-stage WSSS and end-to-end WSSS, achieving state-of-the-art performance. The released code is available at: https://github.com/Rongtao Xu/RepresentationLearning/tree/main/WaveCAM-TMM2023. |
Source URL | [http://ir.ia.ac.cn/handle/173211/51601] |
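Collection | National Laboratory of Pattern Recognition_3D Visual Computing; State Key Laboratory of Multimodal Artificial Intelligence Systems |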
Corresponding author | Xu RT (许镕涛) |
Author affiliations | 1. School of Artificial Intelligence, Beijing University of Posts and Telecommunications 2. The State Key Laboratory of Multimodal Artificial Intelligence Systems, Institute of Automation, Chinese Academy of Sciences 3. School of Artificial Intelligence, University of Chinese Academy of Sciences |
Recommended citation (GB/T 7714) | Xu RT, Wang CW, Xu SB, et al. Wave-Like Class Activation Map With Representation Fusion for Weakly-Supervised Semantic Segmentation[J]. IEEE Transactions on Multimedia, 2023: 1-13. |
APA | Xu RT, Wang CW, Xu SB, Meng WL, & Zhang XP. (2023). Wave-Like Class Activation Map With Representation Fusion for Weakly-Supervised Semantic Segmentation. IEEE Transactions on Multimedia, 1-13. |
MLA | Xu RT, et al. "Wave-Like Class Activation Map With Representation Fusion for Weakly-Supervised Semantic Segmentation". IEEE Transactions on Multimedia (2023): 1-13. |
Ingestion method: OAI harvesting
Source: Institute of Automation