中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Wave-Like Class Activation Map With Representation Fusion for Weakly-Supervised Semantic Segmentation

文献类型:期刊论文

作者Xu RT(许镕涛)2,3; Wang CW(王常维)2,3; Xu SB(徐士彪)1; Meng WL(孟维亮)2,3; Zhang XP(张晓鹏)2,3
刊名IEEE Transactions on Multimedia
出版日期2023
页码1-13
英文摘要

The Class Activation Map (CAM) is widely used to generate pseudo-labels for Weakly Supervised Semantic Seg mentation (WSSS), while it does not adequately consider the modeling of foreground-independent information, resulting in prone to false positive pixels. In this paper, we propose a Wave like Class Activation Map (WaveCAM) from the perspective of representation fusion and dynamic aggregation representation to alleviate the above problem. Specifically, our WaveCAM includes the foreground-aware representation modeling that enhances perception of foreground information, and the foreground independent representation modeling that enhances perception of foreground-independent information, and a representation adaptive fusion module that fuses the two representations. Both representations are expressed as wave functions with ampli tude and phase to dynamically aggregate representations and extract semantic information after initialization, and they are fused through the adaptive fusion module to obtain an output containing rich semantic information. Extensive experiments on PASCAL VOC 2012 dataset and MS COCO 2014 dataset validate that our WaveCAM can easily embed multi-stage WSSS and end-to-end WSSS, achieving the state-of-the-art performance. The release code is available at:https://github.com/Rongtao Xu/RepresentationLearning/tree/main/WaveCAM-TMM2023.

源URL[http://ir.ia.ac.cn/handle/173211/51601]  
专题模式识别国家重点实验室_三维可视计算
多模态人工智能系统全国重点实验室
通讯作者Xu RT(许镕涛)
作者单位1.School of Artificial Intelligence, Beijing University of Posts and Telecommunications
2.the State Key Laboratory of Multimodal Artificial Intelligence Systems, Institute of Automation, Chinese Academy of Sciences
3.School of Artificial Intelligence, University of Chinese Academy of Sciences,
推荐引用方式
GB/T 7714
Xu RT,Wang CW,Xu SB,et al. Wave-Like Class Activation Map With Representation Fusion for Weakly-Supervised Semantic Segmentation[J]. IEEE Transactions on Multimedia,2023:1-13.
APA Xu RT,Wang CW,Xu SB,Meng WL,&Zhang XP.(2023).Wave-Like Class Activation Map With Representation Fusion for Weakly-Supervised Semantic Segmentation.IEEE Transactions on Multimedia,1-13.
MLA Xu RT,et al."Wave-Like Class Activation Map With Representation Fusion for Weakly-Supervised Semantic Segmentation".IEEE Transactions on Multimedia (2023):1-13.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。