中国科学院机构知识库网格系统: Cross-modal Learning for Event-based Semantic Segmentation via Attention Soft Alignment

中国科学院机构知识库网格

Chinese Academy of Sciences Institutional Repositories Grid

Cross-modal Learning for Event-based Semantic Segmentation via Attention Soft Alignment

文献类型：期刊论文


作者	Chuyun Xie1,2 ; Wei Gao1,2 ; Ren Guo1,2
刊名	IEEE ROBOTICS AND AUTOMATION LETTERS
出版日期	2024
卷号	9 期号:3 页码:2359-2366
关键词	Deep Learning for Visual Perception, Transfer Learning, Semantic Scene Understanding
ISSN号	2377-3766
DOI	10.1109/LRA.2024.3355648
文献子类	期刊论文
英文摘要	By demonstrating robustness in scenarios charac- terized by high-speed motion and extreme lighting changes, event cameras hold great potential for enhancing the perception reliability of autonomous driving systems. Because of its novelty and data sparsity, the progress of event-based algorithms is hindered by the scarcity of high-quality labeled datasets. In this work, we propose CMESS (Cross-Modal learning for Event-based Semantic Segmentation), which eliminates the need for event labels by transferring the model from labeled image datasets (source domain) to unlabeled event datasets (target domain) via unsupervised domain adaptation (UDA). Compared to existing UDA methods that require hard alignment of visually consistent embeddings, our approach achieves soft alignment via cross- attention and then augments it with knowledge distillation to convey fine-grained source knowledge to the target domain. Additionally, we introduce an event-driven bidirectional self- labeling method to generate weakly supervised signals for event- only datasets. These designs facilitate cross-modal learning with- out requiring per-pixel paired frames or online reconstruction. Experimental results show that our method outperforms existing state-of-the-art methods in both UDA and supervised settings on common evaluation benchmarks, making it a universal frame- work for further unlabeled event-related visual tasks.
URL标识	查看原文
语种	中文
源URL	[http://ir.ia.ac.cn/handle/173211/56682]
专题	模式识别国家重点实验室_三维可视计算
通讯作者	Wei Gao
作者单位	1.Institute of Automation, Chinese Academy of Sciences 2.School of Artificial Intelligence, University of Chinese Academy of Sciences
推荐引用方式 GB/T 7714	Chuyun Xie,Wei Gao,Ren Guo. Cross-modal Learning for Event-based Semantic Segmentation via Attention Soft Alignment[J]. IEEE ROBOTICS AND AUTOMATION LETTERS,2024,9(3):2359-2366.
APA	Chuyun Xie,Wei Gao,&Ren Guo.(2024).Cross-modal Learning for Event-based Semantic Segmentation via Attention Soft Alignment.IEEE ROBOTICS AND AUTOMATION LETTERS,9(3),2359-2366.
MLA	Chuyun Xie,et al."Cross-modal Learning for Event-based Semantic Segmentation via Attention Soft Alignment".IEEE ROBOTICS AND AUTOMATION LETTERS 9.3(2024):2359-2366.

入库方式： OAI收割

来源：自动化研究所

浏览0

下载0

收藏0

其他版本

除非特别说明，本系统中所有内容都受版权保护，并保留所有权利。