中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Cross-modal Learning for Event-based Semantic Segmentation via Attention Soft Alignment

文献类型:期刊论文

作者Chuyun Xie1,2; Wei Gao1,2; Ren Guo1,2
刊名IEEE ROBOTICS AND AUTOMATION LETTERS
出版日期2024
卷号9期号:3页码:2359-2366
关键词Deep Learning for Visual Perception, Transfer Learning, Semantic Scene Understanding
ISSN号2377-3766
DOI10.1109/LRA.2024.3355648
文献子类期刊论文
英文摘要

By demonstrating robustness in scenarios charac-
terized by high-speed motion and extreme lighting changes,
event cameras hold great potential for enhancing the perception
reliability of autonomous driving systems. Because of its novelty
and data sparsity, the progress of event-based algorithms is
hindered by the scarcity of high-quality labeled datasets. In this
work, we propose CMESS (Cross-Modal learning for Event-based
Semantic Segmentation), which eliminates the need for event
labels by transferring the model from labeled image datasets
(source domain) to unlabeled event datasets (target domain) via
unsupervised domain adaptation (UDA). Compared to existing
UDA methods that require hard alignment of visually consistent
embeddings, our approach achieves soft alignment via cross-
attention and then augments it with knowledge distillation to
convey fine-grained source knowledge to the target domain.
Additionally, we introduce an event-driven bidirectional self-
labeling method to generate weakly supervised signals for event-
only datasets. These designs facilitate cross-modal learning with-
out requiring per-pixel paired frames or online reconstruction.
Experimental results show that our method outperforms existing
state-of-the-art methods in both UDA and supervised settings on
common evaluation benchmarks, making it a universal frame-
work for further unlabeled event-related visual tasks.

URL标识查看原文
语种中文
源URL[http://ir.ia.ac.cn/handle/173211/56682]  
专题模式识别国家重点实验室_三维可视计算
通讯作者Wei Gao
作者单位1.Institute of Automation, Chinese Academy of Sciences
2.School of Artificial Intelligence, University of Chinese Academy of Sciences
推荐引用方式
GB/T 7714
Chuyun Xie,Wei Gao,Ren Guo. Cross-modal Learning for Event-based Semantic Segmentation via Attention Soft Alignment[J]. IEEE ROBOTICS AND AUTOMATION LETTERS,2024,9(3):2359-2366.
APA Chuyun Xie,Wei Gao,&Ren Guo.(2024).Cross-modal Learning for Event-based Semantic Segmentation via Attention Soft Alignment.IEEE ROBOTICS AND AUTOMATION LETTERS,9(3),2359-2366.
MLA Chuyun Xie,et al."Cross-modal Learning for Event-based Semantic Segmentation via Attention Soft Alignment".IEEE ROBOTICS AND AUTOMATION LETTERS 9.3(2024):2359-2366.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。