热门
SwinFusion: Cross-domain Long-range Learning for General Image Fusion via Swin Transformer
文献类型:期刊论文
作者 | Jiayi Ma; Linfeng Tang; Fan Fan; Jun Huang; Xiaoguang Mei; Yong Ma |
刊名 | IEEE/CAA Journal of Automatica Sinica
![]() |
出版日期 | 2022 |
卷号 | 9期号:7页码:1200-1217 |
ISSN号 | 2329-9266 |
DOI | 10.1109/JAS.2022.105686 |
英文摘要 | This study proposes a novel general image fusion framework based on cross-domain long-range learning and Swin Transformer, termed as SwinFusion. On the one hand, an attention-guided cross-domain module is devised to achieve sufficient integration of complementary information and global interaction. More specifically, the proposed method involves an intra-domain fusion unit based on self-attention and an interdomain fusion unit based on cross-attention, which mine and integrate long dependencies within the same domain and across domains. Through long-range dependency modeling, the network is able to fully implement domain-specific information extraction and cross-domain complementary information integration as well as maintaining the appropriate apparent intensity from a global perspective. In particular, we introduce the shifted windows mechanism into the self-attention and cross-attention, which allows our model to receive images with arbitrary sizes. On the other hand, the multi-scene image fusion problems are generalized to a unified framework with structure maintenance, detail preservation, and proper intensity control. Moreover, an elaborate loss function, consisting of SSIM loss, texture loss, and intensity loss, drives the network to preserve abundant texture details and structural information, as well as presenting optimal apparent intensity. Extensive experiments on both multi-modal image fusion and digital photography image fusion demonstrate the superiority of our SwinFusion compared to the state-of-theart unified image fusion algorithms and task-specific alternatives. Implementation code and pre-trained weights can be accessed at https://github.com/Linfeng-Tang/SwinFusion. |
源URL | [http://ir.ia.ac.cn/handle/173211/48898] ![]() |
专题 | 自动化研究所_学术期刊_IEEE/CAA Journal of Automatica Sinica |
推荐引用方式 GB/T 7714 | Jiayi Ma,Linfeng Tang,Fan Fan,et al. SwinFusion: Cross-domain Long-range Learning for General Image Fusion via Swin Transformer[J]. IEEE/CAA Journal of Automatica Sinica,2022,9(7):1200-1217. |
APA | Jiayi Ma,Linfeng Tang,Fan Fan,Jun Huang,Xiaoguang Mei,&Yong Ma.(2022).SwinFusion: Cross-domain Long-range Learning for General Image Fusion via Swin Transformer.IEEE/CAA Journal of Automatica Sinica,9(7),1200-1217. |
MLA | Jiayi Ma,et al."SwinFusion: Cross-domain Long-range Learning for General Image Fusion via Swin Transformer".IEEE/CAA Journal of Automatica Sinica 9.7(2022):1200-1217. |
入库方式: OAI收割
来源:自动化研究所
浏览0
下载0
收藏0
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。