Cognition-inspired multimodal attention fusion of close-range laser scanning data for globally representative tree species classification
文献类型:期刊论文
| 作者 | Luo, Xin6; Tian, Xin6; Liang, Xinlian5; Mokros, Martin4; Chai, GuoQi6; Li, Zengyuan6; Guo, Ying6; Yang, Yudi6; Pang, Yong6; Wang, Yunsheng3 |
| 刊名 | REMOTE SENSING OF ENVIRONMENT
![]() |
| 出版日期 | 2026-03-15 |
| 卷号 | 335页码:115269 |
| 关键词 | Tree species classification Close-range sensing Multimodal fusion Deep learning Attention mechanism Forest remote sensing |
| ISSN号 | 0034-4257 |
| DOI | 10.1016/j.rse.2026.115269 |
| 产权排序 | 5 |
| 文献子类 | Article |
| 英文摘要 | Species-level tree identification is a fundamental task in forest monitoring, biodiversity assessment, and climate-smart ecosystem modeling. Close-range laser scanning technologies have become indispensable tools for forest mapping because they provide high-resolution, three-dimensional structural data at the individual-tree level. However, species-level identification remains a major challenge in global environmental monitoring and AI-driven ecological assessment owing to high species diversity, structural plasticity, and variability across sensing platforms. Here, we propose the cognition-inspired multimodal attention fusion network (CI-MAFusion), a dual-branch deep learning framework that integrates point cloud data with multi-view imagery. Guided by expert dendrological reasoning and cognitive neuroscience principles, CI-MAFusion incorporates a structural branch based on an improved graph attention-based point network for encoding 3D morphological patterns and a visual branch that processes standardized multi-view projections to extract textural features. A cross-gate attention mechanism adaptively fuses structural and visual features. Each branch uses an enhanced convolutional block attention module to highlight salient features, analogous to selective attention in the human visual system. We tested CI-MAFusion using Global LiDAR TreeBank, which contains 12,057 trees from 36 species across four continents and six K & ouml;ppen climate zones. The model achieved 87.50% overall accuracy at the genus level and 86.12% at the species level, outperforming unimodal and existing fusion approaches by up to 8.1%. Additionally, it further achieved > 90% overall accuracy across regions and > 80% across climate zones, with attention visualizations highlighting biologically diagnostic features such as crown contours, bark textures, and branch junctions. This cognitively inspired architecture improves generalization and advances AI-based systems toward robust recognition of biological structures in complex environments. |
| URL标识 | 查看原文 |
| WOS关键词 | TERRESTRIAL |
| WOS研究方向 | Environmental Sciences & Ecology ; Remote Sensing ; Imaging Science & Photographic Technology |
| 语种 | 英语 |
| WOS记录号 | WOS:001685420900001 |
| 出版者 | ELSEVIER SCIENCE INC |
| 源URL | [http://ir.igsnrr.ac.cn/handle/311030/221245] ![]() |
| 专题 | 中国科学院地理科学与资源研究所 |
| 通讯作者 | Tian, Xin |
| 作者单位 | 1.Beijing Forestry Univ, Key Lab Forest Cultivat & Protect, Minist Educ, Beijing 100083, Peoples R China 2.Chinese Acad Sci, LREIS, Inst Geog Sci & Nat Resources Res, Beijing 100101, Peoples R China; 3.Nat Land Survey Finland, Dept Remote Sensing & Photogrammetry, Finnish Geospatial Res Inst, Vuorimiehentie 5, Espoo 02150, Finland; 4.UCL, Fac Social & Hist Sci, Geog Dept, London, England; 5.Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & Re, Wuhan 430070, Peoples R China; 6.Chinese Acad Forestry, Inst Forest Resource Informat Tech, Beijing 100091, Peoples R China; |
| 推荐引用方式 GB/T 7714 | Luo, Xin,Tian, Xin,Liang, Xinlian,et al. Cognition-inspired multimodal attention fusion of close-range laser scanning data for globally representative tree species classification[J]. REMOTE SENSING OF ENVIRONMENT,2026,335:115269. |
| APA | Luo, Xin.,Tian, Xin.,Liang, Xinlian.,Mokros, Martin.,Chai, GuoQi.,...&Wang, Haiyi.(2026).Cognition-inspired multimodal attention fusion of close-range laser scanning data for globally representative tree species classification.REMOTE SENSING OF ENVIRONMENT,335,115269. |
| MLA | Luo, Xin,et al."Cognition-inspired multimodal attention fusion of close-range laser scanning data for globally representative tree species classification".REMOTE SENSING OF ENVIRONMENT 335(2026):115269. |
入库方式: OAI收割
来源:地理科学与资源研究所
浏览0
下载0
收藏0
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。

