中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Detection Based on Semantics and a Detail Infusion Feature Pyramid Network and a Coordinate Adaptive Spatial Feature Fusion Mechanism Remote Sensing Small Object Detector

文献类型:期刊论文

作者Zhou, Shilong1,2; Zhou, Haijin2
刊名REMOTE SENSING
出版日期2024-07-01
卷号16
关键词UAV object detection YOLOv8n attention mechanism feature fusion
DOI10.3390/rs16132416
通讯作者Zhou, Haijin(hjzhou@aiofm.ac.cn)
英文摘要In response to the challenges of remote sensing imagery, such as unmanned aerial vehicle (UAV) aerial imagery, including differences in target dimensions, the dominance of small targets, and dense clutter and occlusion in complex environments, this paper optimizes the YOLOv8n model and proposes an innovative small-object-detection model called DDSC-YOLO. First, a DualC2f structure is introduced to improve the feature-extraction capabilities of the model. This structure uses dual-convolutions and group convolution techniques to effectively address the issues of cross-channel communication and preserving information in the original input feature mappings. Next, a new attention mechanism, DCNv3LKA, was developed. This mechanism uses adaptive and fine-grained information-extraction methods to simulate receptive fields similar to self-attention, allowing adaptation to a wide range of target size variations. To address the problem of false and missed detection of small targets in aerial photography, we designed a Semantics and Detail Infusion Feature Pyramid Network (SDI-FPN) and added a dedicated detection scale specifically for small targets, effectively mitigating the loss of contextual information in the model. In addition, the coordinate adaptive spatial feature fusion (CASFF) mechanism is used to optimize the original detection head, effectively overcoming multi-scale information conflicts while significantly improving small target localization accuracy and long-range dependency perception. Testing on the VisDrone2019 dataset shows that the DDSC-YOLO model improves the mAP0.5 by 9.3% over YOLOv8n, and its performance on the SSDD and RSOD datasets also confirms its superior generalization capabilities. These results confirm the effectiveness and significant progress of our novel approach to small target detection.
WOS关键词UAV ; SAFETY
资助项目National Key R&D Program of China[2022YFB3904805]
WOS研究方向Environmental Sciences & Ecology ; Geology ; Remote Sensing ; Imaging Science & Photographic Technology
语种英语
WOS记录号WOS:001269274000001
出版者MDPI
资助机构National Key R&D Program of China
源URL[http://ir.hfcas.ac.cn:8080/handle/334002/137125]  
专题中国科学院合肥物质科学研究院
通讯作者Zhou, Haijin
作者单位1.Anhui Univ, Inst Phys Sci & Informat Technol, Hefei 230601, Peoples R China
2.Chinese Acad Sci, Hefei Inst Phys Sci, Hefei 230031, Peoples R China
推荐引用方式
GB/T 7714
Zhou, Shilong,Zhou, Haijin. Detection Based on Semantics and a Detail Infusion Feature Pyramid Network and a Coordinate Adaptive Spatial Feature Fusion Mechanism Remote Sensing Small Object Detector[J]. REMOTE SENSING,2024,16.
APA Zhou, Shilong,&Zhou, Haijin.(2024).Detection Based on Semantics and a Detail Infusion Feature Pyramid Network and a Coordinate Adaptive Spatial Feature Fusion Mechanism Remote Sensing Small Object Detector.REMOTE SENSING,16.
MLA Zhou, Shilong,et al."Detection Based on Semantics and a Detail Infusion Feature Pyramid Network and a Coordinate Adaptive Spatial Feature Fusion Mechanism Remote Sensing Small Object Detector".REMOTE SENSING 16(2024).

入库方式: OAI收割

来源:合肥物质科学研究院

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。