Detection Based on Semantics and a Detail Infusion Feature Pyramid Network and a Coordinate Adaptive Spatial Feature Fusion Mechanism Remote Sensing Small Object Detector
文献类型:期刊论文
作者 | Zhou, Shilong1,2; Zhou, Haijin2![]() |
刊名 | REMOTE SENSING
![]() |
出版日期 | 2024-07-01 |
卷号 | 16 |
关键词 | UAV object detection YOLOv8n attention mechanism feature fusion |
DOI | 10.3390/rs16132416 |
通讯作者 | Zhou, Haijin(hjzhou@aiofm.ac.cn) |
英文摘要 | In response to the challenges of remote sensing imagery, such as unmanned aerial vehicle (UAV) aerial imagery, including differences in target dimensions, the dominance of small targets, and dense clutter and occlusion in complex environments, this paper optimizes the YOLOv8n model and proposes an innovative small-object-detection model called DDSC-YOLO. First, a DualC2f structure is introduced to improve the feature-extraction capabilities of the model. This structure uses dual-convolutions and group convolution techniques to effectively address the issues of cross-channel communication and preserving information in the original input feature mappings. Next, a new attention mechanism, DCNv3LKA, was developed. This mechanism uses adaptive and fine-grained information-extraction methods to simulate receptive fields similar to self-attention, allowing adaptation to a wide range of target size variations. To address the problem of false and missed detection of small targets in aerial photography, we designed a Semantics and Detail Infusion Feature Pyramid Network (SDI-FPN) and added a dedicated detection scale specifically for small targets, effectively mitigating the loss of contextual information in the model. In addition, the coordinate adaptive spatial feature fusion (CASFF) mechanism is used to optimize the original detection head, effectively overcoming multi-scale information conflicts while significantly improving small target localization accuracy and long-range dependency perception. Testing on the VisDrone2019 dataset shows that the DDSC-YOLO model improves the mAP0.5 by 9.3% over YOLOv8n, and its performance on the SSDD and RSOD datasets also confirms its superior generalization capabilities. These results confirm the effectiveness and significant progress of our novel approach to small target detection. |
WOS关键词 | UAV ; SAFETY |
资助项目 | National Key R&D Program of China[2022YFB3904805] |
WOS研究方向 | Environmental Sciences & Ecology ; Geology ; Remote Sensing ; Imaging Science & Photographic Technology |
语种 | 英语 |
WOS记录号 | WOS:001269274000001 |
出版者 | MDPI |
资助机构 | National Key R&D Program of China |
源URL | [http://ir.hfcas.ac.cn:8080/handle/334002/137125] ![]() |
专题 | 中国科学院合肥物质科学研究院 |
通讯作者 | Zhou, Haijin |
作者单位 | 1.Anhui Univ, Inst Phys Sci & Informat Technol, Hefei 230601, Peoples R China 2.Chinese Acad Sci, Hefei Inst Phys Sci, Hefei 230031, Peoples R China |
推荐引用方式 GB/T 7714 | Zhou, Shilong,Zhou, Haijin. Detection Based on Semantics and a Detail Infusion Feature Pyramid Network and a Coordinate Adaptive Spatial Feature Fusion Mechanism Remote Sensing Small Object Detector[J]. REMOTE SENSING,2024,16. |
APA | Zhou, Shilong,&Zhou, Haijin.(2024).Detection Based on Semantics and a Detail Infusion Feature Pyramid Network and a Coordinate Adaptive Spatial Feature Fusion Mechanism Remote Sensing Small Object Detector.REMOTE SENSING,16. |
MLA | Zhou, Shilong,et al."Detection Based on Semantics and a Detail Infusion Feature Pyramid Network and a Coordinate Adaptive Spatial Feature Fusion Mechanism Remote Sensing Small Object Detector".REMOTE SENSING 16(2024). |
入库方式: OAI收割
来源:合肥物质科学研究院
浏览0
下载0
收藏0
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。