中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Exploring effective ways to increase reliable positive samples for machine learning-based urban waterlogging susceptibility assessments

文献类型:期刊论文

作者Tang, Xianzhe1,2; Wu, Zhanyu2; Liu, Wei3,4; Tian, Juwei1; Liu, Luo1
刊名JOURNAL OF ENVIRONMENTAL MANAGEMENT
出版日期2023-10-15
卷号344页码:9
ISSN号0301-4797
关键词Urban waterlogging Imbalanced class Machine learning SMOTE OSSA
DOI10.1016/j.jenvman.2023.118682
通讯作者Liu, Wei(liuwei6010@igsnrr.ac.cn) ; Liu, Luo(liuluo@scau.edu.cn)
英文摘要Machine learning (ML)-based urban waterlogging susceptibility studies suffer from class imbalance, as fewer positive samples are generally available than potential negative samples. Few studies have considered optimizing the results by improving the quality of training samples. To address this issue, we explored effective approaches to reliably increase the numbers of positive samples for such studies. The Synthetic Minority Over-Sampling Technique (SMOTE) and Optimized Seed Spread Algorithm (OSSA), representative of oversampling (synthesizing new samples based on the feature space) and physical (simulating potential inundated area based on the mechanisms of water flow) approaches, respectively, were employed to increase the number of positive samples. Waterlogging in Shenzhen was selected as a case study using eight selected spatial variables. An elaborate experiment was conducted to compare the quality of added samples based on the classifiers' performance and accuracy of waterlogging susceptibility maps (WSMs). The results indicated that (1) the performance of classifiers generated with SMOTE was worse than the original samples, while the use of OSSA improved the trained classifiers, and (2) the accuracy of WSMs was not improved with SMOTE but increased markedly with OSSA. These results may be driven by the diversity of information and features of the added samples. This study indicates the use of SMOTE fails to synthesize reliable samples when applied to waterlogging analysis in Shenzhen, whereas an effective solution for generating reliable positive samples is to use OSSA that simulates the potential submerged regions based on the mechanisms of disaster occurrence and spread.
WOS关键词SMOTE ; SIMULATION ; EXPANSION ; MODEL ; SWMM ; PSO
资助项目National Key Research and Development Program of China[2020YFD1100203]
WOS研究方向Environmental Sciences & Ecology
语种英语
出版者ACADEMIC PRESS LTD- ELSEVIER SCIENCE LTD
WOS记录号WOS:001144123700001
资助机构National Key Research and Development Program of China
源URL[http://ir.igsnrr.ac.cn/handle/311030/202158]  
专题中国科学院地理科学与资源研究所
通讯作者Liu, Wei; Liu, Luo
作者单位1.South China Agr Univ, Guangdong Prov Key Lab Land Use & Consolidat, Guangzhou 510642, Peoples R China
2.South China Agr Univ, Joint Inst Environm & Educ, Coll Nat Resources & Environm, Guangzhou 510642, Peoples R China
3.Chinese Acad Sci, Inst Geog Sci & Nat Resources Res, State Key Lab Resources & Environm Informat Syst, Beijing 100101, Peoples R China
4.Univ Chinese Acad Sci, Beijing 100049, Peoples R China
推荐引用方式
GB/T 7714
Tang, Xianzhe,Wu, Zhanyu,Liu, Wei,et al. Exploring effective ways to increase reliable positive samples for machine learning-based urban waterlogging susceptibility assessments[J]. JOURNAL OF ENVIRONMENTAL MANAGEMENT,2023,344:9.
APA Tang, Xianzhe,Wu, Zhanyu,Liu, Wei,Tian, Juwei,&Liu, Luo.(2023).Exploring effective ways to increase reliable positive samples for machine learning-based urban waterlogging susceptibility assessments.JOURNAL OF ENVIRONMENTAL MANAGEMENT,344,9.
MLA Tang, Xianzhe,et al."Exploring effective ways to increase reliable positive samples for machine learning-based urban waterlogging susceptibility assessments".JOURNAL OF ENVIRONMENTAL MANAGEMENT 344(2023):9.

入库方式: OAI收割

来源:地理科学与资源研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。