Machine learning models with innovative outlier detection techniques for predicting heavy metal contamination in soils
文献类型:期刊论文
作者 | Proshad, Ram4,5; Asha, S. M. Asharaful Abedin3; Tan, Rong2; Lu, Yineng2; Abedin, Md Anwarul1; Ding, Zihao2; Zhang, Shuangting2; Li, Ziyi2; Chen, Geng2; Zhao, Zhuanjun5 |
刊名 | JOURNAL OF HAZARDOUS MATERIALS
![]() |
出版日期 | 2025-01-05 |
卷号 | 481页码:23 |
关键词 | Heavy metals Machine learning XGBoost model Outliers DBSCAN LISA |
ISSN号 | 0304-3894 |
DOI | 10.1016/j.jhazmat.2024.136536 |
英文摘要 | Machine learning (ML) models for accurately predicting heavy metals with inconsistent outputs have improved owing to dataset outliers, which influence model reliability and accuracy. A comprehensive technique that combines machine learning and advanced statistical methods was applied to assess data outlier's effects on ML models. Ten ML models with three outlier detection methods predicted Cr, Ni, Cd, and Pb in Narayanganj soils. XGBoost with density-based spatial clustering of applications with noise (DBSCAN) improved model efficacy (R2). The R2 of Cr, Ni, Cd, and Pb was considerably enhanced by 11.11 %, 6.33 %, 14.47 %, and 5.68 %, respectively, indicating that outliers affected the model's HM prediction. Soil factors affected Cr (80 %), Ni (72.61 %), Cd (53.35 %), and Pb (63.47 %) concentrations based on feature importance. Contamination factor prediction showed considerable contamination for Cr, Ni, and Cd. LISA revealed Cd (55.4 %), Cr (49.3 %), and Pb (47.3 %) as the significant pollutant (p < 0.05). Moran's I index values for Cr, Ni, Cd, and Pb were 0.65, 0.58, 0.60, and 0.66, respectively, indicating strong positive spatial autocorrelation and clusters with similar contamination. Finally, this work successfully assessed the influence of data outliers on the ML model for soil HM contamination prediction, identifying crucial regions that require rapid conservation measures. |
WOS关键词 | POLLUTION |
资助项目 | Science and Technology Research Program of Institute of Mountain Hazards and Environment, Chinese Academy of Sciences[IMHE-ZDRW-05E3K2290290] ; CAS-TWAS President Fellowship[2019A8006338002] |
WOS研究方向 | Engineering ; Environmental Sciences & Ecology |
语种 | 英语 |
WOS记录号 | WOS:001363055900001 |
出版者 | ELSEVIER |
资助机构 | Science and Technology Research Program of Institute of Mountain Hazards and Environment, Chinese Academy of Sciences ; CAS-TWAS President Fellowship |
源URL | [http://ir.imde.ac.cn/handle/131551/58547] ![]() |
专题 | 中国科学院水利部成都山地灾害与环境研究所 |
通讯作者 | Zhao, Zhuanjun |
作者单位 | 1.Bangladesh Agr Univ, Dept Soil Sci, Lab Environm & Sustainable Dev, Mymensingh 2202, Bangladesh 2.Lanzhou Univ, Coll Earth & Environm Sci, Lanzhou 730000, Peoples R China 3.Geol Survey Bangladesh, Dhaka 1000, Bangladesh 4.Univ Chinese Acad Sci, Beijing 100049, Peoples R China 5.Chinese Acad Sci, Inst Mt Hazards & Environm, State Key Lab Mt Hazards & Engn Safety, Chengdu 610041, Sichuan, Peoples R China |
推荐引用方式 GB/T 7714 | Proshad, Ram,Asha, S. M. Asharaful Abedin,Tan, Rong,et al. Machine learning models with innovative outlier detection techniques for predicting heavy metal contamination in soils[J]. JOURNAL OF HAZARDOUS MATERIALS,2025,481:23. |
APA | Proshad, Ram.,Asha, S. M. Asharaful Abedin.,Tan, Rong.,Lu, Yineng.,Abedin, Md Anwarul.,...&Zhao, Zhuanjun.(2025).Machine learning models with innovative outlier detection techniques for predicting heavy metal contamination in soils.JOURNAL OF HAZARDOUS MATERIALS,481,23. |
MLA | Proshad, Ram,et al."Machine learning models with innovative outlier detection techniques for predicting heavy metal contamination in soils".JOURNAL OF HAZARDOUS MATERIALS 481(2025):23. |
入库方式: OAI收割
来源:成都山地灾害与环境研究所
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。