中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Machine learning models with innovative outlier detection techniques for predicting heavy metal contamination in soils

文献类型:期刊论文

作者Proshad, Ram4,5; Asha, S. M. Asharaful Abedin3; Tan, Rong2; Lu, Yineng2; Abedin, Md Anwarul1; Ding, Zihao2; Zhang, Shuangting2; Li, Ziyi2; Chen, Geng2; Zhao, Zhuanjun5
刊名JOURNAL OF HAZARDOUS MATERIALS
出版日期2025-01-05
卷号481页码:23
关键词Heavy metals Machine learning XGBoost model Outliers DBSCAN LISA
ISSN号0304-3894
DOI10.1016/j.jhazmat.2024.136536
英文摘要

Machine learning (ML) models for accurately predicting heavy metals with inconsistent outputs have improved owing to dataset outliers, which influence model reliability and accuracy. A comprehensive technique that combines machine learning and advanced statistical methods was applied to assess data outlier's effects on ML models. Ten ML models with three outlier detection methods predicted Cr, Ni, Cd, and Pb in Narayanganj soils. XGBoost with density-based spatial clustering of applications with noise (DBSCAN) improved model efficacy (R2). The R2 of Cr, Ni, Cd, and Pb was considerably enhanced by 11.11 %, 6.33 %, 14.47 %, and 5.68 %, respectively, indicating that outliers affected the model's HM prediction. Soil factors affected Cr (80 %), Ni (72.61 %), Cd (53.35 %), and Pb (63.47 %) concentrations based on feature importance. Contamination factor prediction showed considerable contamination for Cr, Ni, and Cd. LISA revealed Cd (55.4 %), Cr (49.3 %), and Pb (47.3 %) as the significant pollutant (p < 0.05). Moran's I index values for Cr, Ni, Cd, and Pb were 0.65, 0.58, 0.60, and 0.66, respectively, indicating strong positive spatial autocorrelation and clusters with similar contamination. Finally, this work successfully assessed the influence of data outliers on the ML model for soil HM contamination prediction, identifying crucial regions that require rapid conservation measures.

WOS关键词POLLUTION
资助项目Science and Technology Research Program of Institute of Mountain Hazards and Environment, Chinese Academy of Sciences[IMHE-ZDRW-05E3K2290290] ; CAS-TWAS President Fellowship[2019A8006338002]
WOS研究方向Engineering ; Environmental Sciences & Ecology
语种英语
WOS记录号WOS:001363055900001
出版者ELSEVIER
资助机构Science and Technology Research Program of Institute of Mountain Hazards and Environment, Chinese Academy of Sciences ; CAS-TWAS President Fellowship
源URL[http://ir.imde.ac.cn/handle/131551/58547]  
专题中国科学院水利部成都山地灾害与环境研究所
通讯作者Zhao, Zhuanjun
作者单位1.Bangladesh Agr Univ, Dept Soil Sci, Lab Environm & Sustainable Dev, Mymensingh 2202, Bangladesh
2.Lanzhou Univ, Coll Earth & Environm Sci, Lanzhou 730000, Peoples R China
3.Geol Survey Bangladesh, Dhaka 1000, Bangladesh
4.Univ Chinese Acad Sci, Beijing 100049, Peoples R China
5.Chinese Acad Sci, Inst Mt Hazards & Environm, State Key Lab Mt Hazards & Engn Safety, Chengdu 610041, Sichuan, Peoples R China
推荐引用方式
GB/T 7714
Proshad, Ram,Asha, S. M. Asharaful Abedin,Tan, Rong,et al. Machine learning models with innovative outlier detection techniques for predicting heavy metal contamination in soils[J]. JOURNAL OF HAZARDOUS MATERIALS,2025,481:23.
APA Proshad, Ram.,Asha, S. M. Asharaful Abedin.,Tan, Rong.,Lu, Yineng.,Abedin, Md Anwarul.,...&Zhao, Zhuanjun.(2025).Machine learning models with innovative outlier detection techniques for predicting heavy metal contamination in soils.JOURNAL OF HAZARDOUS MATERIALS,481,23.
MLA Proshad, Ram,et al."Machine learning models with innovative outlier detection techniques for predicting heavy metal contamination in soils".JOURNAL OF HAZARDOUS MATERIALS 481(2025):23.

入库方式: OAI收割

来源:成都山地灾害与环境研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。