Chinese Academy of Sciences Institutional Repositories Grid
A graded proportion method of training sample selection for updating conventional soil maps

Document type: Journal article

Authors: Liu, Xueqi [3,4]; Zhu, A-Xing [1,2,4,6]; Yang, Lin [5,6]; Pei, Tao [6]; Liu, Junzhi [4]; Zeng, Canying [4]; Wang, Desheng [4]
Journal: GEODERMA
Publication date: 2020
Volume: 357; Pages: 9
Keywords: Training sample selection method; Data mining model; Update conventional soil map; Soil-environmental relationships
ISSN: 0016-7061
DOI: 10.1016/j.geoderma.2019.113939
Corresponding author: Yang, Lin (yanglin@nju.edu.cn)
Abstract: Selection of training samples is a vital step in updating conventional soil maps with data mining models. The quality of the training samples significantly affects the mapping results and the accuracy of the updated soil maps. The area-weighted proportion method is a common way of generating training samples. However, when allocating sample sizes, this method usually assigns too small a weight to soil types covering small areas and too large a weight to those covering large areas, which produces unreasonable proportions of sample numbers among soil types and thereby biases the representation of soil-environmental relationships for those soil types. Meanwhile, random selection of training samples within a soil type may generate 'noise' samples located in the transition areas between soil types. Both issues are likely to reduce the accuracy of the updated soil maps. In this study, a new method was developed to select training samples based on grading soil types according to their area coverage. The method consists of two steps. The first step determines the number of training samples for each soil type based on soil type grading, so as to maintain reasonable proportions of sample numbers among soil types with different area coverage. The second step selects typical (representative) samples for each soil type from the conventional soil map, to avoid generating 'noise' samples. To evaluate the proposed method, it was compared with three other training sample selection methods at four training sample sizes. Each method was run 100 times at each sample size to generate training sample datasets and to evaluate effectiveness and stability. Random forest was employed to generate updated soil maps for a small watershed in Raffelson, Wisconsin (USA). The validation results showed that the graded proportion method effectively resolved the imbalance of training samples, caused by the area-weighted proportion strategy, among soil types with large differences in area coverage. Thus, training samples generated using the proposed method usually yielded more accurate and reasonable mapping results than those generated using the area-weighted proportion strategy. Furthermore, the performance of the proposed method was more stable than that of the area-weighted proportion strategy as the training sample size increased. It is concluded that the proposed method is an effective training sample selection method for data mining models used to update conventional soil maps.
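Illustrative note (not part of the original record): the contrast drawn in the abstract between area-weighted and graded allocation of sample numbers can be sketched in a few lines of Python. The abstract does not state the actual grading rule, so the log-scale grades below, the function names, and the example areas are all assumptions made purely for illustration; this is not the authors' implementation, and it covers only the first step (sample-size allocation), not the selection of typical samples.

```python
import math


def area_weighted_allocation(areas, total_samples):
    """Allocate samples in direct proportion to each soil type's mapped area."""
    total_area = sum(areas.values())
    return {
        soil: round(total_samples * area / total_area)
        for soil, area in areas.items()
    }


def graded_allocation(areas, total_samples):
    """Allocate samples in proportion to an area grade instead of raw area.

    ASSUMPTION: the grade is 1 + floor(log10(area / smallest area)).
    This rule is hypothetical; the abstract does not specify the grading.
    """
    smallest = min(areas.values())
    grades = {
        soil: 1 + math.floor(math.log10(area / smallest))
        for soil, area in areas.items()
    }
    total_grade = sum(grades.values())
    return {
        soil: round(total_samples * grade / total_grade)
        for soil, grade in grades.items()
    }


# Hypothetical soil types and areas (hectares), chosen only for illustration.
example_areas = {"A": 900.0, "B": 60.0, "C": 5.0}

print(area_weighted_allocation(example_areas, 100))
print(graded_allocation(example_areas, 100))
```

With these made-up areas, the area-weighted rule leaves the smallest soil type with almost no samples, while the graded rule compresses the range so that small-area types keep a usable share. Rounding each share independently means the totals are not guaranteed to equal `total_samples` for arbitrary inputs.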
WOS keywords: RANDOM FORESTS; KNOWLEDGE; UNITS; TREE
Funding projects: National Natural Science Foundation of China [41431177]; National Natural Science Foundation of China [41971054 41871300]; National Basic Research Program of China [2015CB954102]; PAPD; Outstanding Innovation Team in Colleges and Universities in Jiangsu Province; Vilas Associate Award; Hammel Faculty Fellow Award; University of Wisconsin-Madison
WOS research area: Agriculture
Language: English
WOS accession number: WOS:000496837300026
Publisher: ELSEVIER
Funding organizations: National Natural Science Foundation of China; National Basic Research Program of China; PAPD; Outstanding Innovation Team in Colleges and Universities in Jiangsu Province; Vilas Associate Award; Hammel Faculty Fellow Award; University of Wisconsin-Madison
Source URL: http://ir.igsnrr.ac.cn/handle/311030/132010
Collection: Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences
Corresponding author: Yang, Lin
Author affiliations:
1. Jiangsu Ctr Collaborat Innovat Geog Informat Reso, 1 Wenyuan Rd, Nanjing 210023, Jiangsu, Peoples R China
2. Univ Wisconsin, Dept Geog, Madison, WI 53706 USA
3. Beijing Normal Univ, Fac Geog Sci, Beijing 100875, Peoples R China
4. Nanjing Normal Univ, Key Lab Virtual Geog Environm, Minist Educ, 1 Wenyuan Rd, Nanjing 210023, Jiangsu, Peoples R China
5. Nanjing Univ, Sch Geog & Ocean Sci, Nanjing 210023, Jiangsu, Peoples R China
6. Chinese Acad Sci, Inst Geog Sci & Nat Resources Res, State Key Lab Resources & Environm Informat Syst, Beijing 100101, Peoples R China
Recommended citation:
GB/T 7714: Liu, Xueqi, Zhu, A-Xing, Yang, Lin, et al. A graded proportion method of training sample selection for updating conventional soil maps[J]. GEODERMA, 2020, 357: 9.
APA: Liu, Xueqi., Zhu, A-Xing., Yang, Lin., Pei, Tao., Liu, Junzhi., ... & Wang, Desheng. (2020). A graded proportion method of training sample selection for updating conventional soil maps. GEODERMA, 357, 9.
MLA: Liu, Xueqi, et al. "A graded proportion method of training sample selection for updating conventional soil maps". GEODERMA 357 (2020): 9.

Ingest method: OAI harvesting

Source: Institute of Geographic Sciences and Natural Resources Research

