Journal of Jilin University(Earth Science Edition) ›› 2025, Vol. 55 ›› Issue (5): 1629-1643.doi: 10.13278/j.cnki.jjuese.20230139

Previous Articles     Next Articles

Construction and Comparison of Models for Predicting Selenium Rich Soil Based on Machine Learning: A Case Study of Youshan Area,Xinfeng County, Jiangxi Province

Yang Lan1,Wang Yun1,Zou Yongjun2,Hu Baoqun1,Li Mangen1,Zhang An1,Zhu Manhuai1   

  1. 1. Key Laboratory of Digital Land and Resources of Jiangxi Province,East China University of Technology,

    Nanchang 330013,China

    2. Geological Environment Monitoring Institute of Jiangxi Geological Exploration Institute,Nanchang 330001,China

  • Online:2025-09-26 Published:2025-11-15
  • Supported by:
    Supported by the Project of China Geological Survey (DD20160321),the Key Research and Development Plan Project of Jiangxi Province (20203BBG72W011),the Doctoral Startup Fund of East China University of Technology (DHBK2019051), the Project of Key Laboratory for Digital Land and Resources of Jiangxi Province, East China University of Technology (DLLJ202205) and the Project of Jiangxi Postgraduate Innovation Special Fund (YC2022-S600)

Abstract:

In order to find selenium rich soil quickly, efficiently and accurately using selenium free data, it is necessary to build the best model to predict selenium rich soil. 502 data sets were selected from 1 277 1∶50 000 surface soil geochemical data. With w(Zn),w(K2O),w(P),w(Mo),w(Mn),w(Cr),pH,D(Devonian) as independent variables and Se rich or not as dependent variables, SPSS Modeler 18 software was used to build binary Logistic regression model, multi-layer perceptron neural network model, random forest model and support vector machine model (linear, multinomial, radial basis function, Sigmoid) for predicting Se rich soil, and the measured data of 35 soil samples were used for verification. The results show that, using binary Logistic regression model, multilayer perceptron neural network model, random forest model and support vector machine model (linear, polynomial, radial basis function, Sigmoid), the overall accuracy of prediction and verification of the seven prediction models and were 88.8% and 94.3%, 91.0% and 97.1%, 96.6% and 97.1%, 87.9% and 97.1%, 86.1% and 94.3%, 86.9% and 94.3%, 80.3% and 91.4%. The AUC were 0.948, 0.950, 0.993, 0.937, 0.945, 0.928 and 0.873, respectively. The accuracy and stability of the random forest model are the best. Meanwhile, this study identified clean selenium-rich soil and green selenium-rich mountain rice, indicating that this method is feasible in the prediction of selenium-rich soil, and it can be further extended to geological prospecting and environmental monitoring.

Key words: selenium rich soil, machine learning, binary Logistic regression model, multilayer perceptron neural network model, random forest, support vector machine model

CLC Number: 

  • P59
[1] Cao Zhimin, Zhang Li, Zheng Bing, Han Jian.

Lithology Identification Using Extra Trees Based on SMOTE for Data Balancing [J]. Journal of Jilin University(Earth Science Edition), 2025, 55(4): 1372-1386.

[2] Zhang Shengyu, Shen Wenchao, Su Xiaosi.  Risk Assessment of Regional Groundwater Nitrate Pollution Based on Random Forest Method [J]. Journal of Jilin University(Earth Science Edition), 2025, 55(3): 943-956.
[3] Wang Mingchang, Yu Haibin, Zeng Zhaofa, Wang Dian, Han Fuxing, Zhang Jian, Luo Xiujie, Leng Liang, Liu Ziwei.  Prediction of Urban Road Collapse Susceptibility Based on Multi-Source Remote Sensing Data [J]. Journal of Jilin University(Earth Science Edition), 2025, 55(3): 1028-1038.
[4] Cao Zhimin , , Ding Lu, Han Jian, , Hao Lechuan, . Large-Scale Difference Super-Resolution of Logging Curves Based on Integrated Machine Learning [J]. Journal of Jilin University(Earth Science Edition), 2025, 55(2): 670-685.
[5] Lyu Huaxing, Chen Zhaoming, Zhang Zhenbo, Jiang Dapeng, Li Kecheng, Guo Wei. Application of Machine Learning High-Resolution Fusion Inversion in Stratigraphic Correlation: A Case Study of Kaiping A Structural #br# Belt in Kaiping Sag of the Pearl River Mouth Basin [J]. Journal of Jilin University(Earth Science Edition), 2025, 55(1): 289-297.
[6] An Xuelian, Mi Changlin, Sun Deliang, Wen Haijia, Li Xiaoqin, Gu Qingyu, Ding Yuekai. Comparison of  Landslide Susceptibility in Three Gorges Reservoir Area Based on Different Evaluation Units——Take Yunyang County in Chongqing as an Example [J]. Journal of Jilin University(Earth Science Edition), 2024, 54(5): 1629-1644.
[7] Wang Xinling, Zhu Xinyi, Zhang Hongbing, Sun Bo, Xu Kexin. Lithology Identification Method for Logging While Drilling Based on Random Tree Embedding [J]. Journal of Jilin University(Earth Science Edition), 2024, 54(2): 701-708.
[8] Wang Mingchang, Ding Wen, Zhao Jingzheng, Wu Linlin, Wang Fengyan, Ji Xue. Remote Sensing Identification of Dendrolimus Superans Infestation Based on Knowledge Graph and Random Forest [J]. Journal of Jilin University(Earth Science Edition), 2023, 53(6): 2006-2017.
[9] Wang Xuedong, Zhang Chaobiao, Wang Cui, Zhu Yongdong, Wang Haipeng. Geological Disaster Susceptibility in Helong City Based on Logistic Regression and Random Forest [J]. Journal of Jilin University(Earth Science Edition), 2022, 52(6): 1957-1970.
[10] Yang Guohua, Li Wanlu, Meng Bo. Spatiotemporal Distribution of Groundwater Ammonia Nitrogen Based on Machine Learning Methods#br# [J]. Journal of Jilin University(Earth Science Edition), 2022, 52(6): 1982-1995.
[11] Hou Xianmu, Wang Fuyong, Zai Yun, Lian Peiqing. Prediction of Carbonate Porosity and Permeability Based on Machine Learning and Logging Data [J]. Journal of Jilin University(Earth Science Edition), 2022, 52(2): 644-653.
[12] Wang Mingchang, , Liu Peng, Chen Xueye, Wang Fengyan, Song Yulian, Liu Hanyuan. Land Expansion of Urban Construction in the Three Provinces of Northeast China Based on Google Earth Engine [J]. Journal of Jilin University(Earth Science Edition), 2022, 52(1): 292-.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!