基于地统计学与机器学习模型的宝清县表层有机碳含量预测及比较研究

    Prediction and comparison of organic carbon content in topsoils based on geostatistics and machine learning models: A case study of Baoqing County

    • 摘要: 为了准确预测黑土地区县域土壤有机碳含量,满足县域农业生产和“双碳”目标的需要,本研究利用宝清县黑土地地表基质调查取得的427个土壤样本数据,通过确定性插值法(反距离权重法)、地统计学方法(普通克里金法)和机器学习方法(随机森林模型法)建立评估模型,进行宝清县表层土壤有机碳含量的预测,并比较不同方法的预测精度和效果。结果显示,反距离权重法预测研究区的土壤有机碳含量均值为27.21×10-3,普通克里金法预测的均值为26.33×10-3,随机森林模型预测的均值为32.05×10-3。随机森林模型在均方根误差(RMSE)、平均绝对误差(MAE)和决定系数(R2)等指标上均优于反距离权重法和普通克里金法,且随机森林模型训练集和验证集的决定系数均达到0.73和0.53,精度均显著高于反距离权重法和普通克里金法,反映出随机森林模型通过环境变量的非线性交互作用,能够更充分地挖掘数据中的潜在规律。总体看,结合多环境变量建立的随机森林模型是评估宝清县表层土壤有机碳含量的最佳模型,且预测精度较好,这对评估黑土地区县域农业生产土壤有机质空间差异和“双碳”目标的区域差异分析具有重要的理论和方法参考。

       

      Abstract: This study aims to accurately predict the organic carbon content in black soils at the county level, thereby supporting county-level agricultural production and carbon peak and neutrality goals. This study examined 427 soil samples obtained from a surface substrate survey of the black soil area in Baoqing County. Employing deterministic interpolation (inverse distance weighting, IDW), geostatistics (ordinary Kriging method, OK), and machine learning (random forest, RF), this study constructed assessment models to predict the organic carbon content in topsoils in Baoqing County and to compare their prediction accuracy and performance. The results show that the IDW, OK, and RF models yielded average organic carbon contents of 27.21×10-3, 26.33×10-3, and 32.05×10-3, respectively. The RF model outperformed the other two models in terms of root mean square error (RMSE), mean absolute error (MAE), and the coefficient of determination (R2). Specifically, the RF model achieved R2 values of 0.73 and 0.53 on training and validation sets, respectively, suggesting significantly higher accuracy. This superior performance demonstrates that the RF model can more fully explore potential patterns in data through the nonlinear interaction of environmental variables. Overall, the RF model, incorporating multiple environmental variables, proved to be the optimal approach for predicting the organic carbon content in topsoils in Baoqing County, demonstrating high prediction accuracy. This study provides valuable theoretical and methodological insights for assessing the spatial variations in soil organic matter relevant to county-level agricultural production and regional differences in carbon peak and neutrality goals within black soil areas.

       

    /

    返回文章
    返回