Full Text Available

Note: Clicking the button above will open the full text document at the original institutional repository in a new window.

The effect of measurement error and land management covariates on machine learning derived digital soil maps

Malatji, Boitumelo Raymond. 2024. The effect of measurement error and land management covariates on machine learning derived digital soil maps. Unpublished masters dissertation. Stellenbosch : Stellenbosch University [online]. Available: https://scholar.sun.ac.za/handle/10019.1/131815 Thesis (MScA...

Full description

Saved in:
Bibliographic Details
Main Author: Malatji, Boitumelo Raymond
Other Authors: Van der Westhuizen, Stephan
Format: Thesis
Published: Stellenbosch : Stellenbosch University 2025
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1867613964926451712
access_status_str Open Access
author Malatji, Boitumelo Raymond
author2 Van der Westhuizen, Stephan
author_browse Malatji, Boitumelo Raymond
Van der Westhuizen, Stephan
author_facet Van der Westhuizen, Stephan
Malatji, Boitumelo Raymond
author_sort Malatji, Boitumelo Raymond
collection Thesis
dc_rights_str_mv Stellenbosch University
description Malatji, Boitumelo Raymond. 2024. The effect of measurement error and land management covariates on machine learning derived digital soil maps. Unpublished masters dissertation. Stellenbosch : Stellenbosch University [online]. Available: https://scholar.sun.ac.za/handle/10019.1/131815 Thesis (MScAgric)--Stellenbosch University, 2024.
format Thesis
id oai:scholar.sun.ac.za:10019.1/131815
institution Stellenbosch University (South Africa)
last_indexed 2026-06-10T12:44:30.757Z
license_str Other — see source repository
provenance_str_mv Harvested via OAI-PMH from SUNScholar — Stellenbosch University Repository
publishDate 2025
publishDateRange 2025
publishDateSort 2025
publisher Stellenbosch : Stellenbosch University
publisherStr Stellenbosch : Stellenbosch University
record_format dspace
source_str SUNScholar — Stellenbosch University Repository
spelling oai:scholar.sun.ac.za:10019.1/131815 The effect of measurement error and land management covariates on machine learning derived digital soil maps Malatji, Boitumelo Raymond Van der Westhuizen, Stephan Clarke, Catherine E. Stellenbosch University. Faculty of AgriSciences. Dept. of Soil Science. Digital soil mapping -- Data processing Machine learning Errors -- Measurement Soil organic carbon Errors-in-variables models Land use -- Management UCTD Malatji, Boitumelo Raymond. 2024. The effect of measurement error and land management covariates on machine learning derived digital soil maps. Unpublished masters dissertation. Stellenbosch : Stellenbosch University [online]. Available: https://scholar.sun.ac.za/handle/10019.1/131815 Thesis (MScAgric)--Stellenbosch University, 2024. ENGLISH ABSTRACT: Digital soil mapping (DSM) has become a fundamental tool used to create detailed spatial predictions of soil properties such as soil organic carbon (SOC) and clay content. Despite its advantages, its accuracy can be hindered by various sources of error, such as measurement errors. When these errors are not accounted for, they propagate to the DSM outputs, resulting in soil maps that are not measurement error-free. Since these soil maps help in making informed decisions in the agricultural sciences, and in policy making, it is important to account for these errors. Moreover, these soil properties vary across depths, and as depth-specific soil data is crucial for a thorough understanding of soil formation processes and for assessing potential environmental impacts, these properties are predicted at different soil depths. Therefore, models that are better in predicting these properties are necessary for obtaining accurate predictions. In addition, land management practices have a significant effect on the spatial distribution of SOC in farmlands. Therefore, incorporating this information as covariates for SOC predictions can be vital for improving the prediction accuracy of SOC maps. This study investigated the effect of accounting for measurement errors on digital maps of SOC and clay prediction, and the effect of land management covariates on SOC prediction using machine learning (ML). Moreover, this study compared the model performance between a depth-specific modelling and a three-dimensional (3D) modelling that uses depth as a covariate. This study also investigated how incorporating land management covariates influenced the prediction of SOC. The effect of measurement error was accounted for by incorporating measurement error variances (MEVs) derived from the mid-infrared (MIR) repeats, as weights in the calibration of a ML model. In this way measurements with larger MEVs receive less weight when a ML model is fitted. This was done based on SOC and clay, for 0-10 cm, 10-30 cm, and 30-50 cm depths, by comparing random forest model (RF) with measurement error-filtered RF model (FRF). The results showed that FRF outperformed RF for both SOC and clay. Moreover, a depth-specific model was compared with a 3D model, and the results showed that both modelling approaches produced comparable results for both SOC and clay. However, the results also showed that model performance decreased with increasing soil depth for both SOC and clay using both modelling approaches. The effect of land management covariates on the prediction of SOC was assessed by comparing different sets of covariates with and without land management covariates. Three land management covariates used were land management practices, land use, and crop ecotope. The results showed that land management covariates improved model performance in the 0-10 cm depth. Farm and land use were among the most important covariates in predicting SOC at this depth. However, no model improvement was observed in the 10-30 cm and 30-50 cm. The overall results of this study showed that accounting for measurement errors are vital for improving model prediction. In addition, incorporating land management covariates in predicting topsoil SOC is important for obtaining better results. AFRIKAANSE OPSOMMING: Digitale grondkartering (DGK) het 'n fundamentele hulpmiddel geword wat gebruik word om gedetailleerde ruimtelike voorspellings van grondeienskappe soos grondorganiese koolstof (GOK) en klei-inhoud te skep. Ten spyte van die voordele van DGK, kan die akkuraatheid daarvan deur verskeie bronne van foute, soos meetfoute, belemmer word. Wanneer hierdie foute nie in ag geneem word nie, dra dit oor na die DGK-afvoer, wat lei tot grondkaarte wat nie meetfoutvry is nie. Aangesien hierdie grondkaarte gebruik word om ingeligte besluite in die landbouwetenskappe en beleidsvorming te neem, is dit belangrik om hierdie foute in ag te neem. Verder wissel hierdie grondeienskappe oor diepte, en aangesien dieptespesifieke gronddata noodsaaklik is vir 'n deeglike begrip van grondvormingsprosesse en vir die assessering van moontlike omgewingsimpakte, word hierdie kenmerke op verskillende grond-dieptes voorspel. Daarom is modelle wat beter is in die voorspelling van hierdie kenmerke nodig om akkurate voorspellings te verkry. Boonop het grondbestuurspraktyke 'n beduidende invloed op die ruimtelike verspreiding van GOK in landerye. Daarom kan die inkorporering van hierdie inligting as kovariate vir GOK-voorspellings van kritieke belang wees vir die verbetering van die voorspelling-akkuraatheid van GOK-kaarte. Hierdie studie het die effek van die inagneming van meetfoute op digitale kaarte van GOK- en klei-inhoud voorspellings ondersoek, asook die invloed van grondbestuur-kovariate op GOK-voorspellings deur middel van masjienleer (ML). Verder het hierdie studie die modelprestasie vergelyk tussen 'n dieptespesifieke modellering en 'n driedimensionele (3D) modellering wat diepte as 'n kovariaat gebruik. Die studie het ook ondersoek ingestel na hoe die inkorporering van grondbestuur-kovariate die voorspelling van GOK beïnvloed het. Die effek van meetfoute is in ag geneem deur meetfoutvariansies (MFV's), afgelei van midinfrarooi (MIR) herhalings, as gewigte in die kalibrasie van 'n ML-model in te sluit. Op hierdie manier ontvang metings met groter MFV's kleiner gewig wanneer 'n ML-model gepas word. Dit is gedoen vir GOK en klei, vir dieptes van 0-10 cm, 10-30 cm, en 30-50 cm, deur 'n ewekansige ‘random forest’ (RF) te vergelyk met 'n meetfout-gefiltreerde RF-model (FRF). Die resultate het getoon dat FRF beter gevaar het as RF vir beide GOK en klei. Verder is 'n dieptespesifieke model vergelyk met 'n 3D-model, en die resultate het getoon dat beide modelleringsbenaderings vergelykbare resultate opgelewer het vir beide GOK en klei. Die resultate het egter ook getoon dat modelprestasie afgeneem het met toenemende grond-diepte vir beide GOK en klei met beide modelleringsbenaderings. Die effek van grondbestuur-kovariate op die voorspelling van GOK is beoordeel deur verskillende stelle kovariate met en sonder grondbestuur-kovariate te vergelyk. Drie grondbestuur-kovariate wat gebruik is, was grondbestuurspraktyke, grondgebruik en gewasekotoop. Die resultate het getoon dat grondbestuur-kovariate modelprestasie verbeter het in die 0-10 cm diepte. Plaas- en grondgebruik was onder die belangrikste kovariate in die voorspelling van GOK op hierdie diepte. Geen modelverbetering is egter waargeneem in die 10-30 cm en 30-50 cm dieptes nie. Die algehele resultate van hierdie studie het getoon dat die inagneming van meetfoute noodsaaklik is vir die verbetering van modelvoorspellings. Daarbenewens is die inkorporering van grondbestuur-kovariate in die voorspelling van bogrondse GOK belangrik om beter resultate te verkry. Masters 2025-03-31T07:02:03Z 2025-03-31T07:02:03Z 2024-12 Thesis https://scholar.sun.ac.za/handle/10019.1/131815 Stellenbosch University xiii, 85 pages : illustrations, maps application/pdf Stellenbosch : Stellenbosch University
spellingShingle Digital soil mapping -- Data processing
Machine learning
Errors -- Measurement
Soil organic carbon
Errors-in-variables models
Land use -- Management
UCTD
Malatji, Boitumelo Raymond
The effect of measurement error and land management covariates on machine learning derived digital soil maps
title The effect of measurement error and land management covariates on machine learning derived digital soil maps
title_full The effect of measurement error and land management covariates on machine learning derived digital soil maps
title_fullStr The effect of measurement error and land management covariates on machine learning derived digital soil maps
title_full_unstemmed The effect of measurement error and land management covariates on machine learning derived digital soil maps
title_short The effect of measurement error and land management covariates on machine learning derived digital soil maps
title_sort effect of measurement error and land management covariates on machine learning derived digital soil maps
topic Digital soil mapping -- Data processing
Machine learning
Errors -- Measurement
Soil organic carbon
Errors-in-variables models
Land use -- Management
UCTD
url https://scholar.sun.ac.za/handle/10019.1/131815
work_keys_str_mv AT malatjiboitumeloraymond theeffectofmeasurementerrorandlandmanagementcovariatesonmachinelearningderiveddigitalsoilmaps
AT malatjiboitumeloraymond effectofmeasurementerrorandlandmanagementcovariatesonmachinelearningderiveddigitalsoilmaps