Predictive soil mapping using random forest models: Applications in pH and soil organic matter assessment
DOI:
https://doi.org/10.14719/pst.3865
Keywords:
Digital Soil Mapping (DSM), Random Forest (RF), pH, Soil Organic Matter (SOM), Conditioned Latin Hypercube Sampling (cLHS), Remote sensing
Abstract
Digital Soil Mapping (DSM) offers a scalable and efficient alternative to traditional soil analysis, which is labor-intensive, slow and limited to low spatial resolution. By combining machine learning with remote sensing, DSM overcomes these limitations and improves the accuracy and efficiency of soil property assessment. This study, conducted across Tamil Nadu, India, applied DSM with Random Forest (RF) models to predict two key soil properties: pH and Soil Organic Matter (SOM). We used Conditioned Latin Hypercube Sampling (cLHS) to select sampling points and the Boruta algorithm to identify the most relevant covariates for modeling. The RF models were tuned by grid search over a range of 500 to 2000 trees (ntree) and mtry values from 1 to 11. The best-performing configuration used 2000 trees with mtry set to 1, giving Root Mean Square Error (RMSE) values of 0.71 for SOM and 0.60 for pH. Remote sensing indices were the most important predictors of SOM, whereas pH was influenced by both terrain features and remote sensing data. Compared with previous studies, this work improves on both sampling optimization and model configuration, leading to better predictive performance. These results can support sustainable land-use planning, agricultural productivity and environmental management.
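The tuning step described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' code: the ntree step of 500 is an assumption (the abstract gives only the 500 to 2000 range), and `score_fn` is a hypothetical callback standing in for "fit an RF with these settings and return its validation RMSE".

```python
import math
from itertools import product


def rmse(observed, predicted):
    """Root Mean Square Error between two equal-length sequences."""
    if len(observed) != len(predicted):
        raise ValueError("observed and predicted must have the same length")
    squared_errors = [(o - p) ** 2 for o, p in zip(observed, predicted)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Search space taken from the abstract: ntree 500..2000, mtry 1..11.
# ASSUMPTION: ntree is stepped by 500; the abstract does not state the step.
NTREE_VALUES = list(range(500, 2001, 500))
MTRY_VALUES = list(range(1, 12))


def grid_search(score_fn):
    """Return ((ntree, mtry), rmse) for the lowest-RMSE grid cell.

    score_fn(ntree, mtry) is a hypothetical callback that fits a model
    with the given settings and returns its validation RMSE.
    """
    best_params, best_score = None, math.inf
    for ntree, mtry in product(NTREE_VALUES, MTRY_VALUES):
        score = score_fn(ntree, mtry)
        if score < best_score:
            best_params, best_score = (ntree, mtry), score
    return best_params, best_score


if __name__ == "__main__":
    # Synthetic stand-in scorer, used only to exercise the loop.
    # It is NOT derived from the study's data or results.
    def placeholder_score(ntree, mtry):
        return mtry + 1000.0 / ntree

    print(grid_search(placeholder_score))
```

In practice `score_fn` would wrap a real model fit. The names ntree and mtry match the arguments of R's `randomForest` package; in scikit-learn the rough equivalents are `n_estimators` and `max_features` on `RandomForestRegressor`. Which implementation the study used is not stated in the abstract.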
Published
Versions
- 12-10-2024 (2)
- 11-10-2024 (1)
License
Copyright (c) 2024 B Bhanukiran Reddy, Maragatham S, Santhi R, Balachandar D, Vijayalakshmi D, Davamani V, Vasu D, Gopalakrishnan M
This work is licensed under a Creative Commons Attribution 4.0 International License.
Copyright and Licence details of published articles
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
Open Access Policy
Plant Science Today is an open access journal. There is no registration required to read any article. All published articles are distributed under the terms of the Creative Commons Attribution License (CC Attribution 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited (https://creativecommons.org/licenses/by/4.0/). Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.