Geostatistics or machine learning for mapping soil attributes and agricultural practices
Keywords:
machine learning, digital soil mapping, kriging, remote sensing, decision treeAbstract
Applying the upcoming technologies in agriculture has been a major economic, environmental and social challenge for scientists and farmers. In order to overcome such challenge, this study evaluated the advantages and limitations of using geostatistics and machine learning for soil mapping in agricultural practices and soil surveys. The study occurred in Tocantins State, Brazil, and consisted into seven areas with a total extension of 17.24 km2, 222 meters regular gridded resulting in one-point sampling per 0.0493 km2 of five randomly sampled cores within a 1 m circle radius. It was collected 332 georeferenced soil samples at 0-20 cm depth using an auger and then, soil laboratory analyses performed. Afterward, liming rate maps were originated from the predicted soil attributes clay, cation exchange capacity and base saturation comparing four methods: ordinary kriging, random forest, cubist, support vector machine and the best model results of each soil attribute. Evaluating the methods, the Pearson’s index presented strong results for soil attributes predicted by random forest and ordinary kriging. Machine learning methods can be successfully applied for soil mapping in agricultural practices and soil surveys using less soil samples rather than geostatistical framework.
References
Adamchuk V, Lacroix R, Shinde S, Tremblay N & Huang H (2017) An uncertainty-based comprehensive decision support system for site-specific crop management. Advances in Animal Biosciences, 8:625-629.
Beguin J, Fuglstad G, Mansuy N & Paré D (2017) Predicting soil properties in the Canadian boreal forest with limited data: Comparison of spatial and non-spatial statistical approaches. Geoderma, 306:195-205.
Camargo AO, Moniz AC, Jorge JA & Valadares JMAS (2009) Métodos de Análise Química, Mineralógica e Física de Solos do Instituto Agronômico de Campinas. Campinas, IAC. 77p.
Castro-Franco M, Córdoba MA, Balzarini MG & Costa JL (2018) A pedometric technique to delimitate soil-specific zones at field scale. Geoderma, 322:101-111.
Cherubin MR, Santi AL, Eitelwein MT, Amado TJC, Simon DH & Damian JM (2015) Dimensão da malha amostral para caracterização da variabilidade espacial de fósforo e potássio em Latossolo Vermelho. Pesquisa Agropecuária Brasileira, 50:168-177.
Demattê J, Rizzo R & Botteon VW (2015) Pedological mapping through integration of digital terrain models spectral sensing and photopedology. Revista Ciência Agronômica, 46:669-678.
Demattê J, Fongaro CT, Rizzo R & Safanelli JL (2018) Geospatial Soil Sensing System (GEOS3): A powerful data mining procedure to retrieve soil spectral reflectance from satellite images. Remote Sensing of Environment, 212:161-175.
Dimitriadou E, Hornik K, Leisch F, Meyer D & Maintainer AW (2011) Package “e1071”. Available at: http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.191.317&rep=rep1&type=pdf. Accessed on: June 1st, 2019.
Donagemma G, Campos DVB, Calderano SB, Teixeira WG & Viana JHM (2011) Manual de Métodos de Análise de Solo. 2a ed. Rio de Janeiro, Embrapa Solos. 230p.
Dowd P (1991) A review of recent developments in geostatistics. Computers & Geosciences, 17:1481-1500.
Flaxman AD, Vahdatpour A, Green S, James SL & Murray CJ (2011) Random forests for verbal autopsy analysis: multisite validation study using clinical diagnostic gold standards. Population Health Metrics, 9:01-29.
Fongaro C, Demattê J, Rizzo R, Safanelli J, Mendes W, Dotto A & Ustin S (2018) Improvement of Clay and Sand Quantification Based on a Novel Approach with a Focus on Multispectral Satellite Images. Remote Sensing, 10:01-21.
Gallo B, Demattê J, Rizzo R, Safanelli J, Mendes W, Lepsch I & Lacerda MPC (2018) Multi-Temporal Satellite Images on Topsoil Attribute Quantification and the Relationship with Soil Classes and Geology. Remote Sensing, 10:1571.
Heung B, Bulmer CE & Schmidt MG (2014) Predictive soil parent material mapping at a regional-scale: A Random Forest approach. Geoderma, 214-215:141-154.
Hiemstra PH, Pebesma EJ, Twenhöfel CJW & Heuvelink GBM (2009) Real-time automatic interpolation of ambient gamma dose rates from the Dutch radioactivity monitoring network. Computers & Geosciences, 35:1711-1721.
Krige DG (1951) A statistical approach to some basic mine valuation problems on the witwatersrand. Journal of the Chemical Metallurgical and Mining Society of South Africa, 52:119-139.
Kuhn M (2008) Building Predictive Models in R Using the caret Package. Journal of Statistical Software, 28:01-26.
McBratney AB, Mendonça Santos ML & Minasny B (2003) On digital soil mapping.Geoderma, 117:03-52.
Molin JP, Amaral LR & Colaço AF (2015) Agricultura de precisão. São Paulo, Oficina de Textos. 224p.
Mueller TG, Pusuluri NB, Mathias K, Cornelius PL, Barnhisel RI & Shearer SA (2004) Map Quality for Ordinary Kriging and Inverse Distance Weighted Interpolation. Soil Science Society of America Journal, 68:2042.
Nanni MR, Povh FP, Demattê J, Oliveira RB, Chicati ML & Cezar E (2011) Optimum size in grid soil sampling for variable rate application in site-specific management. Scientia Agricola, 68:386-392.
Nussbaum M, Walthert L, Fraefel M, Greiner L & Papritz A (2017) Mapping of soil properties at high resolution in Switzerland using boosted geoadditive models. Soil, 3:191-210.
QGIS Development Team (2018) QGIS geographic information system. Open source geospatial foundation project. Available at: http://www.qgis.org/. Accessed on: November 20th, 2018.
Quinlan JR, John R & Ross J (1993) C4.5: programs for machine learning. San Francisco, Morgan Kaufmann Publishers. 302p.
R Development Core Team (2019) Language and Environment for Statistical Computing. R Foundation for Statistical Computing. Available at: https://www.r-project.org/. Accessed on: June 1st, 2019.
Raij B (1983) Avaliação da fertilidade do solo. 2a ed. Piracicaba, Instituto da Potassa & Fosfato. 142p.
Rodrigo-Comino J, Martinéz-Hernández C, Iserloh T & Cerdà A (2018) Contrasted Impact of Land Abandonment on Soil Erosion in Mediterranean Agriculture Fields. Pedosphere, 28:617-631.
Shannon DK, Clay DE & Sudduth KA (2018) An Introduction to Precision Agriculture. In: Shannon DK, Clay DE & Kitchen NR (Eds.) Precision Agriculture Basics. Madison, ASA / CSSA / SSSA. p.01-12.
Vapnik VN (2000) The Nature of Statistical Learning Theory. New York, Springer New York. 201p.
Vasava HB, Gupta A, Arora R & Das BS (2019) Assessment of soil texture from spectral reflectance data of bulk soil samples and their dry-sieved aggregate size fractions. Geoderma, 337:914- 926.
Wackernagel H (2014) Geostatistics. In: Wiley StatsRef: Statistics Reference Online. Chichester, John Wiley & Sons. p.01-10.
Willmott CJ, Robeson SM & Matsuura K (2012) A refined index of model performance. International Journal of Climatology, 32:2088-2094.
