• Aucun résultat trouvé

Predicting soil depth using survival analysis models

N/A
N/A
Protected

Academic year: 2021

Partager "Predicting soil depth using survival analysis models"

Copied!
2
0
0

Texte intégral

(1)

HAL Id: hal-02794445

https://hal.inrae.fr/hal-02794445

Submitted on 5 Jun 2020

HAL is a multi-disciplinary open access

archive for the deposit and dissemination of sci-entific research documents, whether they are pub-lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Predicting soil depth using survival analysis models

Quentin Styc, Philippe Lagacherie

To cite this version:

Quentin Styc, Philippe Lagacherie. Predicting soil depth using survival analysis models. 7. Global Workshop on Digital Soil Mapping, Jun 2016, Aarhus, Denmark. �hal-02794445�

(2)

Updated Book of Abstract – online version 27 June 2016

Predicting soil depth using survival analysis models

Quentin Styc

1

and Philippe Lagacherie

2

1

INRA, UMR LISAH, Montpellier, France

Soil depth is a very important soil property for agronomists and environmental scientists and has been included as such in the GlobalSoilMap specifications (Arrouays et al, 2014). The Digital Soil Mapping applications that deal with soil depth are all confronted with the problem of the “right-censored” input soil data. This means that a significant proportion of the observed sites that serve as input data for Digital Soil Mapping models are characterized by a maximal depth of observation that is smaller than the real soil depth. The information on soil depth at such sites is not an exact value but an inequality (soil depth ≽ site observation depth).

Such situations are frequently encountered in survival analysis that aim to predict the expected duration of time until one or more events happen, such as death in biological organisms and failure in mechanical systems. This very active branch of statistics has produced a lot of statistical inference models for dealing with right-censored data. By doing an analogy between time at “depth” or “failure” and soil depth (“depth at soil end”), such models could be applied to the digital mapping of soil depth.

In this paper, we apply two survival analysis models, namely Cox Regression (Andersen and Gill, 1982) and Random Survival Forest (Ishwaran et al, 2008) for predicting soil depths in two study areas of southern France with contrasted extents and soil observation densities (Languedoc Roussillon Region,and Peyne Watershed). The soil depth prediction results provided by these two models are evaluated over independent validation sets and compared with classical DSM models using as input i) the sites with a known exact soil depth values only or ii) the whole set of sites with the replacement of the censored soil depth by a « common sense » estimation.

Andersen, P. and Gill, R. (1982). Cox's regression model for counting processes, a large sample study. Annals of Statistics 10, 1100-1120.

Arrouays, D., Grundy, M.G., Hartemink, A.E., Hempel, J.W., Heuvelink, G.B.M., Hong, S.Y., Lagacherie, P., Lelyk, G., McBratney, A.B., McKenzie, N.J., Mendonca Santos, M.L., Minasny, B., Montanarella, L., Odeh, I.O.A., Sanchez, P.A., Thompson, J.A., Zhang, G.-L., 2014. Chapter Three – GlobalSoilMap: Toward a Fine-Resolution Global Grid of Soil Properties, in: Advances in Agronomy. pp. 93–134.

Ishwaran, H., Kogalur, U.B., Blackstone, E.H., Lauer, M.S., 2008. Random survival forests. Ann. Appl. Stat. 2, 841–860

Références

Documents relatifs

To explain variation in decision-making speed I first focus on three factors from formal theory that affect the core, the win-set and the proportion of winning coalitions:

However, one can still use algebraic integration of the random effects in the case of a gamma frailty component in a Weibull

[ 17 ] used a two-step approach that combined survival analysis and a DGE–IGE linear mixed animal model for survival time because it was not possible to estimate the vari- ance

Here we show that differences in drought tolerance are associated with differences in ge- ographic distribution between the two species, but that the underlying differences

Two types of models were developed: a stand-level model to predict the degree of damage caused by a forest fire, and tree-level models to predict the probability of a tree to survive

As we said at the beginning of this section, different types of data can be considered, that we have denoted by A1 , A2 , B1 , B2 and B3. Note that the estimators of Π 0 and Π 1 in

bovis gene copy numbers over the time by the IS1561’- based qPCR assay on sterile soil A (solid grey bars), sterile soil B (gray stripe bars) and biotic soil A (solid black

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des