• Aucun résultat trouvé

Usefulness of sensitivity analysis for approximate bayesian computation

N/A
N/A
Protected

Academic year: 2022

Partager "Usefulness of sensitivity analysis for approximate bayesian computation"

Copied!
16
0
0

Texte intégral

(1)

HAL Id: hal-02805369

https://hal.inrae.fr/hal-02805369

Submitted on 6 Jun 2020

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Usefulness of sensitivity analysis for approximate bayesian computation

Olivier Martin, Claude Bruchou, Loic Pagès

To cite this version:

Olivier Martin, Claude Bruchou, Loic Pagès. Usefulness of sensitivity analysis for approximate bayesian computation. 7. International Conference on Sensitivity Analysis of Model Output, Jul 2013, Nice, France. 2013. �hal-02805369�

(2)

Usefullness of Sensitivity Analysis for Approximate Bayesian Computation

O. Martin?, C. Bruchou?, L. Pagès

? INRA, Avignon, Biosp INRA, Avignon, PSH

Work supported by ANR Sim-Traces and ANR Emile

(3)

Overview

1 Review of ABC concepts

2 The root system model

3 Sensitivity Analysis for statistics

4 Sensitivity Analysis for MSE criterion

5 Conclusion and discussion

(4)

1. ABC concepts

• Approximate Bayesian Computing (ABC) is a free likelihood method to estimate model parameters

• Definition of statistics (or descriptors)

• Fast computing model

Notations:

Observed data D and simulated dataD? θ is the vector of parameters with Prior π(.)

s(.): function that computes a set of statistics (descriptors) S =s(D) vector of statistics for data D

S? =s(D?) vector of statistics for data D?

(5)

1. ABC: a free likelihood method

Algorithm (Accept/Reject)

0: Suppose we have observed dataD andS =s(D) 1: Generateθ? fromπ(.)

2: GenerateD? from f(.|θ?) 3: Compute statisticsS? for D?

4: Acceptθ? ifdW(S,S?)≤and return to (1)

Prior Simulation Joint Ditribution Posterior

π(θ) D??,S?) π(θ|dW(S,S?)≤)

θ θ*

−100 −50 0 50 100 150

−400−300−200−1000

seg$X1

−seg$Z1

S(D)

θ

0.02

0.02

0.04 0.06

0.08

0.1

0.12

0.14

θ

(6)

1. ABC: a free likelihood method

This algorithm gives an approximation ofπ(θ|D).

Two important points for the approximation:

• The threshold:

smaller betterapproximation

D? is summarised by the statistics S?:

betterstatistics → betterapproximation

(7)

2. The root system model

Complexity of plant root system:

Functionning is linked to the dynamics of the architecture. Water and nutriment uptake depend on the root surface..

Plant root system modelling:

Integration of knowledge and test of new hypotheses Summarize data into a low number of key values

The stochastic model:

Number of parameters: 14

Output of the model: image of root system

(8)

3. The root system model

4 parameters over 14 are estimated with images.

15 statisticsare computed: size and shape of the root system, density of pixels in different areas, ...

(9)

3. Sensitivity analysis of statistics

• Can parameters be estimated with the statistics ?

• Anova: 4 factors with 5 levels, interaction of order 3

Gray: Principal Black: Interaction

F1 F2 F3 F4

(%) 020406080100

S2 , R2 = 100

F1 F2 F3 F4

(%) 020406080100

S13 , R2 = 70

• About 8-10 statistics over the 15 seem to be sufficient to estimate parameters

(10)

4. Sensitivity analysis of MSE

• Find the best weights W of dW to minimize MSE criterion ?

• Point estimate: θˆ=Mean{θ?:dW(S,S?)≤}with

dW2 (S,S?) =

NS=15

X

i=1

wi(SiSi?)2 andwi >0,

NS

X

i=1

wi =1.

• Criterion to evaluate point estimateθ:ˆ

MSEθ(W) =

Nθ=4

X

k=1

θ(k)θ(k))2 σ2

θ(k)

(11)

4. Sensitivity analysis of MSE

• Generate uniformly aR-sample of weights Wr,r =1, ...,R with Wr = (w1r, ...,wNr

S) andPNi=1S wir =1

• Generate aN-sample θl,l =1, ...,N fromπ(θ).

• For eachθl,l =1, ...,N

- ComputeMSEθl(Wr), r=1,...,R - Fit a canonical polynomial of degree 2:

MSEθl(W) =Pl(W) +e,l =1, ...,N withPl(W) =

NS

X

i=1

δiiwi2+

NS

X

i=1 NS

X

i<j

δijwiwj

- Sensitivity indices by comparing nested polynomials models.

(12)

4. Sensitivity analysis of MSE

Sensitivity indices Minimum weights

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15

051015202530

w1 w2 w3 w4 w5 w6 w7 w8 w9 w11 w13 w15

0.00.10.20.30.4

(13)

4. Sensitivity analysis of MSE

●●

●●●

● ●

0.0 0.5 1.0 1.5 2.0 2.5

0.00.51.01.52.02.5

MSE min (Non Unif Weights)

MSE (Unif Weights) 0.000.100.200.300.000.100.200.300.000.100.200.30

w1 w2 w3 w4 w5 w6 w7 w8 w9 w10 w11 w12 w13 w14 w15

0.000.100.200.30

(14)

5. Conclusion

Conclusion

• Difficult to find an optimal distance (for allθ)

• Interaction between weights associated to statistics

• ABC with three steps:

1 Pilot ABC (→first approximation θ)˜

2 Determine optimal weights associated toθ˜

3 ABC with the optimal weights (→second approximationθ)ˆ

Future work

• Optimal weights determined by global optimum of PW

• Study based on the expectations of the statistics (rather one observation)

(15)

References

Beaumont, M., Zhang, W., Balding, D.J. (2002). Approximate bayesian computation in population genetics. Genetics.

Cornell, J. (2002). Experiments with mixtures. J. Wiley and sons, N. Y., 3rd edition.

Joyce, P., Marjoram, P. (2008). Approximately sufficient statistics and bayesian computation. Stat. Appl. Genet. Mol. Biol.

Pagès, L. (2011). Links between root developmental traits and foraging performance. Plant, Cell and Environment.

(16)

Factor F3 and F4:

F1 F2 F3 F4

(%) 020406080100

S3 , R2 = 100

F1 F2 F3 F4

(%) 020406080100

S4 , R2 = 100

F1 F2 F3 F4

(%) 020406080100

S5 , R2 = 100

F1 F2 F3 F4

(%) 020406080100

S6 , R2 = 100

F1 F2 F3 F4

(%) 020406080100

S14 , R2 = 100

F1 F2 F3 F4

(%) 020406080100

S15 , R2 = 90

Références

Documents relatifs

Using the Fo¨rster formulation of FRET and combining the PM3 calculations of the dipole moments of the aromatic portions of the chromophores, docking process via BiGGER software,

Later in the clinical course, measurement of newly established indicators of erythropoiesis disclosed in 2 patients with persistent anemia inappropriately low

Households’ livelihood and adaptive capacity in peri- urban interfaces : A case study on the city of Khon Kaen, Thailand.. Architecture,

3 Assez logiquement, cette double caractéristique se retrouve également chez la plupart des hommes peuplant la maison d’arrêt étudiée. 111-113) qu’est la surreprésentation

Et si, d’un côté, nous avons chez Carrier des contes plus construits, plus « littéraires », tendant vers la nouvelle (le titre le dit : jolis deuils. L’auteur fait d’une

la RCP n’est pas forcément utilisée comme elle devrait l’être, c'est-à-dire un lieu de coordination et de concertation mais elle peut être utilisée par certains comme un lieu

The change of sound attenuation at the separation line between the two zones, to- gether with the sound attenuation slopes, are equally well predicted by the room-acoustic diffusion

Si certains travaux ont abordé le sujet des dermatoses à l’officine sur un plan théorique, aucun n’a concerné, à notre connaissance, les demandes d’avis