• Aucun résultat trouvé

A short introduction to Neural Likelihood-free Inference for Physics

N/A
N/A
Protected

Academic year: 2021

Partager "A short introduction to Neural Likelihood-free Inference for Physics"

Copied!
28
0
0

Texte intégral

(1)

A short introduction to

Neural Likelihood-free Inference

for Physics

AMLD 2020

January 28, Lausanne, Switzerland

Gilles Louppe

g.louppe@uliege.be

(2)

A typical science experiment

(3)

―――

(4)
(5)
(6)
(7)

SM with parameters

Simulated observables Real observations

Particle physics

θ

x

x

obs

(8)

p

(x∣θ) =

p

(z ∣θ)p(z ∣z )p(z ∣z )p(x∣z )dz dz dz

intractable!!

p s p d s d p s d

(9)

Ingredients

Statistical inference requires the computation of key ingredients, such as

the likelihood ,

the likelihood ratio ,

or the posterior .

In the simulator-based scenario, each of these ingredients can be approximated

with modern machine learning techniques, even if none are tractable during

training!

p

(x∣θ)

r

(x∣θ , θ ) =

0 1 p(x∣θ ) 1 p(x∣θ )0

p

(θ∣x)

8 / 23

(10)

C

ʀʟ

Supervised learning provides a way to automatically learn :

Let us consider a neural network classi er tasked to distinguish

labelled from labelled .

Train by minimizing the cross-entropy loss.

p

(x∣θ )/p(x∣θ )

0 1

s

^

x

i

∼ p(x∣θ )

0

y

i

= 0

x

i

∼ p(x∣θ )

1

y

i

= 1

s

^

―――

(11)

The solution found after training approximates the optimal classi er Therefore,

s

^

(x) ≈ s (x) =

.

s

^

p

(x∣θ ) + p(x∣θ )

0 1

p

(x∣θ )

1

r

(x∣θ , θ ) ≈ (x∣θ , θ ) =

0 1

r

^

0 1

.

(x)

s

^

1 − (x)

s

^

―――

(12)

―――

(13)

Supervised classi cation is equivalent to likelihood ratio estimation, therefore the whole Deep Learning toolbox can be used for inference!

―――

(14)

There is more...

―――

(15)

Bayesian inference = computing the posterior

x

θ

z

Bayesian inference

Doubly intractable in the likelihood-free scenario:

Cannot evaluate the likelihood .

Cannot evaluate the evidence .

p

(θ∣x) =

.

p

(x)

p

(x∣θ)p(θ)

p

(x∣θ) = p(x, z∣θ)dz

p

(x) = p(x∣θ)p(θ)dθ

14 / 23

(16)

Approximate Bayesian Computation (ABC)

Issues

How to choose ? ? ?

No tractable posterior.

Need to run new simulations for new data or new prior.

x

ϵ

∣∣ ⋅ ∣∣

―――

(17)

Amortizing Bayes

The Bayes rule can be rewritten as

where is the likelihood-to-evidence ratio.

The likelihood-to-evidence ratio can be learned with a neural network tasked to

distinguish from .

This enables direct and amortized posterior evaluation.

p

(θ∣x) =

= r(x∣θ)p(θ) ≈ (x∣θ)p(θ),

p

(x)

p

(x∣θ)p(θ)

r

^

r

(x∣θ) =

pp(x∣θ)(x)

x

∼ p(x∣θ)

x

∼ p(x)

―――

(18)

Showtime

(19)

① Hunting new physics at particle

colliders

The goal is to constrain two EFT parameters and compare against traditional histogram analysis.

―――

(20)

The number of dark matter subhalos and their mass and location lead to complex latent space of each image. The goal is the inference of population parameters

and .

② Dark matter substructure from gravitational lensing

β

f

sub

―――

(21)

―――

(22)

③ Constraining the WDM particle mass

Dark matter subhalos cause disturbances in the density of stellar streams.

Therefore, observations of stellar streams may be used to constrain the mass of the dark matter particle.

(23)
(24)

④ Fast parameter estimation for gravitational waves

(25)

Summary

Much of modern science is based on "likelihood-free" simulations.

The likelihood-ratio is central to many statistical inference procedures, regardless of your religion.

Supervised learning enables likelihood-ratio estimation.

Better likelihood-ratio estimates can be achieved by mining simulators.

(26)

Kyle Cranmer

Juan Pavez Johann

Brehmer HermansJoeri

Antoine Wehenkel

Arnaud

Delaunoy Siddarth Mishra-Sharma

Collaborators

(27)

References

Cranmer, K., Brehmer, J., & Louppe, G. (2019). The frontier of simulation-based inference. arXiv preprint arXiv:1911.01429.

Brehmer, J., Mishra-Sharma, S., Hermans, J., Louppe, G., Cranmer, K. (2019). Mining for Dark Matter Substructure: Inferring subhalo population properties from strong lenses with machine learning. arXiv preprint arXiv 1909.02005. Hermans, J., Begy, V., & Louppe, G. (2019). Likelihood-free MCMC with Approximate Likelihood Ratios. arXiv preprint arXiv:1903.04057.

Stoye, M., Brehmer, J., Louppe, G., Pavez, J., & Cranmer, K. (2018). Likelihood-free inference with an improved cross-entropy estimator. arXiv preprint arXiv:1808.00973.

Brehmer, J., Louppe, G., Pavez, J., & Cranmer, K. (2018). Mining gold from implicit models to improve likelihood-free inference. arXiv preprint arXiv:1805.12244.

Brehmer, J., Cranmer, K., Louppe, G., & Pavez, J. (2018). Constraining Effective Field Theories with Machine Learning. arXiv preprint arXiv:1805.00013.

Brehmer, J., Cranmer, K., Louppe, G., & Pavez, J. (2018). A Guide to Constraining Effective Field Theories with Machine Learning. arXiv preprint arXiv:1805.00020.

Cranmer, K., Pavez, J., & Louppe, G. (2015). Approximating likelihood ratios with calibrated discriminative classi ers. arXiv preprint arXiv:1506.02169.

(28)

The end.

Références

Documents relatifs

Hence, adjusting the profile score by subtracting its bias gives a fixed T unbiased estimating equation and, upon integration, an adjusted profile likelihood in the sense of Pace

‘Model choice using ap- proximate Bayesian computation and random forests: analyses based on model grouping to make inferences about the genetic history of Pygmy human popula-

The aim of the present analyses was (1) to assess dif- ferences in children’s HRQOL (KINDL-R questions) according to socio-demographic factors in a rural partly farming population

Next we prove some immediate numerical range analogues of the Perron–Frobenius theory for positive operators, which generalises results in [5,8] for the matrix case.. In the following

Comparison of 3 and 6 months of oral anticoagulant therapy after a first episode of proximal deep vein thrombosis or pulmonary embolism and comparison of 6 and 12 weeks of

In this approach, we use all existing forward simulations to learn a conditional density distribution of the data 1 d given the parameters θ, using a density estimation

On the other hand, restricted maximum likelihood estimates (of variance components) correspond to the mode of the posterior distribution of the variance components after integration

A la difference des lunules grecques, ces amulettes sont parfois ornees d'un phallus, comme le pendentif en bronze decouvert a Arras dans la tombe d'un enfant d'une