Constraining Effective Field
Theories with Machine Learning
ATLAS ML workshop, October 15-17 2018
Gilles Louppe
g.louppe@uliege.be
with Johann Brehmer, Kyle Cranmer and Juan Pavez.
The probability of ending in bin $x$ corresponds to the total probability of all the paths $z$ from start to $x$:

$$p(x \mid \theta) = \int p(x, z \mid \theta)\, dz = \binom{n}{x}\, \theta^x (1 - \theta)^{n - x}$$
The probability of ending in bin $x$ still corresponds to the total probability of all the paths $z$ from start to $x$:

$$p(x \mid \theta) = \int p(x, z \mid \theta)\, dz$$

But this integral can no longer be simplified analytically! As $n$ grows larger, evaluating $p(x \mid \theta)$ becomes intractable, since the number of paths grows combinatorially. Generating observations, however, remains easy: just drop balls.
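The contrast above is easy to check numerically: dropping many balls gives an empirical $p(x \mid \theta)$ that matches the closed-form binomial for a small board. A minimal sketch (the variable names are illustrative, not from the talk):

```python
import math
import numpy as np

rng = np.random.default_rng(0)
theta, n, n_balls = 0.4, 10, 100_000

# Latent traces z: at each of the n pegs the ball goes right with prob. theta.
z = rng.random((n_balls, n)) < theta
x = z.sum(axis=1)                      # observable: final bin index

# Empirical p(x | theta) from dropping balls...
empirical = np.bincount(x, minlength=n + 1) / n_balls
# ...matches the analytic Binomial(n, theta) pmf.
analytic = np.array([math.comb(n, k) * theta**k * (1 - theta)**(n - k)
                     for k in range(n + 1)])
print(np.abs(empirical - analytic).max())  # small sampling noise only
```

For a large board, only the sampling side of this sketch survives: the analytic pmf is replaced by an intractable sum over paths.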
The Galton board is a metaphor for the simulator-based scientific method: the Galton board device is the equivalent of the scientific simulator.

$\theta$ are the parameters of interest;
$z$ are stochastic execution traces through the simulator;
$x$ are the observables.

Inference in this context requires likelihood-free algorithms.
―――
Likelihood-free inference methods

Treat the simulator as a black box:
Histograms of observables, Approximate Bayesian Computation. Rely on summary statistics.

Machine learning algorithms:
Density estimation, Cᴀʀʟ, autoregressive models, normalizing flows, etc.

Use latent structure:
Matrix Element Method, Optimal Observables, Shower deconstruction, Event Deconstruction. Neglect or approximate shower + detector, explicitly calculate the integral.

*new* Mining gold from the simulator:
Leverage matrix-element information + machine learning.
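The Cᴀʀʟ idea from the list above can be sketched in a few lines: a classifier trained to separate samples drawn at $\theta_0$ from samples drawn at $\theta_1$ recovers the likelihood ratio through $s/(1-s)$. A toy 1D sketch with Gaussian stand-ins for the simulator (the distributions and learning-rate choices are illustrative, not from the talk):

```python
import numpy as np

rng = np.random.default_rng(0)

# Samples from the "simulator" at two parameter points.
x0 = rng.normal(0.0, 1.0, 20_000)   # drawn at theta_0
x1 = rng.normal(1.0, 1.0, 20_000)   # drawn at theta_1
x = np.concatenate([x0, x1])
y = np.concatenate([np.zeros_like(x0), np.ones_like(x1)])

# Logistic regression s(x) = sigmoid(w*x + b), fitted by gradient descent.
w, b = 0.0, 0.0
for _ in range(2000):
    s = 1.0 / (1.0 + np.exp(-(w * x + b)))
    grad = s - y
    w -= 0.1 * np.mean(grad * x)
    b -= 0.1 * np.mean(grad)

# The optimal classifier satisfies s/(1-s) = p(x|theta_1)/p(x|theta_0),
# so log r(x | theta_0, theta_1) ~ -(w*x + b). For these two Gaussians the
# exact log ratio is 0.5 - x, i.e. w ~ 1.0 and b ~ -0.5.
print(w, b)
```

This is the classifier-based route; the "mining gold" methods below instead pull extra ratio and score information out of the simulator itself.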
Mining gold from simulators

$p(x \mid \theta)$ is usually intractable. What about $p(x, z \mid \theta)$? This can be computed as the ball falls down the board!

$$\begin{aligned}
p(x, z \mid \theta) &= p(z_1 \mid \theta)\, p(z_2 \mid z_1, \theta) \cdots p(z_T \mid z_{<T}, \theta)\, p(x \mid z_{\leq T}, \theta) \\
&= p(z_1 \mid \theta)\, p(z_2 \mid \theta) \cdots p(z_T \mid \theta)\, p(x \mid z_T) \\
&= p(x \mid z_T) \prod_t \theta^{z_t} (1 - \theta)^{1 - z_t}
\end{aligned}$$

As the trajectory $z = z_1, \ldots, z_T$ and the observable $x$ are emitted, it is often possible:

to calculate the joint likelihood $p(x, z \mid \theta)$;
to calculate the joint likelihood ratio $r(x, z \mid \theta_0, \theta_1)$;
to calculate the joint score $t(x, z \mid \theta_0) = \nabla_\theta \log p(x, z \mid \theta)\,\big|_{\theta_0}$ (Rᴀsᴄᴀʟ).

We call this process mining gold from your simulator!
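For the Galton board, the mined quantities have closed forms along a trace: the $\theta$-independent factor $p(x \mid z_T)$ cancels in the ratio and drops out of the score. A toy sketch (the helper names are illustrative):

```python
import numpy as np

def joint_log_likelihood(z, theta):
    """log p(x, z | theta) up to the theta-independent term log p(x | z_T):
    each step goes right (z_t = 1) with probability theta."""
    z = np.asarray(z, dtype=float)
    return np.sum(z * np.log(theta) + (1 - z) * np.log(1 - theta))

def joint_log_ratio(z, theta0, theta1):
    """log r(x, z | theta0, theta1); p(x | z_T) cancels exactly."""
    return joint_log_likelihood(z, theta0) - joint_log_likelihood(z, theta1)

def joint_score(z, theta0):
    """t(x, z | theta0) = d/dtheta log p(x, z | theta) at theta0."""
    z = np.asarray(z, dtype=float)
    return np.sum(z / theta0 - (1 - z) / (1 - theta0))

z = [1, 0, 1, 1, 0]                      # one stochastic trace
print(joint_log_ratio(z, 0.5, 0.4))
print(joint_score(z, 0.5))               # 3/0.5 - 2/0.5 = 2.0
```

Both quantities are computed per simulated event, with no integral over paths.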
―――
Constraining Effective Field Theories, effectively

―――

LHC processes

$$p(x \mid \theta) = \underbrace{\iiint p(z_p \mid \theta)\, p(z_s \mid z_p)\, p(z_d \mid z_s)\, p(x \mid z_d)\; dz_p\, dz_s\, dz_d}_{\text{intractable}}$$

Key insights:
The distribution of parton-level momenta

$$p(z_p \mid \theta) = \frac{1}{\sigma(\theta)} \frac{d\sigma(\theta)}{dz_p},$$

where $\sigma(\theta)$ and $d\sigma(\theta)$ are the total and differential cross sections, is tractable.

Downstream processes $p(z_s \mid z_p)$, $p(z_d \mid z_s)$, and $p(x \mid z_d)$ do not depend on $\theta$.

$\Rightarrow$ This implies that both $r(x, z \mid \theta_0, \theta_1)$ and $t(x, z \mid \theta_0)$ can be mined. E.g.,

$$r(x, z \mid \theta_0, \theta_1) = \frac{p(z_p \mid \theta_0)\, p(z_s \mid z_p)\, p(z_d \mid z_s)\, p(x \mid z_d)}{p(z_p \mid \theta_1)\, p(z_s \mid z_p)\, p(z_d \mid z_s)\, p(x \mid z_d)} = \frac{p(z_p \mid \theta_0)}{p(z_p \mid \theta_1)}$$
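Because every $\theta$-independent factor cancels, the joint ratio and score can be computed from the parton-level density alone. A toy sketch with a Gaussian standing in for $(1/\sigma)\, d\sigma/dz_p$ (the parametrization is illustrative, not the talk's):

```python
import numpy as np

def parton_density(z_p, theta):
    """Toy stand-in for p(z_p | theta) = (1/sigma) dsigma/dz_p:
    a Gaussian whose mean shifts with the theory parameter theta."""
    return np.exp(-0.5 * (z_p - theta) ** 2) / np.sqrt(2 * np.pi)

def joint_ratio(z_p, theta0, theta1):
    """r(x, z | theta0, theta1): the shower, detector and observable
    factors cancel, leaving the tractable parton-level ratio."""
    return parton_density(z_p, theta0) / parton_density(z_p, theta1)

def joint_score(z_p, theta0, eps=1e-6):
    """t(x, z | theta0) via a central finite difference of the log density."""
    return (np.log(parton_density(z_p, theta0 + eps))
            - np.log(parton_density(z_p, theta0 - eps))) / (2 * eps)

z_p = 0.3
print(joint_ratio(z_p, 0.0, 1.0))
print(joint_score(z_p, 0.0))   # analytic score here is (z_p - theta0) = 0.3
```

In the real analysis, `parton_density` is supplied by the squared matrix element and the cross section rather than a toy Gaussian.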
Proof of concept

Higgs production in weak boson fusion.

Goal: constraints on two theory parameters:

$$\mathcal{L} = \mathcal{L}_{\mathrm{SM}} + \frac{i g\, f_W}{2\Lambda^2}\, (D_\mu \phi)^\dagger \sigma^a (D_\nu \phi)\, W^{\mu\nu\,a} - \frac{g^2 f_{WW}}{4\Lambda^2}\, (\phi^\dagger \phi)\, W^a_{\mu\nu} W^{\mu\nu\,a}$$

―――
Increased data efficiency
―――
Stronger bounds
―――
Summary

Many LHC analyses (and much of modern science) are based on "likelihood-free" simulations.

New inference algorithms:
Leverage more information from the simulator.
Combine with the power of machine learning.

First application to LHC physics: stronger EFT constraints with fewer simulations.
Collaborators
Johann Brehmer, Kyle Cranmer and Juan Pavez
References
Stoye, M., Brehmer, J., Louppe, G., Pavez, J., & Cranmer, K. (2018). Likelihood-free inference with an improved cross-entropy estimator. arXiv preprint arXiv:1808.00973.
Brehmer, J., Louppe, G., Pavez, J., & Cranmer, K. (2018). Mining gold from implicit models to improve likelihood-free inference. arXiv preprint
arXiv:1805.12244.
Brehmer, J., Cranmer, K., Louppe, G., & Pavez, J. (2018). Constraining Effective Field Theories with Machine Learning. arXiv preprint arXiv:1805.00013.
Brehmer, J., Cranmer, K., Louppe, G., & Pavez, J. (2018). A Guide to Constraining Effective Field Theories with Machine Learning. arXiv preprint
arXiv:1805.00020.
Cranmer, K., Pavez, J., & Louppe, G. (2015). Approximating likelihood ratios with calibrated discriminative classifiers. arXiv preprint arXiv:1506.02169.
The end.