• Aucun résultat trouvé

8.3 Top-quark charge discrimination using neural networks

8.3.2 Observables used in the neural network

The previously mentioned measurements of top-quark charge relied on using theb-jet charge as the proxy. Nevertheless, the hadronically-decayingWboson also carries extra charge information and its charge of±1 is three times larger in magnitude than that of thebquark. Previous measurements have not exploited this option, presumably because the correct matching of light-quark jets to the corresponding hadronically-decayingW-boson is challenging. However, for boosted top-quark decays, the decay

(5)The jet momentum vector is sometimes replaced with a unit vector in the momentum direction. Given the normalisation term in the denominator, both such definitions are equivalent.

8. Prospect of charge asymmetry measurement in boosted all-hadronictt¯events

products are highly-collimated and can be clustered within a single large-Rjet. The identification of tracks from individual quarks contained within the large-Rjet is attempted using anti-kt track jets with R =0.2, ghost-matched with the large-Rjet. Since top quark has three decay products, in an ideal case, it is expected that there should be three track jets within the large-Rjet, one of which isb-tagged.

Therefore the charges of the leading-pTb-tagged track jet and two leading-pT non-b-tagged track jets are considered as inputs for the NN.

The choice of using three track jets, out of which one is b-tagged, is motivated by Fig. 8.1, showing the true origin of the track jets based on the matching with partonic decay products of the top (anti-)quark. Theb-tagged track jet in approximately 90 % of the cases originates from ab-quark, and most of the background jets are mistagged charm jets. The two highest-pTnon-b-tagged track jets are most frequently matched to non-b-quarks from the top (anti-)quark decay, with the sub-leading non-b-jet not being matched to any of the decay products in approximately 22 %. Including more track jets as inputs is discouraged based on Fig.8.1d, which shows that in more than 60 % of cases, the 3rd-leading-pTnon-b-tagged track jet is not matched to any of the decay products of a top (anti-)quark.

¯b ¯c ¯s ¯d ¯u ? u d s c b Leading track jet matched parton 0.05 Subleading track jet matched parton 0.05 3rd-leadingpTtrack jet matched parton 0.1

Fig. 8.1: The classification of the origin ofb-tagged track jet (a), the leading (b), 2nd-leading (c) and 3rd-leading (d) non-b-tagged track jet matched to a candidate large-Rjet. The classification is based on matching a decay product from hadronically decaying top (anti-)quark with the track jet based on

∆R(parton,track jet) <0.2. The middle column in the plots, labelled as "?", marks track jets which were not matched to any of the decay products.

The JVC discriminant shows that exploiting information from displaced vertices of aB-hadron decay can provide additional discrimination power. The charges of secondary and tertiary vertices reconstructed by JetFitter forb-tagged track jets and relevant auxiliary variables are thus also considered for inclusion as NN inputs. The charge of the vertices reconstructed by the SV inclusive vertex-finding

8.3. Top-quark charge discrimination using neural networks

algorithm described in Sec.4.4.5is also considered. The aim is to examine whether there is additional information not extracted by JetFitter observables, that can be exploited for further discrimination of the charge of theB-hadrons initiating theb-tagged track-jets contained within the large-Rjet candidates.

The JVC discriminant itself is not used as an input for the NN, primarily because it was optimised on an inclusive sample oftt¯events. Such a sample typically does not contain sufficient amount of generated evens for outlier regions of phase space, such as the high-pT region examined in the all-hadronic channel. It is argued here, that adding some of the input observables directly into the NN trained in this study is more preferable, than nesting the JVC NN output as an input into another NN.

Table 8.4: List of observables of the track-jets matched to a large-Rjet, which are considered as the NN input observables.

Variable(s) Description

b-tagged track jet variables Qb Charge of theb-tagged track jet

pTb pTof theb-tagged track jet

Ntrkb Track multiplicity of theb-tagged track jet QSV Charge of the SV vertex of theb-tagged track jet NtrkSV Track multiplicity of the SV vertex

LSV3D Distance of the SV vertex from primary vertex

∆LSV3D Uncertainty on the distance

QJFS Charge of the JetFitter secondary vertex

NtrkJFS Track multiplicity of the JetFitter secondary vertex

LJFS3D Distance of the JetFitter secondary vertex from primary vertex

∆LJFS3D Uncertainty on the distance

QJFT Charge of the JetFitter tertiary vertex

NtrkJFT Track multiplicity of the JetFitter tertiary vertex

LJFT3D Distance of the JetFitter tertiary vertex from primary vertex

∆LJFT3D Uncertainty on the distance

Non-b-tagged track jet variables

Q1,Q2 Charge of the leading and sub-leading non-b-tagged track jet pT1,p2T pTof the leading and sub-leading non-b-tagged track jet

Ntrk1 ,Ntrk2 Track multiplicity of the leading and sub-leading non-b-tagged track jet Binary-decision variables

d1 Whether the large-Rjet has at least one non-b-tagged track jet d2 Whether the large-Rjet has at least two non-b-tagged track jets dJFS Whether JetFitter secondary vertex is reconstructed

dJFT Whether JetFitter tertiary vertex is reconstructed dSV Whether SV vertex is reconstructed

The full list of observables investigated for the NN is listed in Table8.4. Since not all of the observables are always defined, the binary-decision variablesd1,d2, dJFS,dJFTanddSV are added, which have a value of zero if the respective object is undefined, or one, if it is defined. The fraction

8. Prospect of charge asymmetry measurement in boosted all-hadronictt¯events

of large-Rjets in the sample, where the variables are defined, is listed in Table8.5. For undefined objects, the respective variables such aspT, track multiplicity or vertex position are set to zero. The corresponding charges are also set to zero, thus giving no discrimination for the class of the considered large-Rjet. This is an alternative, simplified approach, where it is possible to train a single NN with inputs that are not always defined, in contrast to the approach employed for the JVC, where for each combination of defined variables, a separate NN was trained.

For the charge observables, the method of how the charges of the tracks should be weighted, and the value of κmust be chosen. The definitions of jet charge which are considered, are defined in Eq.8.1and8.3. The figure of merit in the optimisation is the separation power of the charge variables.

The κvalue is optimised separately for the individual charge observables. It is found that the two charge definitions give very similar separation for equalκvalues, yielding no preference of one over the other. As such, the definition in Eq.8.3is chosen. The optimised values ofκfrom a scan in the range ofκ = 0.1 up to 1.0 are shown in Table8.6. In all cases a maximum in the scanned interval was found, corresponding to separation power quoted in the table. For illustration, theQbandQ1jet charge variables using the respective optimisedκvalues are shown in Fig.8.2. All of the distributions of the NN input variables are shown in App.D.1.

−1.0 −0.5 0.0 0.5 1.0 Qb 10000

20000 30000 40000 50000 60000 70000 80000 90000

Events Large-R Jet (top)

Large-R Jet (anti-top)

(a)

−1.0 −0.5 0.0 0.5 1.0 Q1 10000

20000 30000 40000 50000 60000 70000 80000

Events Large-R Jet (top)

Large-R Jet (anti-top)

(b)

Fig. 8.2: Distribution of theQb(a) andQ1(b) track jet charges for large-Rjets originating from top quark (red line) and top anti-quark (blue line). The spikes at values±1 correspond to track jets, where all tracks have the same charge. Most frequently, these track jets consist of two tracks, which is the minimum track multiplicity requirement.

Table 8.5: Fraction of large-Rjets for which respective features are defined, denoted by the binary-decision variables listed in the table.

Feature d1 d2 dJFS dJFT dSV [%] of cases defined 95 59 95 54 95

Except for the charge observables in Table8.6, the other variables areauxiliaryin the sense that they do not have discrimination power itself, but may provide additional information to the NN through correlations. Of particular interest are observables which have different correlations for each of the two classes of large-Rjets, because such observables bring additional discrimination power that the NN can exploit. The correlations and anti-correlations of track multiplicities with charge are an example of this property, as shown in Fig.8.4and8.5, motivating why the track multiplicity variables are included

8.3. Top-quark charge discrimination using neural networks

Table 8.6: Optimised values ofκparameter for the charge definition of the individual charge observables.

The separation powerSfor the optimal value ofκis also shown, based on the definition in Sec.7.3.1.

Variable Qb Q1 Q2 QSV QJFS QJFT

κ 0.6 0.4 0.4 0.4 0.5 0.5

S[%] 8.8 15.2 9.2 9.6 5.1 1.4

as the NN inputs. The distances of vertices from primary vertex and corresponding fitted distance errors are tested as NN input observables since they could be expected to provide some metric of the quality of the reconstructed vertex. Similarly the pT of the individual jets are tested as NN inputs under the assumption that the kinematic information may be relevant since the tracking performance is expected to deteriorate with highpT. Whether these assumptions transfer into a better discrimination of the trained NN is not clear, therefore a test needs to be designed to assess the gain of inclusion of these observables in the NN. The approach used in this study is to train multiple NN setups with different choices of input observables, and compare the performance of these networks. First, the NN architecture and training is discussed in the following section, and subsequently the NN setups are compared in Sec.8.3.4.