• Aucun résultat trouvé

Bayesian hierarchical reconstruction of protein profiles including a digestion model

N/A
N/A
Protected

Academic year: 2021

Partager "Bayesian hierarchical reconstruction of protein profiles including a digestion model"

Copied!
3
0
0

Texte intégral

(1)

HAL Id: hal-00672814

https://hal.archives-ouvertes.fr/hal-00672814

Submitted on 22 Feb 2012

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci-entific research documents, whether they are pub-lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Bayesian hierarchical reconstruction of protein profiles

including a digestion model

Pierre Grangeat, Pascal Szacherski, Laurent Gerfault, Jean-François

Giovannelli

To cite this version:

Pierre Grangeat, Pascal Szacherski, Laurent Gerfault, Jean-François Giovannelli. Bayesian hierarchi-cal reconstruction of protein profiles including a digestion model. 59th American Society for Mass Spectrometry Conference, Apr 2011, Dallas, France. TOA-350, page 34. �hal-00672814�

(2)

Bayesian hierarchical reconstruction of protein profiles including

a digestion model

Pierre GRANGEAT1, Pascal SZACHERSKI12, Laurent GERFAULT1, Jean-François GIOVANNELLI2

1

CEA, LETI, MINATEC Campus, DTBS, 17 rue des Martyrs, F-38054 Grenoble cedex 9, France.

2

Université de Bordeaux 1 – CNRS - IPB, IMS, 351 Cours de la Libération, F-33405, Talence cedex, France.

Introduction :

Mass spectrometry approaches are very attractive to detect protein panels in a sensitive and high speed way. MS can be coupled to many proteomic separation techniques. However, controlling technological variability on these analytical chains is a critical point. Adequate information processing is mandatory for data analysis to take into account the complexity of the analysed mixture, to improve the measurement reliability and to make the technology user friendly. Therefore we develop a hierarchical parametric probabilistic model of the LC-MS analytical chain including the technological variability. We introduce a Bayesian reconstruction methodology to recover the protein biomarkers content in a robust way. We will focus on the digestion step since it brings a major contribution to technological variability.

Method :

In this communication, we introduce a hierarchical model of the LC-MS analytical chain. Such a chain is a cascade of molecular events depicted by a graph structure, each node being associated to a molecular state such as protein, peptide and ion and each branch to a molecular processing such as digestion, ionisation and LC-MS separation. This molecular graph defines a hierarchical mixture model. We extend the Bayesian statistical framework we have introduced previously [1] to this hierarchical description. As an example, we will consider the digestion step. We describe the digestion process on a pair of peptides within the targeted protein as a Bernoulli random process associated with a cleavage probability controlled by the digestion kinetic law.

Preliminary data :

In order to produce simulated data, we consider the experimental framework described in [2] applied to human NSE combining immunoenrichment, trypsin digestion and LC-MS analysis on an Orbitrap. PSAQ quantification is performed spiking an isotopically-labeled version of NSE [3]. We define 5 amounts of unlabeled NSE from 0 to 400 ng/ml within 1 ml serum samples. We analyze 2 peptides assuming artificially that there exists a miscleavage delivering a molecule combining both peptides.

Technological variability is introduced by considering variation of 10% on LC retention time and 30% on MS ionisation gain. Variability on digestion is induced in particular by temperature and pH [4]. We model the digestion coefficient linking protein and peptides content as a constant yield weighted by the Bernoulli parameter p for the cleaved peptides and

(3)

parameter 1-p for the miscleaved molecule. We make this parameter varying between 0.4 and 1, but we assume its value to be known.

Two data processing strategies are studied. Firstly, the digestion variability is ignored and the protein content is estimated from the 2 cleaved peptides. Secondly, the miscleaved molecule is introduced and the digestion variability is monitored. Parameters are estimated within the Bayesian hierarchical reconstruction. 321 spectrograms are simulated in order to compare the performances of the two strategies. Coefficient of variation of estimated concentrations is computed for first (CV1) and second strategy (CV2).

The ratio CV2/CV1 is respectively 0.8, 0.7, 0.7, 0.2, 0.05 for concentrations of 0, 50, 100, 200 and 400 ng/ml, demonstrating a significant improvement for the largest concentrations.

[1] Strubel et al. (2007), 29th IEEE EMBS Conference, 5979-5982. [2] Grangeat et al. (2010), 58th ASMS Conference on Mass Spectrometry. [3] Brun et al. (2007), Mol. and Cell. Proteomics 6.12, 2139-2149.

[4] Finehout et al. (2005), Proteomics, 5, 2319-2321.

Novel aspect :

Bayesian hierarchical inversion, technological variability, digestion model, robust protein quantification.

Références

Documents relatifs

Il n'est pas rare que les éphémères confédérations tribales, sur ces territoires de nomadisme pastoral et forestier, comprenant des éléments appartenant aux grandes familles

Pour tracer la Sélection avec les formes géométriques, cliquez sur votre image et déplacez la souris en biais en tenant le clic gauche enfoncé puis relachez quand les dimensions

In the following, we consider the so-called generalized inverse Gaussian (GIG) distribution, denoted by GIG(ρ, φ, ω) with parameters ρ, φ and ω defined in [35] as hierarchical prior

101 - 54602 Villers lès Nancy Cedex France Unité de recherche INRIA Rennes : IRISA, Campus universitaire de Beaulieu - 35042 Rennes Cedex France Unité de recherche INRIA Rhône-Alpes

Subsequently, the full DNA sequence of the pgp-9 gene in Cylicocyclus elongatus was published and in this study, IVM was shown to inhibit pgp-9 mediated protection of yeast

The partial ecological response of the nine most important stressor descriptors revealed various patterns, including nonlinear relationships and thresholds shifts, whereas the

Dans le calcul des parois des fosses à purin et à lisier, on ne devrait pas compter sur les pressions horizontales du sol pour s’opposer aux pressions exercées par le contenu

Computed tomography reconstruction based on a hierarchical model and variational Bayesian method?. Li Wang, Ali Mohammad-Djafari, Nicolas Gac,