• Aucun résultat trouvé

Network analysis in response to calorie restriction

N/A
N/A
Protected

Academic year: 2022

Partager "Network analysis in response to calorie restriction"

Copied!
35
0
0

Texte intégral

(1)

Network analysis in response to calorie restriction

Émilie Montastier, Nathalie Villa-Vialaneix et de nombreux co-auteurs !

INSERM, Obesity Research Laboratory IM2C

& INRA, Unité MIA-T

Journée Régionale GenoToul BioInfo & BioStats

(2)

Maintenance of weight loss: an obstacle in successful treatment of obese individuals

treatment of obese individuals

Weight Weight

- +

Weight

gain Weight

loss Energy

expenditure

Intake

Overweight /obesity Medical treatment of obesity: 80%

Overweight /obesity Medical treatment of obesity: 80% 

failled after one year  (Wing RR, Am J Clin Nutr,  2005) : it doesn’t work!

Surplus energy storage in adipose tissue

2005) : it doesn t work!

p

(3)

Weight follow‐up after energy restriction induced weight loss

Effect of glycemic index and protein content EU project 8 centres 450 families

Follow-up EU project, 8 centres, 450 families

Restriction 800 kcal/d

Modifast®

Follow up

Ad libitum, 5 dietary branches:

Low/high GI Low/high protein

Control

CID2

Randomization

CID3

Control

CID1 8 weeks 6 months

(>- 8% weight loss)

CID: « Clinical Intervention Day » CID: « Clinical Intervention Day »

•Anthropometry

•Blood and urine sampling

•Adipose tissue biopsies

(4)

3 types of data in 135 women

Ad libitum

L /hi h GI

LCD

CID2 CID3

Low/high GI Low/high protein

Control

CID1

8 weeks 6 months

Follow-up

Bio‐clinical data:

Body composition, RMR

CID2

(>- 8% weight loss) CID1 CID3

MetS, insulin sensitivity Plasma measurements

Adipose tissue fatty acid composition

221 genes: 

Metabolism:

Glycolysis Citrate cycle Citrate cycle

Lipogenesis, fatty acid transport

Immunity: chemokines, receptors,

complement

(5)

Standard issues in network analysis

Inference

Giving expression data, how to build a graph whose edges represent thedirectlinks between genes?

Example: co-expression networks built from microarray/RNAseq data (nodes = genes; edges = significant “direct links” between expressions of two genes)

(6)

Standard issues in network analysis

Inference

Giving expression data, how to build a graph whose edges represent thedirectlinks between genes?

Graph mining (examples)

1 Network visualization: nodesare nota priori given a position.

Random positions Positions aiming at representing connected nodes closer

É. Montastier & NV2|network analysis in response to calorie restriction 2/10

(7)

Standard issues in network analysis

Inference

Giving expression data, how to build a graph whose edges represent thedirectlinks between genes?

Graph mining (examples)

1 Network visualization: nodesare nota priori given a position.

2 Network clustering: identify “communities”

(8)

Data

Experimental protocol

135 obese women and 3 times: before LCD, after a 2-month LCD and 6 months later (between the end of LCD and the last

measurement, women are randomized into one of 5 recommended diet groups).

At every time step,221 gene expressions,28 fatty acidsand15 clinical variables(i.e., weight, HDL, ...)

É. Montastier & NV2|network analysis in response to calorie restriction 3/10

(9)

Data

Experimental protocol

135 obese women and 3 times: before LCD, after a 2-month LCD and 6 months later (between the end of LCD and the last

measurement, women are randomized into one of 5 recommended diet groups).

At every time step,221 gene expressions,28 fatty acidsand15 clinical variables(i.e., weight, HDL, ...)

Correlations between gene expressions and between a gene expression and a fatty acid levels are not of the same order:

inference method must be differentinsidethe groups andbetween two groups.

(10)

Data

Data pre-processing

At CID3, individuals aresplit into three groups: weight loss, weight regain and stable weight (groups are not correlated to the diet group according toχ2-test).

É. Montastier & NV2|network analysis in response to calorie restriction 3/10

(11)

Method for CID1, CID2 and 3 × CID3

Network inference Clustering Mining

3 inter-dataset networks rCCA

merge into one network

3 intra-dataset networks sparse partial correlation

5 networks

CID1 CID2

3×CID3

Extract important nodes Study/Compare clusters

(12)

Inference

Intra-level networks: use of partial correlations and a sparse approach (graphical Lasso as in the R packagegLasso) to select edges [Friedman et al., 2008]

É. Montastier & NV2|network analysis in response to calorie restriction 5/10

(13)

Inference

Intra-level networks: use of partial correlations and a sparse approach (graphical Lasso as in the R packagegLasso) to select edges [Friedman et al., 2008]

Inter-levels networks: use of regularized CCA (as in the R package mixOmics) to evaluate strength of the correlations

[Lê Cao et al., 2009]

(14)

Inference

Intra-level networks: use of partial correlations and a sparse approach (graphical Lasso as in the R packagegLasso) to select edges [Friedman et al., 2008]

Inter-levels networks: use of regularized CCA (as in the R package mixOmics) to evaluate strength of the correlations

[Lê Cao et al., 2009]

Combination of the 6 informations: tune the number of edges intra or inter-levels so that it is of the order of the number of nodes in the corresponding level(s)

É. Montastier & NV2|network analysis in response to calorie restriction 5/10

(15)

Visualization

Purpose: How to display the nodes in ameaningfulandaesthetic way?

(16)

Visualization

Purpose: How to display the nodes in ameaningfulandaesthetic way?

Standard approach:force directed placementalgorithms (FDP) (e.g.,[Fruchterman and Reingold, 1991])

É. Montastier & NV2|network analysis in response to calorie restriction 6/10

(17)

Visualization

Purpose: How to display the nodes in ameaningfulandaesthetic way?

Standard approach:force directed placementalgorithms (FDP) (e.g.,[Fruchterman and Reingold, 1991])

attractive forces: similar to springs along the edges

(18)

Visualization

Purpose: How to display the nodes in ameaningfulandaesthetic way?

Standard approach:force directed placementalgorithms (FDP) (e.g.,[Fruchterman and Reingold, 1991])

attractive forces: similar to springs along the edges

repulsive forces: similar to electric forces between all pairs of vertices

É. Montastier & NV2|network analysis in response to calorie restriction 6/10

(19)

Visualization

Purpose: How to display the nodes in ameaningfulandaesthetic way?

Standard approach:force directed placementalgorithms (FDP) (e.g.,[Fruchterman and Reingold, 1991])

(20)

Important node extraction

1 vertex degree: number of edges adjacent to a given vertex.

Vertices with a high degree are calledhubs: measure of the vertex popularity.

É. Montastier & NV2|network analysis in response to calorie restriction 7/10

(21)

Important node extraction

1 vertex degree: number of edges adjacent to a given vertex.

Vertices with a high degree are calledhubs: measure of the vertex popularity.

2 vertex betweenness: number of shortest paths between all pairs of vertices that pass through the vertex. Betweenness is a centrality measure(vertices with a large betweenness that are the most likely to disconnect the network if removed).

(22)

Fatty acids are highest centrality hubs

Baseline (CID1) Baseline (CID1)

After restriction (CID2)

(23)

End of intervention  (CID 3)

(CID 3)

“Weight regain” group “Weight loss” group

“Weight maintain” group  g g p

(24)

Vertex clustering

Cluster vertexes into groups that aredensely connectedand share a few links(comparatively)with the other groups. Clusters are often calledcommunities(social sciences) ormodules(biology).

Node modules are known to be more robust and meaningful than individual relationships between pairs of nodes.

É. Montastier & NV2|network analysis in response to calorie restriction 8/10

(25)

Vertex clustering

Cluster vertexes into groups that aredensely connectedand share a few links(comparatively)with the other groups. Clusters are often calledcommunities(social sciences) ormodules(biology).

Node modules are known to be more robust and meaningful than individual relationships between pairs of nodes.

Nodes were clustered usingmodularity maximization [Newman and Girvan, 2004] performed with a deterministic annealing algorithm as described in

[Reichardt and Bornholdt, 2006] (after comparison of several approaches) and implemented in the function

spinglass.communityof the R packageigraph.

(26)

Brief overview on results

5 networks inferred with 264 nodes each:

CID1 CID2 CID3g1 CID3g2 CID3g3

size LCC 244 251 240 259 258

density 2.3% 2.3% 2.3% 2.3% 2.3%

transitivity 17.2% 11.9% 21.6% 10.6% 10.4%

nb clusters 14 (2-52) 10 (4-52) 11 (2-46) 12 (2-51) 12 (3-54)

clusters werevisualizedand analyzed forimportant node extraction

É. Montastier & NV2|network analysis in response to calorie restriction 9/10

(27)

Spin glass vertexes classification

14 clusters, 3 of them with at least 2 types of variables 

At Baseline:

Adhesion and diapedesis

Fatty acids Immune response

y

biosynthesis

(28)

Waist circumference is correlated with metabolic  syndrome transcripts independently of weight change syndrome transcripts independently of weight change

After restriction (CID 2)

At baseline (CID1)

(29)

Waist circumference is correlated with metabolic  syndrome transcripts independently of weight change syndrome transcripts independently of weight change

« Weight loss » group (CID 3)

« Weight loss » group (CID 3)

At baseline (CID1)

(30)

Waist circumference is correlated with metabolic  syndrome transcripts independently of weight change syndrome transcripts independently of weight change

« Weight regain» group (CID 3) 

At baseline (CID1)

(31)

Increase in growth factors, angiogenesis and  proliferation signaling in women regaining weight

End of intervention:

proliferation signaling in women regaining weight 

Angiogenesis inhibition by TSP1 Cancer signal

Growth Hormone signaling

(32)

Positive relationship between AT myristoleic acid content  and de novo lipogenesis mRNAs in women losing weight and de novo lipogenesis mRNAs in women losing weight

After restriction  End of intervention, “weight loss” group (CID 2)

Fatty acids biosynthesis

Fatty acids biosynthesis

(33)

In conclusion:

• For the first time:

• Integrated approach of 2 omics (from the same biopsy)

• Integrated approach of 2 omics (from the same biopsy)

• In adipose tissue 

• Of a large number of patients

• In a longitudinal dietary intervention

• In a longitudinal dietary intervention

• Well characterized individuals 

• Myristoleic acid as a main lipidic biomarkers for de novo 

lipogenesis: unexpected, and quantitatively minor fatty acid in  adipose tissue and plasma

adipose tissue and plasma

• This original approach authorizes new advances in obesity and  g pp y insulin sensitivity patho‐physiology understanding

Biostatistics post‐doctoral position open!

(34)

Thank you for your attention...

... questions?

É. Montastier & NV2|network analysis in response to calorie restriction 10/10

(35)

Friedman, J., Hastie, T., and Tibshirani, R. (2008).

Sparse inverse covariance estimation with the graphical lasso.

Biostatistics, 9(3):432–441.

Fruchterman, T. and Reingold, B. (1991).

Graph drawing by force-directed placement.

Software, Practice and Experience, 21:1129–1164.

Lê Cao, K., González, I., and Déjean, S. (2009).

*****Omics: an R package to unravel relationships between two omics data sets.

Bioinformatics, 25(21):2855–2856.

Newman, M. and Girvan, M. (2004).

Finding and evaluating community structure in networks.

Physical Review, E, 69:026113.

Reichardt, J. and Bornholdt, S. (2006).

Statistical mechanics of community detection.

Physical Review, E, 74(016110).

Références

Documents relatifs

exposure period trends were observed and having high interleukin-6 twice over the 5-year exposure was associated with reduced odds of absence of diabetes, good respiratory functioning

Modulation of HIV-1 replication in primary dendritic cells in contact with autologous CD4 T-lymphocytes Bin Su, Marina Elizabeth Biedma, Alexandre Lederle, Mélanie

The last stage of biofilm life cycle corresponds to the dispersal phase (Figure 1d), where alterations in the mucus environment (oxygen, nutrients availability, pH) trigger

The R package FANet: sparse factor analysis model for high dimensional gene co-expression networks.. Anne Blum, Magalie Houee, Sandrine Lagarrigue,

Left: Ratios of number of function evaluations performed in single runs to reach a precision 10 −10 close to the global optimal f value of the proposed approach over CMA-ES depending

The R package Animal Network Toolkit (ANT) was developed with the aim of implementing in one package the many different social network analysis

An important remark is that achieving a logit equilibrium implies that at a given game stage, radio devices are able to build logit best responses in order to face the other

Static code analyzers are tools that can be used very early in the coding phase, since they are designed to analyze statically the source code, i.e.: not running..