• Aucun résultat trouvé

TRANS-C³ - The Transcriptome of downy oak (Quercus pubescens Willd.) in Response to Climate Change

N/A
N/A
Protected

Academic year: 2021

Partager "TRANS-C³ - The Transcriptome of downy oak (Quercus pubescens Willd.) in Response to Climate Change"

Copied!
2
0
0

Texte intégral

(1)

HAL Id: hal-01857993

https://hal.archives-ouvertes.fr/hal-01857993

Submitted on 17 Aug 2018

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci-entific research documents, whether they are pub-lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

TRANS-C

3

- The Transcriptome of downy oak (Quercus

pubescens Willd.) in Response to Climate Change

Xi Liu, Béatrice Loriod, Stefano Caffarri, Erwan Corre, Jean-Philippe Mevy

To cite this version:

Xi Liu, Béatrice Loriod, Stefano Caffarri, Erwan Corre, Jean-Philippe Mevy. TRANS-C3 - The

Tran-scriptome of downy oak (Quercus pubescens Willd.) in Response to Climate Change. Journées Ou-vertes Biologie, Informatique et Mathématiques (JOBIM), Jul 2018, Marseille, France. �hal-01857993�

(2)

[1] Bolger, A. M. et al. Trimmomatic: A flexible trimmer for Illumina Sequence Data. Bioinformatics, btu170 (2014).

[2] Grabherr, M. G. et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotech advance online publication, (2011). [3] https://www.qiagenbioinformatics.com/products/clc-assembly-cell/

[4] Liu J, Li G, Chang Z, et al. BinPacker: Packing-Based De Novo Transcriptome Assembly from RNA-seq Data. Lengauer T, ed. PLoS Computational Biology. 2016;12(2):e1004772. doi:10.1371/journal.pcbi.1004772. [5] Sirén J, Välimäki N, Mäkinen V (2014) Indexing graphs for path queries with applications in genome research. IEEE/ACM Transactions on Computational Biology and Bioinformatics 11: 375–388. doi:

10.1109/tcbb.2013.2297101

[6] Li H.*, Handsaker B.*, Wysoker A., Fennell T., Ruan J., Homer N., Marth G., Abecasis G., Durbin R. and 1000 Genome Project Data Processing Subgroup (2009) The Sequence alignment/map (SAM) format and SAMtools.

Bioinformatics, 25, 2078-9. [PMID: 19505943]

[7] Nicolas L Bray, Harold Pimentel, Páll Melsted and Lior Pachter, Near-optimal probabilistic RNA-seq quantification, Nature Biotechnology 34, 525–527 (2016), doi:10.1038/nbt.3519

[8] Felipe A. Simão, Robert M. Waterhouse, Panagiotis Ioannidis, Evgenia V. Kriventseva, Evgeny M. Zdobnov; BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs, Bioinformatics, Volume 31, Issue 19, 1 October 2015, Pages 3210–3212, https://doi.org/10.1093/bioinformatics/btv351

[9] Love MI, Huber W, Anders S (2014). “Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2.” Genome Biology, 15, 550. doi: 10.1186/s13059-014-0550-8.

[10] Robinson MD, McCarthy DJ, Smyth GK (2010). “edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.” Bioinformatics, 26(1), 139-140. [11] Swinton J (2009). Venn diagrams in R with the Vennerable package

Xi LIU

1

, Béatrice LORIOD

2

, Stefano CAFFARRI

3

, Erwan CORRE

1

, Jean-Philippe MEVY

4

1

CNRS, Sorbonne Université, FR2424, ABiMS platform, Station Biologique, 29680, Roscoff, France - xi.liu@sb-roscoff.fr, erwan.corre@sb-roscoff.fr

2

Transcriptomique et Génomique Marseille-Luminy (TGML). Campus Universitaire de Luminy 13009 Marseille. TAGC U1090 Inserm, Aix Marseille Universite

- beatrice.loriod@inserm.fr

3

CEA-CNRS, UMR 6191 Biologie cellulaire et moléculaire et des plantes et bactéries, Institut de Biologie Environnementale et Biotechnologie (IBEB), Laboratoire de génétique et biophysique des plantes (LGBP), Faculté des Sciences de Luminy, Case 901, 163 av de Luminy13288

Marseille cedex 9 - stefano.caffarri@univ-amu.fr

4

IMBE - UMR CNRS 7263 / IRD 237 Equipe Diversité et Fonctionnement : des Molécules aux Ecosystèmes(DFME), Aix-Marseille Université, Centre Saint-Charles - Case 4 - Jean-Philippe.mevy@imbe.fr

JOURNÉES OUVERTES DE BIOLOGIE INFORMATIQUE & MATHÉMATIQUES 2018 MARSEILLE

TRANS-C

3

- The Transcriptome of downy oak

(Quercus pubescens Willd.) in Response to Climate Change

The TRANS-C3 project fits into the context of global change and aimed to understand the response of forest trees to the aridification of the Mediterranean climate. For this purpose, researches have

been conducted using downy oak model at the O3HP (Oak Observatory at the OHP) site, where the drought prediction is simulated by a reduction in rainfall amount to about 30%. Precisely the

objective was to understand in situ plant response to climate change both in terms of gene expression, metabolic footprint and the impact on an essential physiological process: photosynthesis.

Keywords: RNA-seq, Genome-guided de novo transcriptome assembly, Gene expression, Oak, Global change, Cross-cutting approach, Photosynthesis, Metabolomics

1. Background

2. Samples

Tissue

Leaves from Quercus pubescens Willd.

40 Samples and 4 conditions

2 seasons (summer – spring) x 2 treatments (water stress – control) x 10 trees

Sequencing

Illumina Next-Seq 500: 150 bp paired-end x 40 libraries (≈50M reads/sample)

Assembly

Nb Seq

Nb Residue

Average

N50

Trinity

124070

53333443

429.87

465

CLC de novo

[3]

458607

203955898

444.73

452

BinPacker

[4]

396590

478221122

1205.83

2102

Trinity genome-guided

530080

315371132

594.95

780

3. Assembly

Preprocessing

Trimmomatic

[1]

: adapter removing, quality trimming and filtering

% of paired reads surviving vary from 89% to 93%, with no notable difference between the

samples: 1.7Md pair reads pass the cleaning

In silico reads normalization

Trinity

[2]

: normalization by set then global normalization

• %selection by read set vary from 16% to 26% (360M pair reads in total), e.g. reads selected

after normalization in summer samples > spring samples

summer samples display higher

transcriptomic diversity

• 86M paired reads after final step of global normalization

De novo assemblies

Table 1 | comparison of different assembly metrics. Reads were assembled in FR orientation, K-mer was set or estimated at 25.

BinPacker was used by activating option -d. HiSta2

[5]

and Samtools

[6]

were used in upstream for Trinity genome-guided de novo

assembly. 155986 sequences in Trinity genome-guided assembly possesses the TPM value superior to 1 cross all the samples.

• Trinity de novo should have generated more than 124070 sequences

• BinPacker generated longer sequences and assembled more residues than the other

assembler tested

• Trinity genome-guided using genome from close species Q. robur

assembled an higher

number of transcripts than other assemblers

Pseudo-mapping reads back to the assemblies

Figure 1 | Comparison of pseudoalignment rate between

different assemblies.

Kallisto

[7]

analysis with FR strand-specific paired-end

pseudoalignment option. We also mapped reads back to a

public reference transcriptome, but the pseudo-mapping rate

was just around 40%.

50 55 60 65 70 75 80 85 90 p C 05 5 p C 06 0 p C 06 2 p C 06 3 p C 06 5 p C 06 8 p C 09 0 p C 09 4 p C 04 4 p C 04 9 p E0 7 1 p E0 7 2 p E0 7 4 p E0 7 5 p E0 7 8 p E0 7 9 p E1 0 2 p E1 0 3 p E1 0 4 p E1 0 5 e C 05 5 e C 06 0 e C 06 2 e C 06 3 e C 06 5 e C 06 8 e C 09 0 e C 09 4 e C 05 9 e C 90 5 e E0 7 1 e E0 7 2 e E0 7 4 e E0 7 5 e E0 7 8 e E0 7 9 e E1 0 2 e E1 0 3 e E1 0 4 e E1 0 5 %ma p p in g Samples

BinPacker

CLC

Trinity

Trinity genome-guided

Trinity genome-guided de novo assembly displays the best pseudoalignment rate compare to

the other de novo assemblers for the diploidic species assembly.

Trinity genome-guided de novo assembly evaluation

Contig Ex90N50 Statistic

: Ex90N50 is 1544bp and corresponds to 34572 transcripts

BUSCO

[8]

: C:87.9%[S:41.7%,D:46.2%],F:7.8%,M:4.3%,n:1440

BUSCO analysis result displays a good completude (1440 BUSCO groups searched) and

completeness (87.9% complete BUSCOs orthogroups) of transcriptome, but displays a high

number of duplicated groups (46.2%).

Trinity genome-guided de novo assembly was chosen as the

reference transcriptome for further analyses.

Samples

eC

eE

pC

pE

eC

/

15

3122 3529

eE

31

/

3460 3709

pC

2312 2613

/

2

pE

2569 1949

11

/

4. Differential analysis

Table 2 | Differentially expressed genes (DEG) in response to the aridification.

The thresholds of statistical tests were fixed at Padj<1e

-3

and log

2

FC≥2. 5724

and 4153 DEG were detected by DESeq2

[9]

and edgeR

[10]

respectively (detail

shown in black and grey). Abbreviations of the conditions: e and p for summer

and spring, C and E for control and water stress.

Figure 2 | Venn diagram of

edgeR results. Venn diagram

were realized with R packages

Vennerable

[11]

.

Genetic expression in response to the aridification varies with the season, according to edgeR

results, 4 transcripts from 42 were expressed independently to the season.

5. Water stress vs control in summer: annotation with GO-enrichment

0

1

2

3

4

5

6

7

8

ribonuclea

se

inhibi

tor…

4-hydroxy-4-m

ethyl-2-…

ferrochel

atase

ac

ti

vi

ty

oxaloaceta

te

de

carboxylase

lyas

e ac

ti

vit

y

he

me

biosyn

th

eti

c proc

ess

he

me

me

ta

bol

ic

pro

cess

oxo-acid

-l

yas

e a

ctiv

ity

resp

ons

e

to fruct

ose

resp

ons

e

to g

lucose

thi

o

l-depen

d

en

t ub

iqu

itin

-…

resp

ons

e

to hexose

resp

ons

e

to mon

o

sacch

ar

ide

ub

iqu

itin

-l

ike protei

n-…

resp

ons

e

to s

uc

rose

resp

ons

e

to disa

cc

hari

d

e

porphyri

n-co

n

ta

ining

-ln(P)

Termes GO

Up-regulated (a)

0

1

2

3

4

5

6

7

8

regulati

on of

tr

yptoph

an

m

etabolic…

pos

itiv

e regul

ati

on of

tr

yptoph

an…

pe

pti

de

-me

thi

onine (S)

-S-o

xi

de

me

thy

lati

on-d

ep

en

de

nt c

h

roma

ti

n…

aux

in efflux

pos

itiv

e regul

ati

on of

hormone…

pos

itiv

e regul

ati

on of

au

xi

n…

molybde

nu

m inc

or

porati

o

n

in

to…

me

tal i

nc

orpor

ation into met

al

lo-…

sing

let oxyg

en

-m

ed

iated…

regulati

on of

au

xi

n m

etabolic proc

ess

ph

os

ph

ate ion tr

ans

m

emb

rane

-ln(P)

Termes GO

Down-regulated (b)

Figure 3 | Enrichment analysis based on GO functional categories for transcripts that were significantly at p-value<0.01 (a) up-regulated and (b)

down-regulated in response to Q. pubescens drought treatment during summer.

6. Plasticity of photosynthetic machinery: light reactions

Figure 4 | Chlorophyll fluorescence parameter measurements during summer: Kinetics of Photosystem II quantum yield (a), photochemical

quenching (b) and Non-photochemical quenching coefficients (c).

0 1 2 3 4 5 6 7 0 500 1000 1500 2000 NPQ

PAR (µmole photons. m-2.s-1)

NPQ (c)

0 0,4 0,8 1,2 0 500 1000 1500 qP

PAR (µmole photons. m-2.s-1)

qP (b)

During drought, light energy absorbed in excess is mainly dissipated as heat, as shown by the

increase of the Non-photochemical quenching (NPQ) coefficient.

0 0,1 0,2 0,3 0,4 0,5 0,6 0,7 0,8 0,9 0 500 1000 1500 2000 Y ie ld

PAR (µmole photons. m-2.s-1)

PSII actual Quantum yield (a)

Exclusion

Control

Photosynthetic pigments

PsbO Oxygen Evolving Complex

CP43 PSII core

Lhcb1 PSII antenna

PsbS PSII protein, NPQ activation

PsaD PSI core

Lhca2 PSI antenna

Lhcb3 PSII antenna

CP24 PSII minor antenna

7. What relationship with pigments?

Figure 5 | Q. pubescens leaf chlorophylls and carotenoids

relative contents by HPLC analysis.

0 5 10 15 20 25 30 35 40 45 50

C Spring

Ex Spring

C Summer

Ex Summer

Only total carotenoids change with

rain exclusion and seasons

Figure 6 | Relative abundance

of photosystems II and I

proteins after electrophoresis

separation

8. The proteins of photosystems I and II?

• CP24 increases with the exclusion rain during

the summer

• Very clear effect of an increased synthesis of

Lhcb3 in summer and spring

11. Conclusion and Perspective

This project has identified genetic, biochemical, chemical and physiological markers of plant

adaptation to climate change in a cross-cutting approach. It has been carried out around

several disciplinary fields: Genomics, Biochemistry, Ecophysiology, Chemistry, Bioinformatics

and Modeling, opening perspectives for more ambitious projects.

10. Enrichment of the metabolic pathway

Figure 7 | Metabolite

markers

pathway

enrichment

analysis

and pathway impact

values

from

the

pathway

topology

analysis.

The water deficit significantly activates the

galactose metabolism pathway

9. Targeted Metabolomics: Differential

Analysis

Table 3 | Metabolite

markers in response to

drought throughout the

seasons: Exclusion (E)

versus Control (C) and

spring versus summer.

Pyroglutamic acid identified as a metabolic

marker of drought. Sugars as Xylulose and

Sorbitol are seasonal metabolite markers

Références

Documents relatifs

Successful assembly of a bacterial genome in one circular contig Discovery of unknown phage structural variants. Degnan

Abstract – In the search for a trophic control of rhythmic growth of young oak seedlings, sucrose synthase accumulation, localization and activity were studied in the apex,

Keywords: Transcriptome assembly, RNA‑seq, Repeats, Alternative splicing, Formal model for representing repeats, Enumeration algorithm, De Bruijn graph topology, Assembly

Response of Scots pine (Pinus sylvestris L.) and pubescent oak (Quercus pubescens Willd.) to soil and atmospheric water deficits under Mediterranean mountain climate... DOI:

Climate change and oak growth decline: Dendroecology and stand productivity of a Turkey oak (Quercus cerris L.) old stored coppice in Central Italy... DOI:

The aims of this work were (1) to detect drivers of current pine and oak occurrence in a mountain region of the south- western Alps, by fitting species distribution models (SDM)

Predicted 10-year probability of individual-tree mortality of pyrenean oak as a function of tree dbh and h /Ho ratio for observed dead and live trees for model data set (a) and (c)

The contigs (Figure 1) and variants (Figure 2) overview pages include a set of graphics showing general statistics, containing for example the contig length histogram, alignment