• Aucun résultat trouvé

A mosaic monoploid reference sequence for the

N/A
N/A
Protected

Academic year: 2021

Partager "A mosaic monoploid reference sequence for the"

Copied!
20
0
0

Texte intégral

(1)

A mosaic monoploid reference sequence for the

highly complex genome of sugarcane

Olivier GARSMEUR

(2)

S. officinarum 2n = 80 Domestication

Highly polyploid

Aneuploid

Interspecific

S. spontaneum 2n = 40 --> 128 S. robustum 2n = 60,80-->200

modern cultivars

2n ~ 110-130

(3)

R570, 2n=ca 115

80 % S. officinarum

10% S. spontaneum

10% recombinants

S. officinarum recombinants

S. spontaneum x=10 x=8 Total genome size: 10 Gb Sorghum 750 Mb Rice 400 Mb Monoploid genome size: 930 Mb

Global organization of the complex genome of sugarcane

(photo and figure adapted from D’Hont et al, 1996; D’Hont et al, 1998; D’Hont 2005, Piperidis et al 2010)

Genomic in situ hybridization

Homoeologous chromosomes

(4)

The sugarcane genome poses challenges that have not been addressed in any prior sequencing

project.

- its high level of heterozygosity which makes an assembly of the whole genome very challenging through a classical shotgun sequencing approach.

The main difficulties reside in

- its high polyploidy, with a complete set of homoeologous chromosomes predicted to range from 10 to 12 copies

Sugarcane

Homoeologous chromosomes

(5)

Thirteen hom(oe)ologous BAC clones

corresponding to the Adh1 region

Seven hom(oe)ologous BAC clones

corresponding to the Bru1 region

S. spontaneum

S. officinarum

recombinants

Adh1

Bru1

 Comparison of sugarcane homoeologous regions

Four hom(oe)ologous BAC clones

corresponding to the Pst2 region

Six hom(oe)ologous BAC clones

corresponding to the CAD region

Pst2

CAD

(6)

Garsmeur et al. New Phytologist 2010 unpublished data Hap. I Hap. II Hap. III Hap. IV Hap. V Hap. VI Hap. VII Hap. VIII Hap. IX Hap. X Hap. XI Hap. XII Hap. XIII 10 Kb Sugarcane homoeologous BAC clones Sorghum Orthologous region

 Gene structure conservation among sugarcane homeologs  All hom(oe)o-alleles predicted functional

LINE LINE I. Sh051L01 II. SH186P07 III. Sh265O22 IV. Sh192N12 V. Sh188C19 VI. Sh102M23 VII. Sh111P05 VIII. Sh182G15 IX. Sh209M19 X. Sh245F09 XI. Sh242M02 XII. Sh172H13 XIII. Sh206M17 Sorghum 84 96 100 89 100 94 35 100 55 29 100 0.01 Haplotype I Haplotype II Haplotype IV Haplotype V Haplotype VI Haplotype VII Haplotype VIII Haplotype IX Haplotype X Haplotype XI Haplotype XII Haplotype XIII Haplotype III

Comparison of sugarcane homeologs and comparison with sorghum

Transposable elements variations  High colinearity with sorghum

(7)

Sorghum sequence

S. officinarum S. spontaneum

The sugarcane sequencing strategy

S. officinarum x=10 S. spontaneum x=8 Mosaic monoploid genome Sugarcane BACs on align on sorghum Global similar organization (synteny conservation)

 Sugarcane BAC sequences represent a mosaic of the monoploid genome

(8)

Sugarcane BAC

Sb01 Sb02 Sb03 Sb04 Sb05 Sb06 Sb07 Sb08 Sb09 Sb10

Sorghum Genes

Sorghum transposable elements

 11,732 R570 sugarcane BACs anchored onto the sorghum

Distribution of the sugarcane BACs on the sorghum genome

Sugarcane BAC selection using the sorghum genome sequence

 Whole Genome Profiling (WGP) technology

produce short illumina sequence tags from terminal ends of restricted BAC fragments 20,736 sugarcane BAC  455,656 uniq WGP tags

(9)
(10)

= minimum set of BACs to be sequenced to obtain the best coverage of chromosomes

Selection of a Minimum Tiling Path (MTP) of sugarcane BACs

sorghum SUGARCANE BAC1 SUGARCANE BAC2 SUGARCANE BAC3 SUGARCANE BAC4 SUGARCANE BAC5

x

x

11,732 BAC anchored

11,732 BAC anchored => Minimun Tiling Path 4,660 sugarcane BACs to be sequenced Represent the number of BAC to be sequenced to cover the gene-rich part of a

(11)

Sequencing a Minimum Tiling Path (MTP) of R570 BAC

Aim = High quality sequence and BAC assembled in 1 contig

BAC sequenced through international collaboration

BAC sequenced using PacBio RSII technology and 100X depth coverage Single Molecule Real Time sequencing (SMRT)  produce long reads (several kb)

high 100X sequencing  High quality sequence

 4535 BAC sequenced

 97.5% of BAC assembled in <= 3 contigs

 531 Mb of high quality sequences

(12)

Assembly of Single tiling path (STP) of sugarcane BACs

Sorghum gene models Sugarcane BAC MTP

From

MTP = 531 Mb

 Assembly

STP = 382 Mb

STP

(13)

Global genome comparison with sorghum

 12,468 Single dose SNPs assembled in 132 Cosegregation Groups (CG)

Markers aligned onto the sorghum genome

 95 % of the CGs with most markers aligning on one sorghum chromosome  Allow assembling CGs in sugarcane Homoeology Groups (HG)

 Construction of a R570 SNP genetic map

Data analyzed with in-house pipeline : Process_reseq and vcf2pop Available at https://github.com/SouthGreenPlatform/VcfHunter/

(14)

origin of R570 cultivar CGs

DArTseq Genotyping of a panel of 34 Saccharum accessions

-12

S. officinarum

- 9

S. robustum

- 13

S. spontaneum

 Identification diagnostic SNP

Sb03 S. officinarum/robustum VS S. spontaneum

(15)

Sb04 Sb01 HG01 HG04 Sb10 Sb03 HG02 HG03 Sb06 HG07 Sb07 HG08 HG10 Sb05 Sb02 HG05 Sb08 HG09 Sb09 HG06 Sb03 Sb01 Sb04 Sb10 x=10 S. of fic inarum Sb02 Sb08 Sb09 Sb06 Sb05 Sb07 So rgh u m

Global genome comparison with sorghum

 125 CGs assembled in 10 homoeology groups aligning on one sorghum chromosome

S. officinarum

(16)

Sb07

Sb05

Sb06

sHG07

sHG08

Sb06

Sb05

Sb07

Sb08

Sb08

Sb02

Sb09

sHG05

sHG06

Sb02

Sb09

 For a few CGs most markers aligning on two sorghum chromosomes

Global genome comparison with sorghum

Sorghum

S. spontaneum

S. spontaneum

(17)

Global Sorghum and Sugarcane genomes comparison

x=10 S. officinarum S. spontaneum Sb03 Sb01 Sb04 Sb10 Sb02 Sb08 Sb09 9 Sb06 Sb05 Sb07 5 6 7 8 Sorghum 2 1 3 4 10 7 5 2 1 3 4 6 8 x=10 X=8

(18)

Conclusions

☑ A high quality sequences of 382 Mb representing the gene rich fraction of the sugarcane monoploid genome

☑ 25,316 protein-coding gene models within their genomic sequence context providing access to regulatory elements for functional studies

Serve as high quality framework to help assemble the whole genome of sugarcane  Project in progress in progress with JGI

☑ Confirm global colinarity between sugarcane and sorghum

☑ A few chromosome rearrangements between Sorghum/S. officinarum and S. spontaneum  These rearrangements corroborate and explain a variation of basic chromosome

numbers in S. officinarum vs S. spontaneum with x= 10 and x=8, respectively  x=10 is recognized to be ancestral in the Saccharinae-Sorghinae, these

rearrangements thus must have arisen in the S. spontaneum lineage after its divergence from the lineage of S. officinarum

(19)

An integrated web-based database providing centralized access

to the sugarcane reference genome sequences and genomic resources.

(20)

Edwin van der Vossen Rudie Antonise

International Consortium for Sugarcane Biotechnology

CHARCA, EEAOC, SRA,CTC, CENICANA,CINCAE,CEGICANA, VSI, MSIRI, SASRI, Mitr Phol, FSCL, HARC, ASCL, RGVSG

Olivier Garsmeur Gaetan Droc Guillaume martin Carine Charron Catherine Hervouet Angélique D’Hont FRANCE AUSTRALIA Karen Aitken Robert Henry Bernard Potier SOUTH AFRICA

Marie-Anne Van Sluys

BRAZIL Jeremy Schmutz Jane Grimwood Jerry Jenkins Blake Simmons USA Hélène Bergès FRANCE NETHERLANDS

Références

Documents relatifs

In addition, RNASeq or whole transcriptome shotgun sequencing approach based on total RNA extracted from an ampelovirus infected sugarcane variety yielded a

With sugarcane mosaic virus (SCMV, not SCSMV) in Australia, infected leaf extracts are applied to the surface of test plant leaves via an infested scourer pad and this leads

A minimum tiling path of 4,660 sugarcane BAC clones that best cover the gene-rich part of the sorghum genome was selected, sequenced and assembled in a 382 Mb single tiling path

Dans des études précédentes, nous avions montré que les chromosomes hom(e)ologues du génome de la canne à sucre partagent entre eux un très haut niveau de micro-colinéarité

This high quality reference sequence that corresponds to the gene rich part of a sugarcane monoploid genome will represent a very important resource for genetic, structural

This high quality reference sequence that corresponds to the gene rich part of a sugarcane monoploid genome will represent a very important resource for genetic, structural

The main difficulties reside in the high polyploidy (2n ~ 12x ~ 120), and the high level of heterozygosity of cultivars which make an assembly of the genome very challenging

Abstract: Toward a Reference Sequence of the Gene-Rich Part of the Highly Polyploid Sugarcane Genome (Plant and Animal Genome XXIII