• Aucun résultat trouvé

The animal phylogeny and the fundamental importance of taxon sampling

N/A
N/A
Protected

Academic year: 2021

Partager "The animal phylogeny and the fundamental importance of taxon sampling"

Copied!
19
0
0

Texte intégral

(1)

The Animal Phylogeny

and the Fundamental Importance

of Taxon Sampling

Hervé Philippe, Henner Brinkmann, Denis Baurain

(2)

The New Animal Phylogeny

Philippe et al. (2005) Mol Biol Evol 22:1246-1253

71 slow-evolving genes x 37 animal taxa (20,705 sites) ; ML tree

(3)

bilaterians triploblasts

Protostomia

The New Animal Phylogeny

Philippe et al. (2005) Mol Biol Evol 22:1246-1253 diploblasts

Ecdysozoa

Lophotrochozoa

71 slow-evolving genes x 37 animal taxa (20,705 sites) ; ML tree

(4)

Rokas et al. (2005) Science 310:1933-1938

50 genes x 17 animal taxa (12,060 sites) ; ML/MP tree

Lack of resolution among most metazoan phyla

(5)

bilaterians/triploblasts

diploblasts Ecdysozoa

Lophotrochozoa

Rokas et al. (2005) Science 310:1933-1938

50 genes x 17 animal taxa (12,060 sites) ; ML/MP tree

Lack of resolution among most metazoan phyla

(6)

Possible causes for the lack of resolution

• long branch attraction artifacts

– removal of nematods and platyhelminths

• presence of rogue taxa

– computation of leaf stability indices (bootstrap-based) – removal of poriferans and cnidarians

• deviations in amino-acid composition

– removal of 6 scattered taxa violating the

homogeneity assumption

– computation of a LogDet NJ tree

(7)

Effect of the removal of 3 long-branched taxa ML/MP clade nd 57/68 Cnidaria 13 79/07 priapulids + arthropods 9 62/27 Protostomia 8 93/50 mollusks + annelids 7 74/32 Bilateria 6 96/60 Chordata 3

supplementary data of Rokas et al. (2005) Science 310:1933-1938 after before clade nd 55/63 57/68 Cnidaria 13 69/35 79/07 priapulids + arthropods 9 100/72 62/27 Protostomia 8 97/78 93/50 mollusks + annelids 7 100/72 74/32 Bilateria 6 94/66 96/60 Chordata 3

(8)

Investigating the apparent systemic lack of resolution

• amount of missing data

– design target: 20%

– removal of priapulid (68%) and mollusk (54%)

• total amount of data (12,060 sites)

– 56% of sites are variable

– 31% are parsimony-informative

• distribution of informative sites among topologies

– likelihood mapping (no resolution for diploblasts)

– resampling up to 100,000 sites (slightly better support)

(9)

The rationale for comparing Metazoa and Fungi

• aim is to distinguish between 2 hypotheses

– mutational saturation has erased phylogenetic signal – lack of resolution as a signature of closely spaced

cladogenetic events in early animal evolution

• Why Fungi?

– sistergroup relationship (Opisthokonta)

– approximately same date of origin (according to fossils and molecular clocks)

– same set of genes available (49 out of 50)

– roughly same tempo (ML-estimated distances) and same mode of evolution (AA substitution matrices)

(10)

50 genes x (17 + 15) taxa (12,060 sites) ; ML/MP tree Rokas et al. (2005) Science 310:1933-1938

Fungi are better resolved than Metazoa

(11)

The case for a radiation compressed in time

• molecular data

– Fungi have a higher stemminess (F = 0.201) than Metazoa (F = 0.121)

– Fungi have more parsimony-informative sites

(5,015 vs 3,701) and less singleton sites (1,518 vs 3,080) than Metazoa

• paleontological data

– poriferans, cnidarians and bilaterian fossils appear within a 50 MY time-frame

(12)

42 MY ; 600 MYA

Simulation analysis of the mammalian radiation

Rokas et al. (2005) Science 310:1933-1938

42 MY ; 107 MYA

covarion model ; 16,000 sites ; MP (NJ)

> 10 MY

The lack of resolution of the animal tree is interpreted as the signature of a radiation compressed in time.

(13)

Debunking Rokas et al. (2005)...

• Rokas et al. (2005) are likely wrong

– very few taxa

– extensive use of MP

– ignorance of previous studies

– dismissing of alternative explanatory hypotheses – misleading simulations

– biased conclusions

• We will demonstrate that...

– animal evolution can be resolved (no explosion) – taxon sampling is of fundamental importance

(14)

A tree with more genes and a denser taxon sampling spider cattle tick deer tick shrimp lobster water flea locust honey bee jewel wasp red flour beetle

fall armyworm silkworm planarian 1 planarian 2 tapeworm liver fluke blood fluke 1 blood fluke 2 polychaete 1 polychaete 2 leech earthworm squid scallop oyster snail

giant owl limpet

sea squirt 1 sea squirt 2 sea squirt 3 haghfish lamprey zebrafish frog mammal bird Hydractinia echinata Hydra magnipapillata Nematostella vectensis Acropora millepora Reniera sp. Suberites domuncula Monosiga ovata Proterospongia Monosiga brevicollis Neocallimastix Blastocladiella Glomus Rhizopus Cryptococcus Ustilago Schizosaccharomyces Saccharomyces Yarrowia 0.1 64 59 99 99 97 98 99 89 71 Arthropoda Platy-helminthes Annelida Mollusca Tunicata Vertebrata Cnidaria Porifera Choanoflagellata Fungi Xiphinematardigrad

Philippe et al. (unpublished)

133 genes x 47 animal taxa (31,092 sites) ML tree ; bullets = 100% bootstrap support

(15)

triploblasts Bilateria Protostomia Ecdysozoa Lophotro-chozoa Deuterostomia diploblasts

A tree with more genes and a denser taxon sampling

spider cattle tick deer tick shrimp lobster water flea locust honey bee jewel wasp red flour beetle

fall armyworm silkworm planarian 1 planarian 2 tapeworm liver fluke blood fluke 1 blood fluke 2 polychaete 1 polychaete 2 leech earthworm squid scallop oyster snail

giant owl limpet

sea squirt 1 sea squirt 2 sea squirt 3 haghfish lamprey zebrafish frog mammal bird Hydractinia echinata Hydra magnipapillata Nematostella vectensis Acropora millepora Reniera sp. Suberites domuncula Monosiga ovata Proterospongia Monosiga brevicollis Neocallimastix Blastocladiella Glomus Rhizopus Cryptococcus Ustilago Schizosaccharomyces Saccharomyces Yarrowia 0.1 64 59 99 99 97 98 99 89 71 Arthropoda Platy-helminthes Annelida Mollusca Tunicata Vertebrata Cnidaria Porifera Choanoflagellata Fungi Xiphinematardigrad

Philippe et al. (unpublished)

133 genes x 47 animal taxa (31,092 sites) ML tree ; bullets = 100% bootstrap support

(16)

More genes or more/better taxa?

• Philippe et al. (unpublished) vs Rokas et al. (2005)

– 133 (31,092 sites) vs 50 (12,060 sites)

– among 10 random drawings of 50 genes, only one unresolved tree was retrieved

• Philippe et al. (2005)

– 71 slow-evolving genes (20,705 sites) are able to resolve most internodes in the animal tree

A large dataset is needed but not enough to accurately resolve difficult phylogenetic relationships.

(17)

Taxon sampling and non-phylogenetic signal tardigrad spider cattle tick deer tick shrimp lobster water flea locust honey bee jewel wasp red flour beetle fall armyworm silkworm Caenorhabditis planarian 1 planarian 2 tapeworm liver fluke blood fluke 1 blood fluke 2 polychaete 1 polychaete 2 leech earthworm squid scallop oyster snail giant owl limpet

sea squirt 1 sea squirt 2 sea squirt 3 haghfish lamprey zebrafish frog mammal bird Hydractinia echinata Hydra magnipapillata Nematostella vectensis Acropora millepora Reniera sp. Suberites domuncula Monosiga ovata Proterospongia Monosiga brevicollis Neocallimastix Blastocladiella Glomus Rhizopus Cryptococcus Ustilago Schizosaccharomyces Saccharomyces Yarrowia 0.1 67 72 99 99 99 97 57 57 64 99 84 57 B spider cattle tick deer tick shrimp lobster water flea locust honey bee jewel wasp red flour beetle

fall armyworm silkworm planarian 1 planarian 2 tapeworm liver fluke blood fluke 1 blood fluke 2 polychaete 1 polychaete 2 leech earthworm squid scallop oyster snail

giant owl limpet

sea squirt 1 sea squirt 2 sea squirt 3 haghfish lamprey zebrafish frog mammal bird Hydractinia echinata Hydra magnipapillata Nematostella vectensis Acropora millepora Reniera sp. Suberites domuncula Monosiga ovata Proterospongia Monosiga brevicollis Neocallimastix Blastocladiella Glomus Rhizopus Cryptococcus Ustilago Schizosaccharomyces Saccharomyces Yarrowia 0.1 64 59 99 99 97 98 99 89 71 Arthropoda Platy-helminthes Annelida Mollusca Tunicata Vertebrata Cnidaria Porifera Choanoflagellata Fungi Xiphinematardigrad A

(18)

Taxon sampling and non-phylogenetic signal tardigrad spider cattle tick deer tick shrimp lobster water flea locust honey bee jewel wasp red flour beetle fall armyworm silkworm Caenorhabditis planarian 1 planarian 2 tapeworm liver fluke blood fluke 1 blood fluke 2 polychaete 1 polychaete 2 leech earthworm squid scallop oyster snail giant owl limpet

sea squirt 1 sea squirt 2 sea squirt 3 haghfish lamprey zebrafish frog mammal bird Hydractinia echinata Hydra magnipapillata Nematostella vectensis Acropora millepora Reniera sp. Suberites domuncula Monosiga ovata Proterospongia Monosiga brevicollis Neocallimastix Blastocladiella Glomus Rhizopus Cryptococcus Ustilago Schizosaccharomyces Saccharomyces Yarrowia 0.1 67 72 99 99 99 97 57 57 64 99 84 57 B spider cattle tick deer tick shrimp lobster water flea locust honey bee jewel wasp red flour beetle

fall armyworm silkworm planarian 1 planarian 2 tapeworm liver fluke blood fluke 1 blood fluke 2 polychaete 1 polychaete 2 leech earthworm squid scallop oyster snail

giant owl limpet

sea squirt 1 sea squirt 2 sea squirt 3 haghfish lamprey zebrafish frog mammal bird Hydractinia echinata Hydra magnipapillata Nematostella vectensis Acropora millepora Reniera sp. Suberites domuncula Monosiga ovata Proterospongia Monosiga brevicollis Neocallimastix Blastocladiella Glomus Rhizopus Cryptococcus Ustilago Schizosaccharomyces Saccharomyces Yarrowia 0.1 64 59 99 99 97 98 99 89 71 Arthropoda Platy-helminthes Annelida Mollusca Tunicata Vertebrata Cnidaria Porifera Choanoflagellata Fungi Xiphinematardigrad A

Philippe et al. (unpublished) 2 conflicting signals in tree B

- phylogenetic signal (Ecdysozoa) - non-phylogenetic signal (LBA) hence,

- global loss of resolution - artefactual topology

18 more nematodes added to tree B are unable to retrieve tree A. One slow-evolving taxon is better than a bunch of fast-evolving taxa.

(19)

The Cambrian Explosion revisited

Références

Documents relatifs

They are not simply the sum of the effects caused by each individual parasite; instead, they are the product of a combination of known and novel effects acting on key

Relative amounts of the a subunit of ATPase (ssu a ATPase), a actinin, a tubulin and protein spot 121 transcripts in N -3- oxododecanoyl- L -homoserine lactone (3-oxo-C

Stochastic algorithm, Robbins-Monro, variance reduction, Central limit theorem, Statistical Romberg method, Euler scheme, Heston model..

In conclusion, by genetic investigation of environmental popula- tions of Cryptococcus neoformans and Cryptococcus gattii species com- plexes, our study has revealed that clinical

Multiplication/Division Avec Les Chiffres Romains (A) Réponses.. Répondez à chaque question en

Table 4 also includes test statistics computed from models estimated on roughly one-seventh the sample size (818 observations). At this point, the test statistics fail to reject

Any information which readers of Environmental Conservation may be able to provide will help us in understanding what is happening with the current body of international conventions,

Etroitement dépendants de l’élaboration de la surface topographique récente, les matériaux bréchiques, calcaires ou argileux qui assurent le comblement des fractures, fissures