• Aucun résultat trouvé

Contents 1

N/A
N/A
Protected

Academic year: 2021

Partager "Contents 1"

Copied!
3
0
0

Texte intégral

(1)

iii

Contents

1 Introduction ...1

1.1 Mobile genetic elements and horizontal transfer...4

1.2 Impact of MGEs on bacterial genomic structure...6

1.3 MGE mosaicism: Playing with Lego ...7

1.4 The ACLAME project...8

1.5 Bacteriophages...10

1.5.1 Prophages...11

1.5.2 Phage genomic structure...13

1.5.3 Functional modules ...14

1.5.4 Phage Evolution ...18

1.5.5 Phage Taxonomy...21

1.5.6 The need for a new classification system ...26

1.6 Methods for detecting HGT and MGEs...27

1.6.1 Phylogenetic methods ...27

1.6.2 Compositional methods...28

1.6.3 Predicting integrated mobile genetic elements ...30

1.7 Graph and Network analysis: Applications in Biology ...34

1.7.1 Introduction to graph-theory ...34

1.7.2 Graph-theory in Biology ...39

1.8 Classification...41

1.8.1 Clustering...41

1.8.2 Evaluation of a clustering result ...47

2 Objectives of the thesis ...51

3 Analysis of the phage protein sequence space classified into the ACLAME database...52

4 Graph-based analysis and modularity-based classification of phage...55

4.1 Reticulate representation of evolutionary and functional relationships between phage genomes...61

4.1.1 Abstract...61

4.1.2 Introduction ...62

4.1.3 Methods...63

4.1.4 Results...70

4.1.5 Discussion ...86

4.1.6 Conclusions...90

4.1.7 Supplementary Materials...91

4.1.8 Acknowledgments ...91

4.1.9 Literature Cited ...92

5 Prophinder: a computational tool for prophage prediction in prokaryotic genomes...95

5.1 Method...95

5.1.1 Input data...95

5.1.2 Prophinder procedure...95

5.1.3 Detection of phage-like proteins in the bacterial genome ...95

(2)

iv

5.2 Results and Discussion...101

5.2.1 Overview of the results ...101

5.2.2 Evaluation ...101

5.2.3 Filling the gap in the phage sequence landscape ...116

6 Discussion and Perspectives ...123

6.1 Phage Evolution: Modules as elementary building blocks...123

6.2 Reticulate Classification for Phage ...124

6.2.1 The phage sequence landscape ...125

6.2.2 Module-based classification ...126

6.3 (Pro)phage impact in bacterial evolution ...127

6.4 Towards classification of other MGEs ...128

7 References ...131

Annex 1. Phage Gene Ontology: PhiGO...143

Annex 2. Evolutionary Cohesive Modules...145

Annex 3. Prophinder: algorithm, database, web server and web service ...149

Annex 4. Global analysis of plasmids ...151

(3)

vii Abstract

Bacteriophage genomes show pervasive mosaicism, indicating that horizontal gene exchange plays a crucial role in their evolution. Phage genomes represent unique combinations of modules, each of them with a different phylogenetic history. Thus, a web-like, rather than a hierarchical scheme is needed for an appropriate representation of phage evolutionary relationships. Part of the virology community has long recognized this fact and calls for changing the traditional taxonomy that classifies tailed phages according to the type of genetic materials and phage tail and head/capsid morphologies. Moreover, based on morphological features, the current system depends on inspection of phage virions under the electron microscope and cannot directly classify prophages. With the genomic era, many phages have been sequenced that are not classified, calling for development of an automatic classification procedure that can cope with the sequencing pace. The ACLAME database provides a classification of phage proteins into families and assigns the families with at least 3 members to one or several functions.

In the first contribution of this work, the relative contribution of those different protein families to the similarities between the phages is assessed using pair-wise similarity matrices. The modular character of phage genomes is readily visualized using heatmaps, which differ depending on the function of the proteins used to measure the similarity.

Next, I propose a framework that allows for a reticulate classification of phages based on gene content (with statistical assessment of the significance of number of shared genes). Starting from gene/protein families, we built a weighted graph, where nodes represent phages and edges represent phage-phage similarities in terms of shared families. The topology of the network shows that most dsDNA phages form an interconnected group, confirming that dsDNA phages share a common gene pool, as proposed earlier. Differences are observed between temperate and virulent phages in the values of several centrality measures, which may correlate with different constraints to rampant recombination dictated by the phage lifestyle, and thus with a distinct evolutionary role in the phage population.

To this graph I applied a two-steps clustering method to generate a fuzzy classification of phages.

Using this methodology, each phage is associated with a membership vector, which quantitatively characterizes the membership of the phage to the clusters. Alternatively, genes were clustered based on their ‘phylogenetic profiles’ to define ‘evolutionary cohesive modules’. Phages can then be described as composite of a set of modules from the collection of modules of the whole phage population. The relationships between phages define a network based on module sharing. Unlike the first network built from statistical significant number of shared genes, this second network allows for a direct exploration of the nature of the functions shared between the connected phages. This functionality of the module-based network runs at the expense of missing links due to genes that are not part of modules, but which are encoded in the first network.

These approaches can easily focus on pre-defined modules for tracing one or several traits across the population. They provide an automatic and dynamic way to study relationships within the phage population. Moreover, they can be extended to the representation of populations of other mobile genetic elements or even to the entire mobilome.

Finally, to enrich the phage sequence space, which in turn allows for a better assessment of phage diversity and evolution, I devise a prophage prediction tool. With this methodology, approximately 800 prophages are predicted in 266 among 800 replicons screened. The comparison of a subset of these predictions with a manually annotated set shows a sensitivity of 79% and a positive predictive value of 91%, this later value suggesting that the procedure makes few false predictions. The preliminary analysis of the predicted prophages indicates that many may constitute novel phage types.

This work allows tracing guidelines for the classification and analysis of other mobile genetic elements. One can foresee that a pool of putative mobile genetic elements sequences can be extracted from the prokaryotic genomes and be further broken down in groups of related elements and evolutionary conserved modules. This would allow widening the picture of the evolutionary and functional relationships between these elements.

Références

Documents relatifs

destinations in a network can be obtained by recognizing four kinds of named objects at or leading to every destination (services, nodes, attachment points, and routes) and

Application A1 on host H1 and application A2 on host H2 first get in contact using the standard IP over ATM to exchange their ATM address (atm1, atm2) and to define a

Calls placed by a Merit member/affiliate user to these external dial-in services are authenticated by having each of those services forward RADIUS authentication requests

The optional javaCodebase attribute is used to store the locations of the object factory’s and the object’s class definitions.. A directory entry containing a JNDI reference

This document and the information contained herein is provided on an "AS IS" basis and THE INTERNET SOCIETY AND THE INTERNET ENGINEERING TASK FORCE DISCLAIMS

Por favor, seleccione los marcos azul para acceder al

In addition to lacking key functions required by a phage, the RcGTA structural gene cluster lacks genes likely to encode functions necessary for its gene transfer activity, such as

Various behavioral techniques have been employed and researched in terms of enhancing the social behavior of children with ASD such as: trial training, making use of stereotypic