• Aucun résultat trouvé

An enrichment model for wheat gene annotations using phylogeny, orthology and existing gene ontologies in 9 plant species

N/A
N/A
Protected

Academic year: 2021

Partager "An enrichment model for wheat gene annotations using phylogeny, orthology and existing gene ontologies in 9 plant species"

Copied!
2
0
0

Texte intégral

(1)

READ THESE TERMS AND CONDITIONS CAREFULLY BEFORE USING THIS WEBSITE.

https://nrc-publications.canada.ca/eng/copyright

Vous avez des questions? Nous pouvons vous aider. Pour communiquer directement avec un auteur, consultez la

première page de la revue dans laquelle son article a été publié afin de trouver ses coordonnées. Si vous n’arrivez pas à les repérer, communiquez avec nous à PublicationsArchive-ArchivesPublications@nrc-cnrc.gc.ca.

Questions? Contact the NRC Publications Archive team at

PublicationsArchive-ArchivesPublications@nrc-cnrc.gc.ca. If you wish to email the authors directly, please see the first page of the publication for their contact information.

NRC Publications Archive

Archives des publications du CNRC

This publication could be one of several versions: author’s original, accepted manuscript or the publisher’s version. / La version de cette publication peut être l’une des suivantes : la version prépublication de l’auteur, la version acceptée du manuscrit ou la version de l’éditeur.

Access and use of this website and the material on it are subject to the Terms and Conditions set forth at An enrichment model for wheat gene annotations using phylogeny, orthology and existing gene ontologies in 9 plant species

Tulpan, Dan; Léger, Serge; Tchagang, Alain B.; Pan, Youlian

https://publications-cnrc.canada.ca/fra/droits

L’accès à ce site Web et l’utilisation de son contenu sont assujettis aux conditions présentées dans le site LISEZ CES CONDITIONS ATTENTIVEMENT AVANT D’UTILISER CE SITE WEB.

NRC Publications Record / Notice d'Archives des publications de CNRC:

https://nrc-publications.canada.ca/eng/view/object/?id=dd263013-adbd-4878-ae67-379efbf7e787 https://publications-cnrc.canada.ca/fra/voir/objet/?id=dd263013-adbd-4878-ae67-379efbf7e787

(2)

An enrichment model for wheat gene annotations using

phylogeny, orthology and existing gene

ontologies in 9 plant species

Introduction

Genome sequencing efforts for the Triticum aestivum genome produce massive amounts of contigs, preliminary assemblies and putative genes/proteins, nevertheless their annotation is still in its infancy. Given the much larger percentage of annotated genes in other previously sequenced plant genomes such as Arabidopsis thaliana and Oryza sativa and the known phylogenetic and orthology relationship among these plant species and their corresponding genes, we propose an enrichment model that will further expand the horizon of wheat gene annotations.

Dan Tulpan, Serge Léger, Alain Tchagang, Youlian Pan

Information and Communication Technologies, National Research Council, Canada

{dan.tulpan, serge.leger, alain.tchagang, youlian.pan}@nrc-cnrc.gc.ca

Canadian Wheat Alliance

OUR PARTNERS

References

[1] Tulpan, D., Leger, S., Tchagang, A., Pan, Y. Enrichment of Triticum aestivum gene annotations using ortholog cliques and gene ontologies in other plants. BMC Genomics

2015, 16:299.

[2] Niskanen, S., Östergård, P.R.J. Cliquer User's Guide, Version 1.0. Communications Laboratory, Helsinki University of Technology, Espoo, Finland, Tech. Rep. T48, 2003.

[3] Patel, R.V., Nahal, H.K., Breit, R., Chen, Y., Provart, N.J. BAR Expressolog identification: expression profile ranking of “orthologous” genes in plant species. The Plant

Journal 2012, 71:1038–1050.

[4] Supek, F., Bošnjak, M., Škunca, N., Šmuc, T. REVIGO Summarizes and Visualizes Long Lists of Gene Ontology Terms. PLoS ONE 2011, 6(7):e21800.

The gene annotation enrichment model

Our sequences and annotations base includes data from Ensembl Plants for wheat and 9 other plant species: Aegilops tauschii, Arabidopsis

thaliana, Brachypodium distachyon, Brassica rapa, Hordeum vulgare, Oryza sativa subsp. japonica, Sorghum bicolor, Triticum urartu and Zea mays. Orthology relationships between wheat genes and each of the 9 plant species are predicted using an in-house software package. Next,

ortholog cliques are identified such that each set of genes within a clique represents pairwise orthologs. Using the phylogenetic distances between wheat and each plant species to quantify the level of confidence for gene ontology assignments within each ortholog clique, new gene annotations are assigned to wheat genes such that either novel or more specific GO terms are associated with those genes.

Gene ontologies

Gene annotations

Phylogenetic

information

Protein sequences

DNA sequences

Orthology prediction

OrthoPred [1]

Cliques of orthologs

Ortholog cliques discovery

Cliquer [2]

1-to-1 pairwise

orthologs

GO scores

GO term selection criteria

≥ 0.5; longer depth; unique path

Novel GO terms-to-gene

assignments

Results

Overall, based on clique size equal to or larger than 3, our model enriched the existing gene-GO term associations for 7,838 (8%) wheat genes, of which 2,139 had no previous annotation. For the particular case of ortholog cliques of size 10 (13 in total) where all 10 genes within a clique are tightly connected via pairwise orthology, 85 new and more specific GO terms were identified, which represent a 65% increase compared with the previously 130 known GO terms. These observations are further supported for 4 out of the 10 plant species considered in this work by experimental evidence using expressologs [3].

ReviGO [4] display of 96 Biological Process GO terms for the cliques of size 10.

ReviGO [4] display of 29 Molecular Function GO terms for the cliques of size 10.

GO

s

c

ore

GO scores for wheat genes with newly assigned GO terms. GO ter m path depth

GO term path depth for wheat genes with newly assigned GO terms. T. aestivum, T. urartu 1 A. tauschii, H. vulgare 2 B. distachyon 3 O. sativa 4 S. bicolor, Z. mays 5 A. thaliana, B. rapa 6

Références

Documents relatifs

The Functional Transfer of the Wheat Gene Lr34 into Rice and Its Resistance Against Rice Blast Justine Sucher , University of Zurich, Zurich, Switzerland.. Simon Krattinger

Construis, en bleu, l'image du cercle de centre O par la translation qui transforme B en A.. Construis, en vert, l'image du cercle de centre O par la symétrie de

axioms, but GO also contains a rich set of inter-ontology axioms, leveraging ex- ternal ontologies (The plant ontology, Uberon, the Cell type ontology, and the CHEBI ontology

individuals concordantly for microsatellite and CRTISO data, while the orange group was composed of individuals mainly assigned to the Western genetic group, with some

I will successively discuss four episodes in this complex history: the early observations on pseudoalleles and gene complexes, and the mechanistic models provided to explain

It is a pleasant surprise to find that we are able to propose an answer to this and other difficult evolutionary questions, such as: Why, for example, do

The main modeling issue we consider in this paper is the treatment of the proteins in terms of their feedback interactions with one isolated gene, i.e., the regulation of a single

(ii) Gene type = def A role holder played by a genomic segment, which has both of the two kinds of ge- netic information, i.e., information for self-