• Aucun résultat trouvé

ViSEAGO: Easier data mining of biological functions organized into clusters using Gene Ontology and semantic similarity

N/A
N/A
Protected

Academic year: 2021

Partager "ViSEAGO: Easier data mining of biological functions organized into clusters using Gene Ontology and semantic similarity"

Copied!
2
0
0

Texte intégral

(1)

HAL Id: hal-02786990

https://hal.inrae.fr/hal-02786990

Submitted on 5 Jun 2020

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

ViSEAGO: Easier data mining of biological functions organized into clusters using Gene Ontology and

semantic similarity

Christelle Hennequet-Antier, Aurélien Brionne, Amélie Juanchich

To cite this version:

Christelle Hennequet-Antier, Aurélien Brionne, Amélie Juanchich. ViSEAGO: Easier data mining of biological functions organized into clusters using Gene Ontology and semantic similarity. Journées Ouvertes Biologie, Informatique et Mathématiques JOBIM 2019, Jul 2019, Nantes, France. 1 p, 2019.

�hal-02786990�

(2)

ViSEAGO: Easier data mining of biological functions organized into clusters using Gene Ontology and semantic similarity

Christelle H

ENNEQUET

-A

NTIER

, Aurélien B

RIONNE

and Amelie J

UANCHICH 1§

1

BOA, INRA, Université de Tours, 37380, Nouzilly, FRANCE

§

Contributed equally to this work

Corresponding Author: Christelle.hennequet-antier@inra.fr

Abstract

The main objective of ViSEAGO workflow is to carry out a data mining of biological functions and establish links between genes involved in the study. We developed ViSEAGO in R to facilitate functional Gene Ontology (GO) analysis of complex experimental design with multiple comparisons of interest. It allows to study large- scale datasets together and visualize GO profiles to capture biological knowledge. The acronym stands for three major concepts of the analysis: Visualization, Semantic similarity and Enrichment Analysis of Gene Ontology. It provides access to the last current GO annotations, which are retrieved from one of NCBI EntrezGene, Ensembl or Uniprot databases for available species. ViSEAGO extends classical functional GO analysis to focus on functional coherence by aggregating closely related biological themes while studying multiple datasets at once. It provides both a synthetic and detailed view using interactive functionalities respecting the GO graph structure and ensuring functional coherence supplied by semantic similarity.

ViSEAGO has been successfully applied on several datasets from different species with a variety of biological questions. Results can be easily shared between bioinformaticians and biologists, enhancing reporting capabilities while maintaining reproducibility.

ViSEAGO is publicly available on https://forgemia.inra.fr/umr-boa/viseago.

Références

Documents relatifs

The new field of Semantic Data Mining, which has emerged in the past few years, has suggested a possible solution to this problem: Domain knowledge can be derived from semantic

The advantages of agglomerative algorithms are that we don’t have to know the size and the number of modules and the result of these algorithms depend on the cho- sen

known as the “father of biodiversity” said: “It is reckless to suppose humanity

This paper deploys an ontology-based semantic similarity combined with self-organizing maps for studying the temporal evolution of co- hesion among CLL-related genes and the

The primary goal of this work was to enable new biologically interesting queries to be asked of the data in the Gene Expression Atlas by integration with external resources

Section 3 first describes a recent experiment to use different biomedical ontologies as reference ontologies without using semantic similarity to improve alignment results

Variation of the global F-score values when (A) varying the number of gen- erated fuzzy clusters K with the fuzzy C-Means algorithm using IntelliGO similarity measure and (B)

When compared with all measures in both tables 2 and 3, WP had the lowest SAQ value (rank of 15). If the ranking was not identical across all OA systems, a ranking was only