• Aucun résultat trouvé

Semantic integration and exploitation of orthology information and genetic disorders

N/A
N/A
Protected

Academic year: 2022

Partager "Semantic integration and exploitation of orthology information and genetic disorders"

Copied!
2
0
0

Texte intégral

(1)

Semantic Integration and Exploitation of Orthology Information and Genetic Disorders

J. A. Miñarro-Giménez1, Marisa Madrid2, J. T. Fernández-Breis1

1Departamento de Informática y Sistemas, Universidad de Murcia, Spain

2Cell Division Group, Paterson Institute for Cancer Research, CR-UK, University of Manchester, UK

1{jose.minyarro,jfernand}@um.es

2Mmadrid@picr.man.ac.uk

Abstract. Translational bioinformatics includes research on the devel- opment of novel techniques for the integration of biological and clinical data and the evolution of clinical informatics methodology to encompass biological observations. In this way, the integration of information about gene-related diseases with information about gene orthology would be very helpful for clinical investigations.

Keywords: Translational bioinformatics, Semantic integration, Genetic disorders, Orthology information, Semantic Web Technology.

1 Methodology and Results

In this work, we address the semantic integration of genetic disorders information provided by the Online Mendelian Inheritance in Man (OMIM) [1] and the OGO ontological repository of orthology-related information and knowledge [2], which was recently developed by our research group.

First, the global domain ontology was obtained through the reuse of the OGO ontology and its subsequent extension with the concepts, relations, prop- erties and restrictions of the domain of genetic diseases. The inclusion of domain knowledge of genetic diseases with domain knowledge of orthologous genes and proteins allows the relations between individuals to be analyzed for translational bioinformatics.

The genetic diseases information was integrated into the OGO repository taking the relationships and restrictions of the domain into account in order to dene the mapping rules. Thus, the information is properly categorized in the ontology, allowing only the instantiation of the relationships dened in the ontology and controlling the integration process by means of checking the dened restrictions and properties of the domain.

A semantic query interface is provided in order to exploit the integrated repository. This interface provides two query methods. One method allows to query the repository by means of gene identication and to dene conditions in the query like the organism and repository resource to look into. The retrieved results contain the orthologous genes and its associated genetic diseases links.

(2)

Following these genetic diseases links more information about genetic diseases can be retrieved.

The other method consists in searching by genetic diseases names; the result contains the properties information dened in the ontology and the references to related genetic diseases, the PubMed articles with information about them and the genes references that are involved in the genetic diseases and provides links to its orthologous genes information.

2 Conclusions

As a result of this work the addition of genetic diseases knowledge domain in the OGO ontology was obtained. Since the ontology contains the formal concep- tualization of the orthology and genetic disease domain it is possible to share the knowledge model and reuse it in other systems. Due to properties and re- strictions, the domain ontology allows us to perform more complicated reasoning than other formal knowledge representation. This integration associates seman- tically the genes that are involved in a human genetic disorder with those genes that are orthologous to from other organisms, what can facilitate the labour of researchers.

Using a global ontology that cover the domain of the information resources facilitates the integration process and reduces information heterogeneity. The semantic integration is performed by dening mapping rules that take into ac- count the domain knowledge and facilitating the evaluation of the consistency of the integrated repository.

Acknowledgments. Jose Antonio Miñarro is supported by the Fundación Séneca through the fellowship 07836/BPS/07. Marisa Madrid is supported by the EMBO Long-Term Fellowship ALTF 212-2008. This work has been possible thanks to the Spanish Ministry for Science and Innovation through the project TSI2007-66575-C02-02.

References

1. OMIM: Online mendelian inheritance in man, mckusick-nathans institute of genetic medicine, johns hopkins university (baltimore, md) and national cen- ter for biotechnology information, national library of medicine (bethesda, md).

http://www.ncbi.nlm.nih.gov/omim/

2. Miñarro-Gimenez, J.A., Madrid, M., Fernández-Breis, J.T.: Ogo: an ontological ap- proach for integrating knowledge about orthology. BMC Bioinformatics 10:S10:S13 (2009)

Références

Documents relatifs

Knowledge of the genetic origin of Robusta cultivated varieties in countries as important as Vietnam and Mexico is therefore of high interest.. Through the use of the DArTseq method

- Given the similar characteristics of the climatic areas and relatively high altitude where Robusta is grown in Uganda, Mexico and Vietnam, and their common genetic origin, we

2 and 3, where blue represents cultivated individuals from Congo, Uganda, Vietnam and Mexico, known to belong to the Robusta Congo – Uganda group; Orange and yellow represent

[r]

Combiné en proportions convenables avec des plastifiants, des stabilisants et des pigments, ce matériau a été utilisé pendant un certain nombre d'années pour

BA: bile acid; BMI: body mass index; BMR: basal metabolic rate; CA: cholic acid; CDCA: chenodeoxycholic acid; DCA: deoxycholic acid; DIO-2: type 2 iodothyronine deiodinase;

des raisins secs– des yeux– une cerise– un nez– un grand sourire–. des grains de café– les boutons de son habit -

La primera Dirección es la de Coordinación Permanente de Contingencias (COPECO) y la segunda la de Coordinación Permanente de Gestión de Riesgos (COPEGER). La COPECO es la