• Aucun résultat trouvé

Translating MeSH Concepts.

N/A
N/A
Protected

Academic year: 2021

Partager "Translating MeSH Concepts."

Copied!
2
0
0

Texte intégral

(1)

HAL Id: inserm-00854292

https://www.hal.inserm.fr/inserm-00854292

Submitted on 27 Aug 2013

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Translating MeSH Concepts.

Lina Soualmia, Catherine Letord, Tayeb Merabti, Nicolas Griffon, Jacques Manel, Stéfan Darmoni

To cite this version:

Lina Soualmia, Catherine Letord, Tayeb Merabti, Nicolas Griffon, Jacques Manel, et al.. Translating MeSH Concepts.. Studies in Health Technology and Informatics, IOS Press, 2013, 192, pp.1007.

�inserm-00854292�

(2)

Translating MeSH Concepts

Lina F. Soualmia

a,b

, Catherine Letord

a

, Tayeb Merabti

a

, Nicolas Griffon

a

, Jacques Manel

c

, Stéfan J. Darmoni

a,b

a

LITIS-TIBS EA 4108 & CISMeF, Rouen University Hospital, Rouen, France.

b

INSERM, UMR_S 872, ICS, équipe 20, Paris, France.

c

Poisons and Toxicovigilance Centre, Nancy University Hospital, Nancy, France.

Abstract and Objective

The concept-oriented structure of the MeSH® thesaurus is not yet in common use. Nevertheless, it has been shown that a concept-based querying of PubMed may be of interest. To take full advantage of the concept-oriented structure of MeSH in the information retrieval tool associated with the CISMeF catalogue, it was necessary to translate such concepts into French.

Keywords:

MeSH Headings; Translations; Controlled Vocabularies;

Introduction

A recent study [1] has demonstrated the interest of using MeSH Concepts for improving information retrieval. To index documents with MeSH Concepts instead of MeSH Descriptors or MeSH Supplementary Concepts, MeSH Concepts need to be translated into French to be used in the CISMeF catalogue.

Material and Methods

In addition to "broader", "narrower" and "related" relations, certain MeSH Concepts may have other attributes: a Registry Number (RN) or a Related Registry Number (RRN) which may correspond to the Chemical Abstracts Service (CAS) Registry Numbers. Each MeSH Concept may have a RN. In this case, the RN is the CAS Number of the Concept itself.

Only MeSH Concepts that are the preferred concept for a MeSH Descriptor or a MeSH Supplementary Concept may have a RRN. In this case, the RRN is the CAS Number of a subordinate Concept. Many MeSH Concepts (n=81,255;

25.1%) have a CAS Number. The CISMeF team has also inte- grated into the HeTOP [2] a French terminology on main toxic substances for humans (BNPC French acronym). It contains 91,750 terms in French.

Mapping MeSH Concepts and BNPC terms

The most straightforward method consisted in a simple join between MeSH Concept and BNPC term using CAS Number.

It was then possible to automatically translate this MeSH Con- cept into French and the corresponding BNPC term into Eng- lish. When RRN is involved, a "see also" relation between two terms from two different terminologies: the MeSH Descriptor or the MeSH Supplementary Concept, which has a RRN (CAS Number) and the BNPC term with the same CAS Number.

Simple Natural Language Processing Method

The normalization program (“Norm”) which is provided with the Lexical Tools of the UMLS was used in this second meth- od. The MeSH Concept Labels in English were normalized and compared to all the other normalized terms inthe HeTOP portal. When an exact match was found, the translation of the French target term was proposed as one possible translation of the MeSH Concept Label.

Results

Thanks to the mappings based on the CAS Number, 13,440 MeSH Concepts were translated into French. These MeSH Concepts might be:

- preferred concept for MeSH Descriptors for 2,046 of them (7.7%),

- preferred concept for MeSH Supplementary Concept for 9,174 of them (4.5%),

- non-preferred or subordinate concept for MeSH Supple- mentary Concept or MeSH Descriptor for 2,220 of them (2.4%) and the creation of 2,220 "see also" relationships between a BNPC term and a MeSH Descriptor or MeSH Supplementary Concept.

Overall, 13,440 BNPC terms were also translated into English.

The NLP method allowed the translation of 8,701 MeSH Con- cepts: 3,460 preferred concepts for MeSH Descriptors, 3,455 preferred concepts for MeSH Supplementary Concept and 1,786 non-preferred concepts.

As 52,996 MeSH Concepts had already been manually trans- lated before this study there are now 65,621 MeSH Concepts out of 324,191 (20.2%), which have been translated into French. Moreover, 9,516 new synonyms have been added.

References

[1] Darmoni SJ, Soualmia LF, Letord C, et al. Improving in- formation retrieval using MeSH Concepts: a test case on rare and chronic diseases. Journal of Medical Library As- sociation 2012; 100(3): 176–183.

[2] Merabti T, Soualmia LF, Grosjean J, Joubert M, Darmoni SJ : Aligning biomedical terminologies in French: towards semantic interoperability in medical applications. Chapter in Book Medical Informatics, InTech, 2012, 41–68.

MEDINFO 2013 C.U. Lehmann et al. (Eds.)

© 2013 IMIA and IOS Press.

This article is published online with Open Access by IOS Press and distributed under the terms of the Creative Commons Attribution Non-Commercial License.

doi:10.3233/978-1-61499-289-9-1007

1007

Références

Documents relatifs

In general, during each iteration of the Parks-McClellan algorithm, we have estimates about the location and number of extrema of E; if we take a subinterval defined by two

We describe a method to compute conformal parameteriza- tions with a natural boundary based on a simple differentiable expression measuring angles between edges by using

Neumann or Robin boundary conditions â First results with a mixed formulation Dynamic equation : heat equation. Michel Duprez Rule of the mesh in the Finite Element

The proposed method is compared to Cho’s [11] and Zhan’s [8] methods in terms of imperceptibility using MRMS as well as robustness in terms of correlation against noise

Figure 6 shows the original objects and the watermarked ones. It can be seen that the distortion is negligible and cannot be noticed by the human eye thanks to the use mesh saliency

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come

This section details the two main steps of the surface remeshing method: the optimization of a given number of sampling points on the model (Section 3.2.3.1) and the building of

Some sarcomas following breast cancer may be due to shared genetic components (i.e., in the Li–Fraumeni syn- drome), as well as possibly to shared environmental factors, with