• Aucun résultat trouvé

Towards Enabling the Semantic Access of Phenotypic Information in Clinical Letters

N/A
N/A
Protected

Academic year: 2022

Partager "Towards Enabling the Semantic Access of Phenotypic Information in Clinical Letters"

Copied!
2
0
0

Texte intégral

(1)

Towards Enabling the Semantic Access of Phenotypic Information in Clinical Letters

Shahad Kudama1, Rafael Berlanga1, Henry Houlden2Ernesto Jiménez-Ruiz3, Hallgeir Jonvik2, Adam Milward4, Huw Morris2, Mina Ryten4, Jake Saklatvala5,

Michael A. Simpson5, and Nicholas Wood2

1Universitat Jaume I, Spain 2University College London, UK 3University of Oxford, UK

4Genomics England, UK 5King’s College London, UK

1 Motivation

Over the past 3 years there has been a massive growth in genetic testing both in terms of the scope of testing and the numbers of individuals offered genetic testing. Targeted sequencing of small genomic regions has been replaced by panel testing, whole exome sequencing and most recently whole genome sequencing [4]. Furthermore, genetic test- ing in research, but also clinical settings has extended beyond small numbers of selected individuals with very rare, highly-defined disorders to cover larger populations.

While this information presents new clinical opportunities and opens the way for development of novel therapies, it also presents major challenges. For clinicians, re- liable identification of disease-associated genetic variants from amongst the broader background of variants present in all human genomes that are rare, but not actually pathogenic, is a concern. It is likely that for many rare genetic disorders obtaining clar- ity will require a worldwide effort and so the ability to capture and share key clinical information, as well as genetic information, is becoming increasingly important.

However, at present there are major challenges with regard to the collection and storage of clinical information, particularly in the context of rare genetic disorders. The process of studying a patient with a possible rare genetic disorder typically involves many different clinical specialists with no “standard” patient route. During this process, it is very common to refer from one specialist to another in order to obtain a range of opinions and access different tests. The output of this process is usually clinical let- ters which are used both to document the patient’s progress and communicate findings between specialists (e.g. patient history, examination findings, investigation results and clinical impression).

Since clinical letters are a key source of knowledge, their proper annotation and storage would enable access to the information they contain in a systematic way.

Currently, the creation and processing of clinical letters requires the following steps:

1. Letters are dictated using a speech recognition system by the clinician.

2. The recording is uploaded to a server.

3. The voice data is transcribed using another application.

4. The text is downloaded and checked by qualified personnel.

5. The letter is tagged manually by a specialist responsible for reading and annotating the interesting terms.

This final part of the process requires input from a qualified medical professional and is often time-consuming. This has lead to the relatively limited use of clinical letters and consequently the loss of potentially important information. Thus, suitable software support is required to assist the clinician in the process of annotating and processing clinic letters to make this a part of "normal" working practice.

(2)

Fig. 1.PHENOTAGmain interface

2 Proposed solution

In this paper we introduce PHENOTAG, a prototype software aimed at allowing the an- notation and storage of phenotypic information from clinical letters with the aim of im- proving the accuracy of genetic testing. PHENOTAGhas the added innovation of allow- ing specialists to visualise the information content of their letters and potentially check the quality of annotation within their normal workflow. Figure 1 shows an overview of PHENOTAG’s interface. The annotator underlying PHENOTAGis currently based on the approaches presented in [2, 5] and it can be used in conjunction with a series of coordinated vocabularies (e.g., [3]) making special emphasis in HPO concepts [1].

We envisage the following uses for this software:

1. The annotation and storage of HPO terms contained within clinic letters for the purposes of populating predefined disease-specific data models for a wide range of diseases (of the kind being developed by the 100,000 Genomes project).

2. The annotation and storage of HPO/other pre-defined terms contained within clinic letters for patients consented for research into the genetics of a single disorder (e.g.

Parkinson’s Disease).

3. The annotation and storage of HPO/other predefined terms contained within clinic letters in order to assess the information content and quality of clinical documenta- tion within a clinical genetics service.

References

1. The Human Phenotype Ontology project: linking molecular biology and disease through phe- notype data. Nucleic Acids Research 42(D1), D966–D974 (Jan 2014)

2. Berlanga, R., Nebot, V., Jiménez, E.: Semantic annotation of biomedical texts through concept retrieval. Procesamiento del Lenguaje Natural 45, 247–250 (2010)

3. Jiménez-Ruiz, E., Grau, B.C., Llavori, R.B., Rebholz-Schuhmann, D.: First steps in the logic- based assessment of post-composed phenotypic descriptions. In: SWAT4LS (2010)

4. Katsanis, S.H., Katsanis, N.: Molecular genetic testing and the future of clinical genomics.

Nat Rev Genet 14(6), 415–426 (Jun 2013)

5. Nebot, V., Berlanga, R.: Exploiting semantic annotations for open information extraction: an experience in the biomedical domain. Knowl. Inf. Syst. 38(2), 365–389 (2014)

Références

Documents relatifs

The Commission noted that, while its proposal to prohibit abusive representation had been prompted by a number of complaints about abusive representations of homosexuals, it had

We began introducing background theories and assumptions, particularly: (1) three fundamental postulates on which our vision of knowledge in the organization relies ([GRU

La classification des animaux mythiques proposée par Siganos et appliquée au bestiaire de Les Fleurs du Mal, montre bien que la grande majorité des images animales employées

repères spectraux ayant pour but de nous diriger autre part, ne sont que l’intimation du fait que Proust pense qu’il a une vérité à lui à exprimer sur la

In this article, we are presenting a semantic annotation tool for Arabic texts with a strategy adapted to the automatic location of reported information.. The method used

A partir de simulations de la zone convective solaire (dans laquelle les contours d’isorotation ne sont pas alignés avec l’axe de rotation mais de direction radiale) (e.g. Miesch et

In order to perform a semantic matching of pieces of text, LSA relies on large corpora of texts to build a semantic high-dimensional space containing all words and texts, by means of

Модуль, що відповідає за реалізацію основної логіки програми, складається з декількох класів (рис. 5), основні з яких наступні: Document – клас для збереження