• Aucun résultat trouvé

Text Mining in Biograph

N/A
N/A
Protected

Academic year: 2022

Partager "Text Mining in Biograph"

Copied!
1
0
0

Texte intégral

(1)

Text Mining in Biograph Walter Daelemans

CLiPS, University of Antwerp, Belgium [email protected]

The Biograph project is a biomedical knowledge discovery project combining graph data mining with structured biomedical information and with text mining on medline abstracts. It is a cooperation between the molecular genetics, data mining, and computational linguistics research groups of the University of Antwerp. In this talk, I will outline the general architecture of the system, which is currently applied to the problem of ranking genes according to relevance for diseases (gene prioritization). I will describe the text mining component (rule induction combined with rule certainty and confidence measurement using among others modality and negation analysis), and show results both on text mining benchmarks and gene

prioritization benchmarks. The approach is more generally applicable than for gene prioritization only; I will show how the approach can be interpreted as a general question answering engine.

Références

Documents relatifs

The first tool identifies numeric properties in the patent text and normalises them, the second extracts a list of keywords that are relevant and reveal the invention in the patent

The presented system executes the preprocessing of test documents, and generates hypotheses consisting of the question text combined with text from possible answers for the

Two runs use Term Frequency-Inverted Document Frequency (TF-IDF) of the query words to weight the sentences.. The two unique runs differ in the way that when multiple answers get

Several large-scale terminological resources are used by the OntoGene system in order to detect names of relevant domain entities in biomedical literature (proteins, genes,

We examined only the top five most relevant parts of the publication for each concept and had an autism ontology expert review these text sections to determine

Nev- ertheless, Big Data technologies and massive clustering techniques are now possible so that the release of a TREC style collection, that is still missing, will help the IR and

In response to the above, text mining (TM) aims to automate the above process, by finding relations (such as interactions) that hold between concepts of different types

Abstract— We present an ongoing effort on utilizing text mining methods and existing biological ontologies to help readers to access the information contained in the