Documents relatifs
OCR error rates show that also that blurring and character degradation are the most critical noise for digitized documents; they generated the highest error rate at both character
In this article, we focus on old French texts to evaluate the impact of manual and automatic normalisation before applying five geographical named entity recognition tools, as well
We now describe MV polytopes for the only other rank two affine root system. One could perhaps rework the proof as in the sl b 2 case and directly prove that the MV polytopes
The agglomerative clustering method (using Doc2Vec model for feature extraction) and a simple named entity linking algorithm based on regular expressions were used
Named entity recognition applied on a data base of Medieval Latin charters.. The case
In Section 5, we give a specific implementation of a combined MRG of this form and we compare its speed with other combined MRGs proposed in L’Ecuyer (1996) and L’Ecuyer (1999a),
This paper shows how text-based ontology acquisition meth- ods can be enriched by taking all types of domain specific textual units into account, named entities as well as terms,
L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des