Récemment recherché

Aucun résultat trouvé

Étiquettes

Aucun résultat trouvé

Document

Aucun résultat trouvé

Accueil Écoles Thèmes

Connexion

Key Phrase to Text Similarity, Clustering, and Interpretation in Hierarchical Ontologies

Partager "Key Phrase to Text Similarity, Clustering, and Interpretation in Hierarchical Ontologies"

N/A

N/A

Protected

Année scolaire: 2022

Info

Protected

Academic year: 2022

Partager "Key Phrase to Text Similarity, Clustering, and Interpretation in Hierarchical Ontologies"

Copied!

1

0

0

1

0

0

Chargement.... (Voir le texte intégral maintenant)

Télécharger maintenant ( 1 Page )

Texte intégral

(1)

Key Phrase to Text Similarity, Clustering, and Interpretation in Hierarchical Ontologies

Boris Mirkin

Applied Mathematics and Informatics, National Research University Higher School of Economics Moscow

Computer Science and Information Systems, Birkbeck University of London BMirkin@hse.ru

Abstract. Scoring similarity between key phrases and unstructured texts is an issue which is important in both information retrieval and text analysis. Researchers from the two fields use different scoring functions, although clear delineation between the two still is lacking. We use suffix tree based score expressing the average conditional probability of a sym- bol in a common substring. Usually, a domain taxonomy serves as the source of key-phrases. Given a set of entities, such as texts or projects or working groups, one can derive clusters of key-phrases using key-phrase- to-entity scores. The clusters represent common themes in the meaning of texts or in activities of working groups. To interpret them, the domain ontology should be used. If the ontology is a rooted tree, a lifting method is proposed to find the most parsimonious interpreting head subject(s), up to a few gaps and offshoots. Some applications and application is- sues are considered. The work is being conducted jointly with T. Fenner (London), S. Nascimento (Lisbon) and E. Chernyak (Moscow).

Références

Télécharger maintenant ( PDF - 1 Page - 62.81 KB )

Documents relatifs

On Similarity Prediction and Pairwise Clustering

Assuming randomly drawn training pairs, we prove tight upper and lower bounds on the achievable misclassification error on unseen pairs, the bridge between the two settings

Similarity Based Hierarchical Clustering with an Application to Text Collections

It allows a better scalability compared to the conventional AHC algorithm both in memory and processing time, and its results are deterministic unlike some aforementioned

Deciphering Text from Touchscreen Key Taps

In this paper, we present an attack method- ology that can be used to extract the text typed by a user from the sound recorded by the built-in microphones of a mobile phone.. We

CSMR: A Scalable Algorithm for Text Clustering with Cosine Similarity and MapReduce

In this paper a method for pairwise text similarity on massive data-sets, using the Cosine Similarity metric and the tf-idf (Term Frequency- Inverse

End-to-End Similarity Learning and Hierarchical Clustering for unfixed size datasets

Scores obtained demonstrate that models trained at the harder levels of difficulty are more ro- bust to noise and achieve better results.. As before, also in this case MLP is,

Extending Ontologies in the Nanotechnology Domain using Topic Models and Formal Topical Concept Analysis on Unstructured Text

In this paper, we use a phrase-based topic model approach and formal topical concept analysis on unstructured text in this domain to suggest additional concepts and axioms for

Semantic similarity: a key to ontology alignment

Semantic similarity is used to include similarity measures that use an ontology’s structure or external knowledge sources to determine similarity between

A Text Similarity Approach for Precedence Retrieval from Legal Documents

Lexical features are extracted from all the legal documents and the similarity between each current case document and all the prior case documents are determined using

Téléchargez tous les documents en téléchargeant vos documents d'étude.

Votre document sera enrichi, partagé sur 123dok FR pour vous aider à étudier.

Documents relatifs

Toward Scalable Hierarchical Clustering and Co-clustering Methods : application to the Cluster Hypothesis in Information Retrieval

Toward Scalable Hierarchical Clustering and Co-clustering Methods : application to the Cluster Hypothesis in Information Retrieval

175

0

0

Populating Legal Ontologies using Information Extraction based on Semantic Role Labeling and Text Similarity

Populating Legal Ontologies using Information Extraction based on Semantic Role Labeling and Text Similarity

229

0

0

Combining Statistical Information and Semantic Similarity for Short Text Feature Extension

Combining Statistical Information and Semantic Similarity for Short Text Feature Extension

7

0

0

Les systèmes alimentaires alternatifs face aux enjeux sanitaires

Les systèmes alimentaires alternatifs face aux enjeux sanitaires

45

0

0

الحجاج في التفسير القرآني، دراسة في مجالس التذكير من كـلام الحكيـم الخبيـر لـ: عبد الحميد ابن باديس

الحجاج في التفسير القرآني، دراسة في مجالس التذكير من كـلام الحكيـم الخبيـر لـ: عبد الحميد ابن باديس

14

0

0

Intégration financière internationale et croissance économique dans les pays émergents et en développement : le canal du développement financier

Intégration financière internationale et croissance économique dans les pays émergents et en développement : le canal du développement financier

36

0

0

en
fr

19

0

0

Un nouveau statut pour le régulateur des télécommunications dans un secteur en pleine mutation

Un nouveau statut pour le régulateur des télécommunications dans un secteur en pleine mutation

15

0

0