Semantic Learning for Trend Recognition in Text Collections

Academic year: 2022

Semantic-based Learning for Trend Mining in Text Collections

Olga Streibel

Networked Information Systems, Free University Berlin, Königin-Luise-Str. 24-26, 14195 Berlin, Germany

[email protected] http://www.ag-nbi.de

Emerging trends can be determined and detected early from numeric data as well as from texts [2]. However, using text collections for trend detection requires new analysis techniques. Although many interesting approaches have been developed in the field of Trend Mining on texts, defined as Emerging Trend Detection in Text Mining [1], they still lack the integration of expert knowledge into the process of trend recognition. Such knowledge is crucial for proper trend detection in various domains, e.g. medical diagnosis, opinion mining, and financial markets.

Our research addresses trend detection in text collections. We concentrate on developing a novel, knowledge-based learning approach for the automatic detection of trends in text streams. At this stage of our work, we are concerned with two main research issues: the representation of trend knowledge and a knowledge-integrating learning approach for trend detection.

Knowledge about emerging trends is hard to define. It relies on knowledge of the given domain, including experts’ experience in detecting trends in that domain. Domain knowledge can be expressed in language, that is, in formal definitions of domain concepts and the relations between these concepts. The Semantic Web offers ontologies as a technology for knowledge formalization. However, the classic ontology approach was created under the assumption that hierarchical, static relations describe knowledge, and it can therefore be applied successfully in domains of a taxonomic character (e.g. life sciences). Knowledge about trends is more intuitive, dynamic, context- and time-dependent, and subjective. We assume that the formalization of trend knowledge requires novel lightweight formalization techniques, such as extreme tagging [4] (see footnote 1), that allow non-hierarchical relations and enhanced semantics to be captured collectively. For this reason, we concentrate on using the Extreme Tagging Systems approach to gather and formalize the knowledge needed for trend detection.
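As a rough illustration of the idea of tagging tags, and not the Extreme Tagging System described in [4], the following Python sketch stores tag assignments as a directed graph in which a tag can itself receive tags. The class name `TagGraph`, its methods, and the example tags are all hypothetical and chosen only for this illustration.

```python
from collections import defaultdict


class TagGraph:
    """Hypothetical store in which any item, including a tag, can be tagged."""

    def __init__(self):
        # Maps an item (a resource or a tag) to the set of tags assigned to it.
        self._tags = defaultdict(set)

    def tag(self, item, label):
        """Assign `label` to `item`; `item` may itself be a tag."""
        self._tags[item].add(label)

    def tags_of(self, item):
        """Return the tags assigned directly to `item`."""
        return set(self._tags.get(item, ()))

    def related(self, item):
        """Return every tag reachable from `item` by following tag assignments."""
        seen = set()
        stack = [item]
        while stack:
            current = stack.pop()
            for label in self._tags.get(current, ()):
                if label not in seen:
                    seen.add(label)
                    stack.append(label)
        return seen


# Made-up example data: a resource is tagged, then its tags are tagged in turn.
graph = TagGraph()
graph.tag("news-article-1", "oil price")
graph.tag("oil price", "rising")
graph.tag("oil price", "energy market")
graph.tag("rising", "trend indicator")
```

Because the links are plain tag assignments and not is-a relations, the resulting structure is a general graph instead of a taxonomy, which is the non-hierarchical property the paragraph above relies on.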

Given the chosen knowledge representation paradigm and the different learning approaches from Artificial Intelligence [3], we are working on a proper definition of a knowledge-integrating learning approach for trend detection in texts. Considering several statistical learning methods, we aim to apply the appropriate method to financial market texts in order to enable trend detection in this domain.

1 http://www.corporate-semantic-web.de/extreme-tagging.html

References

1. April Kontostathis, Leon Galitsky, William M. Pottenger, Soma Roy, and Daniel J. Phelps. A Survey of Emerging Trend Detection in Textual Data Mining. Springer-Verlag, 2003.

2. Victor Lavrenko, Matt Schmill, Dawn Lawrie, Paul Ogilvie, David Jensen, and James Allan. Mining of concurrent text and time series. In Proceedings of the 6th ACM SIGKDD Int’l Conference on Knowledge Discovery and Data Mining Workshop on Text Mining, pages 37–44, 2000.

3. Stuart Russell and Peter Norvig. Artificial Intelligence: A Modern Approach. Prentice Hall International, 2003.

4. Vlad Tanasescu and Olga Streibel. Extreme tagging: Emergent semantics through the tagging of tags. In ESOE, pages 84–94, 2007.
