• Aucun résultat trouvé

Importing Knowledge Fragments to CMS-Enabled Data Mining Analytical Reports

N/A
N/A
Protected

Academic year: 2022

Partager "Importing Knowledge Fragments to CMS-Enabled Data Mining Analytical Reports"

Copied!
1
0
0

Texte intégral

(1)

Importing Knowledge Fragments to

CMS-Enabled Data Mining Analytical Reports

Andrej Hazucha, Tom´aˇs Kliegr, and Vojtˇech Sv´atek

University of Economics, Prague, {xhaza00,tomas.kliegr,svatek}@vse.cz

Descriptive data mining only brings its fruits when the results are provided to the end user in a palatable form. The vehicle for end-user delivery of mining results (and associated information such as data schema, task settings, and do- main background knowledge) are so-calledanalytical reports. In order to manage a huge number of reports referring to different mining sessions, we designed a data mining web portal based on acontent management system, together called SEWEBAR-CMS.1 One of the requirements on the CMS was the ability to in- teract withsemantic knowledge sources and other structured data, see [1].

The data analyst who authors an analytical report in the CMS has different possibilities of (semi-)automatically entering structured data into the text.

First, for locally stored data such as mining task/result/data descriptions exported from mining tools in PMML (Predictive Model Mark-Up Language), a CMS plugin can pick marked segments of HTML code, produced from PMML using XSLT, and insert them into the report as indicated by the analyst.

Second, sophisticated support for remote data/knowledge has been newly added. The infrastructure for this functionality allows to persistently specify

– Links to queriableresources

– Template queries for these resources (which can be paramatrized by the end-user at runtime)

– XSLT transformations allowing to insert the results of queries as HTML fragments, either static ordynamically updated from the resources.

Currently we experiment with queriable resources in the form of native XML database(Berkeley, queried via XQuery), which stores PMML data, and seman- tic knowledge bases both in the form ofSPARQL endpoint andOntopia Knowl- edge Suite (a Topic Maps tool, queried via a Prolog-like language called tolog).

Inclusion of further types of resources such as Lucene indices is in progress.

This work has been partially supported by the CSF project no.201/08/0802, and by Grant F4/15/2010 of the University of Economics, Prague.

References

1. Kliegr M., Ralbovsk´y M., Sv´atek, V, ˇSim˚unek M., Jirkovsk´y V., Nemrava J., Zem´anek J.: Semantic Analytical Reports: A Framework for Post-Processing Data Mining Results. In: Proc. ISMIS’09, Springer Verlag, LNCS, 2009, 8898.

1 SEWEBAR stands for SEmantic WEB and Analytical Reports. More details in http://sewebar.vse.cz.

Références

Documents relatifs

The original diff coefficient is combined with the original first step algorithm to re- produce the original procedure, and with the Selection Reintegration.. Finally the corrected

Analytical report is a free-text document describing various elements of an as- sociation rule mining task: the data, pre-processing steps, task setting and par- ticularly

■ Chez un vendeur d’outil d’analyse de données, il peut exporter des modèles destinés à être intégrer dans des environnements de scoring. ■ Dans une

 Écrire un rapport décrivant la méthode, les résultats obtenus et comparer ces résultats à ceux du modèle standard KXEN obtenu en 1/.. ■

« Le Data Mining est une discipline née en dehors de la statistique, dans la communauté des bases de données et de l’IA dans le but de valoriser les bases de données. Le Data

This book is about the tools and techniques of machine learning used in practical data mining for finding, and describing, structural patterns in data.. As with any burgeoning

Research on this problem in the late 1970s found that these diagnostic rules could be generated by a machine learning algorithm, along with rules for every other disease category,

The main body of the book is as follows: Part 1, Data Mining and Knowledge Discovery Process (two Chapters), Part 2, Data Understanding (three Chapters), Part 3, Data