• Aucun résultat trouvé

Rough Sets and Formal Concept Analysis: Foundations and the Case Studies of Feature Subset Selection and Knowledge Structure Formation

N/A
N/A
Protected

Academic year: 2022

Partager "Rough Sets and Formal Concept Analysis: Foundations and the Case Studies of Feature Subset Selection and Knowledge Structure Formation"

Copied!
1
0
0

Texte intégral

(1)

Rough Sets and Formal Concept Analysis:

Foundations and the Case Studies of Feature Subset Selection and Knowledge Structure

Formation

Dominik ´Sl¸ezak Infobright Inc., Canada/Poland

Abstract.The theories of Rough Sets (RS) and Formal Concept Analysis (FCA) are well-established from the point of view of both mathematical foundations and real-life applications. The interest in searching for similarities and dissimilarities between RS and FCA has been constantly growing, both with respect to pure theory, as well as with an objective of developing hybrid techniques, better adjusted to practical problems.

In this talk, we outline introductory notions of RS and we draw basic lines of its comparison with FCA. As the first case study, we consider the KDD-related problem of feature subset selection and show how to model some new approaches to approximate selection (a more flexible and more practically applicable extension of the classical RS-based feature subset selection principles) in the FCA terminology. As the second case study, we consider the latest Infobright’s open source data warehouse platform (www.infobright.org) and we discuss possibilities of improving its performance by using new RS-FCA-based knowledge structures automatically calculated from data.

Références

Documents relatifs

We consider also the paving problem and give a deterministic algorithm to partition a matrix into almost isometric blocks recovering previous results of Bourgain-Tzafriri and

We found that the two variable selection methods had comparable classification accuracy, but that the model selection approach had substantially better accuracy in selecting

The problem of well conditioned column selection that we consider consists of finding the largest subset of columns of X such that the corresponding submatrix has all singular values

Due to the deficiency of the original fuzzy-rough dependency degree in removing redundant features, we proposed a new measure for feature selection based on the

Nevertheless, a number of generic feature construction methods exist: clustering; linear (affine) projections of the original feature space; as well as more sophisti- cated

In more detail, for low values of α, (from 0.005 to 0.11), all the feature sets resulting from the new proposed method lead to better LR performance in com- parison to the feature

knowledge, the algorithm providing the best approximation is the one proposed by Grabisch in [20]. It assumes that in the absence of any information, the most reasonable way of

Construct validity indicates whether the studied operational measures really represent what is investigated according to the research question. The purpose of this study is to