Approximate Clusters, Biclusters and n-Clusters in the Analysis of Binary and General Data Matrices

Boris G. Mirkin

National Research University Higher School of Economics, Moscow, Russia

Abstract. Approximate cluster structures are those of formal concepts and n-concepts with added numerical intensity weights. The talk presents theoretical results and computational methods for approximate clustering and n-clustering as extensions of the algebraic-geometrical properties of numerical matrices (SVD and the like) to situations where one or most of the elements of the solutions to be found are expressed by binary vectors. The theory embraces such methods as k-means, consensus clustering, network clustering, biclusters and triclusters, and provides natural data analysis criteria, effective algorithms and interpretation tools.

Keywords: Approximate clusters, biclusters, n-clusters, Formal Concept Analysis
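
The following sketch is an illustration only, not taken from the talk. It assumes one common reading of an approximate bicluster: a data matrix entry a_ij is modelled as λ·u_i·v_j, where u and v are binary membership vectors for rows and columns and λ is the intensity weight. For fixed u and v, the least-squares optimal λ is the mean of the entries inside the bicluster, and the squared error decreases by λ²·|u|·|v|. The function names and the toy matrix are hypothetical.

```python
def box_intensity(matrix, rows, cols):
    """Least-squares optimal intensity of the bicluster rows x cols:
    the mean of the entries inside it."""
    total = sum(matrix[i][j] for i in rows for j in cols)
    return total / (len(rows) * len(cols))


def squared_error(matrix, rows, cols, lam):
    """Sum of squared residuals of the model a_ij ~ lam * u_i * v_j,
    where u and v are the binary indicators of rows and cols."""
    row_set, col_set = set(rows), set(cols)
    err = 0.0
    for i, row in enumerate(matrix):
        for j, a in enumerate(row):
            fitted = lam if (i in row_set and j in col_set) else 0.0
            err += (a - fitted) ** 2
    return err


# Toy binary matrix (hypothetical data): an almost-full 3 x 2 block of ones.
data = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 1],
]
rows, cols = [0, 1, 2], [0, 1]

lam = box_intensity(data, rows, cols)
scatter = sum(a * a for row in data for a in row)   # total data scatter
err = squared_error(data, rows, cols, lam)          # unexplained part
contribution = lam ** 2 * len(rows) * len(cols)     # explained part

print(lam, scatter, contribution, err)
```

With the optimal λ, the data scatter splits into the bicluster's contribution plus the residual error, which is the kind of SVD-like decomposition the abstract alludes to. An exact formal concept corresponds to the special case where every entry inside the block equals one.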

References

1. Mirkin, B.G., Rostovtsev, P.S.: Method for revealing associated feature subsets. In Mirkin, B., ed.: Models for Summarization of Socio-Economic Data (Metody Agregirovania Sotsial’no-Economitcheskoi Informatsii), Novosibirsk: Institute of Economics Press (1978) 107–112 (in Russian).

2. Ignatov, D.I., Gnatyshak, D.V., Kuznetsov, S.O., Mirkin, B.G.: Triadic formal concept analysis and triclustering: searching for optimal patterns. Machine Learning 101(1-3) (2015) 271–302
