Clustering transformed compositional data using K-means, with applications in gene expression and bicycle sharing system data
Texte intégral
Documents relatifs
The Spe3dLab software has been jointly developed by three French teams: a bioinformatics laboratory (LaTIM, Inserm U1101), a private partner specialized in health software (Tekliko)
L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des
Two samples containing precisely the same cellular types (e.g. coming from the same tissue in the same patient) can have two different profiles of gene expression simply because
In this paper, we show a direct equivalence between the embedding computed in spectral clustering and the mapping computed with kernel PCA, and how both are special cases of a
It has been shown that iterated watersheds performs better than methods such as spectral clustering, isoperimetric partitioning, and K-Means on various measures.. To illustrate
Finally, for the mixed annotation u C , ICAL most frequently selects three clusters (79 times) or four clusters (21 times). Because the annotation u C is mixed, there is less
The weighted-kappa score for consensus clustering (solid line) calculated by comparing consensus clusters to the corresponding individual clustering algorithm is shown relative
In this section, we provide k-means accounts for judgmental contrast observed on multiple distribution shapes, and for a variety of interaction effects pertaining to the number of