Effacer les filtres
Effacer les filtres

Can we use k-means clustering on a 100×34596 matrix?

5 vues (au cours des 30 derniers jours)
bamini thavarajah
bamini thavarajah le 9 Nov 2017
Commenté : Image Analyst le 5 Déc 2017
  • I have a feature matrix with size 100×34596, with class labels.
  • This matrix is very large because I want to cluster these matrix.
  • Is it possible?
  • I don't have experience about k-means.
  • Could you help me?*

Réponses (1)

Walter Roberson
Walter Roberson le 9 Nov 2017
Just make sure to pass data to kmeans such that the rows correspond to observations (points).
The class table will not be of any use for kmeans clustering: kmeans is a method of trying to figure out which values belong together without knowledge of the class labels.
  1 commentaire
Image Analyst
Image Analyst le 5 Déc 2017
Hopefully you have 34,596 observations and 100 features (which still seems ridiculously high) and not 100 observations of 34,596 features (which I don't see as possible in most normal/typical scenarios).
Anyway, if you have some of them labeled already, then you can use knnsearch() to label/classify any other, non-labeled data.
kmeans will come up with its own labels, so if they're all labeled already, then all you can do is pass in the feature values, without the labels as Walter said, and see how what kmeans chose differs from whatever method you have that produced the labels in advance (like human judging or whatever). You can then create a confusion matrix or ROC curve.

Connectez-vous pour commenter.

Catégories

En savoir plus sur Statistics and Machine Learning Toolbox dans Help Center et File Exchange

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by