
How to extract the variable IDs after dimensionality reduction via pca?

Bank Yin on 8 Jul 2014
Commented: Bank Yin on 8 Jul 2014
I have a dataset X (voxels by proteins) of size 3694200-by-10. I used pca to reduce its dimensionality and got a matrix of size 3694200-by-4. I need to know which 4 proteins remain. How can I get the original column IDs (0-9) of those 4 proteins?
My code:
[COEEF,SCORE,LATENT] = pca(X);
cumVar = cumsum(LATENT)./sum(LATENT);
reducedData = SCORE(:,1:4);

Accepted Answer

the cyclist on 8 Jul 2014
PCA doesn't select a subset of variables. (Specifically, in your case, PCA does not select 4 out of your 10 variables.)
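Instead, PCA identifies linear combinations of your original variables that explain the overall variation. If a small number of these linear combinations capture most of the variation, then it makes sense to keep only those combinations. The first output of pca (COEEF in your code) gives the coefficients of those linear combinations: each column holds the coefficients of one principal component, each row corresponds to one of your original variables, and the columns are ordered by decreasing explained variance. So each of your 4 retained columns in SCORE is a mix of all 10 proteins, not a single protein.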
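To make this concrete, here is a minimal sketch of how one might inspect the coefficients. It runs on a small synthetic matrix standing in for your X, and the names k, loadings, topVar and topVarZeroBased are made up for illustration. Note that picking the variable with the largest absolute coefficient is only a rough way to summarize a component; it does not mean that the component "is" that variable.

```matlab
% Small synthetic stand-in for X (the real data is 3694200-by-10).
rng(0);                              % reproducible random data
X = randn(500, 10);
X(:, 3) = 5 * X(:, 3);               % give column 3 a much larger variance

[coeff, score, latent] = pca(X);     % pca centers the columns of X by default

k = 4;                               % number of retained components
loadings = coeff(:, 1:k);            % 10-by-4: rows = original variables, columns = components

% Each retained score column is the centered data times one column of coeff,
% i.e. a linear combination of all 10 original variables.
Xc = X - mean(X, 1);                 % implicit expansion, needs R2016b or later
reconErr = max(max(abs(score(:, 1:k) - Xc * loadings)));

% For each component, the original variable with the largest absolute coefficient.
[~, topVar] = max(abs(loadings), [], 1);
topVarZeroBased = topVar - 1;        % the question numbers the columns 0-9
```

Displaying loadings shows how much every protein contributes to each retained component. If you truly need a subset of the original proteins rather than combinations of them, that is a feature selection problem, which PCA by itself does not solve.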

More Answers (0)
