Removing outliers from a matrix
I removed the outliers from each column of my feature matrix, but now the feature vectors have different lengths. I cannot trim them to the same length, because I would lose data from a certain range. If I shuffle the rows, the correlation between my features and my outcome becomes messy. How can I remove the outliers while keeping the features and outcomes aligned? My matrix of features and outcomes is 55000-by-14. The last column holds the outcomes.
1 Comment
Davide Masiello
on 11 Oct 2022
Maybe, instead of removing the outliers, you could replace them with an interpolated value.
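A minimal sketch of that suggestion, assuming a MATLAB release that ships filloutliers (R2017a or later). The small matrix X is made-up illustration data, not the poster's 55000-by-14 matrix, and the default 'median' detection rule is assumed to be acceptable. Note that 'linear' interpolates along the row index, which only makes sense if the rows are in a meaningful order; otherwise a fill method such as 'center' or 'clip' may be a better fit.

```matlab
% Hypothetical 8-by-3 matrix: two feature columns and one outcome column.
X = [  1  10  0.1
       2  11  0.2
       3  12  0.3
       4  13  0.4
     100  14  0.5
       6  15  0.6
       7 -50  0.7
       8  17  0.8];

% Work on the feature columns only; the last column (outcome) is left as-is.
features = X(:, 1:end-1);

% Detect outliers column by column (default rule: more than three scaled
% MAD from the median) and replace each one by linear interpolation
% between its non-outlier neighbours in the same column.
filled = filloutliers(features, 'linear');

% Reassemble: same number of rows as before, so rows stay aligned.
Xfilled = [filled, X(:, end)];
```

Because no rows are deleted, every feature value still sits on the same row as its outcome.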
Answers (1)
Bjorn Gustavsson
on 11 Oct 2022
Edited: Bjorn Gustavsson on 11 Oct 2022
QD-answer: For any single pair of columns, the best you can do is to use only the rows where neither value is an outlier. If you replace each outlier with NaN, the cov function handles this for you through its 'omitrows' or 'partialrows' options. The corrcoef function has a slightly different interface: it takes a name-value pair, 'Rows', to select the corresponding behaviour ('complete' or 'pairwise'). See the help and documentation for a description.
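A rough sketch of the NaN approach, under some assumptions: the matrix X is made-up illustration data, isoutlier (R2017a or later) with its default rule stands in for whatever outlier test was actually used, and only the feature columns are screened.

```matlab
% Hypothetical 8-by-3 matrix: two feature columns and one outcome column.
X = [  1  10  0.1
       2  11  0.2
       3  12  0.3
       4  13  0.4
     100  14  0.5
       6  15  0.6
       7 -50  0.7
       8  17  0.8];

% Flag outliers column by column in the feature columns only.
features = X(:, 1:end-1);
mask = isoutlier(features);

% Replace outliers with NaN instead of deleting them, so the matrix
% keeps its shape and each row still lines up with its outcome.
features(mask) = NaN;
Xnan = [features, X(:, end)];

% Covariance: 'omitrows' drops every row that contains any NaN,
% 'partialrows' drops rows pair by pair for each pair of columns.
C_complete = cov(Xnan, 'omitrows');
C_pairwise = cov(Xnan, 'partialrows');

% Correlation: the same two behaviours via the 'Rows' name-value pair.
R_complete = corrcoef(Xnan, 'Rows', 'complete');
R_pairwise = corrcoef(Xnan, 'Rows', 'pairwise');
```

The pairwise variants keep more data per column pair, at the cost that different entries of the matrix are computed from different subsets of rows.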
HTH