Is there a better way to compute metrics on labeled array elements?

Burke Rosen on 17 June 2018
Edited: Burke Rosen on 18 June 2018
For example, I have a 1-D double array 'data' and a 1-D cell array of strings called 'labels'. For each unique label, I want the mean of the corresponding data. The best I have come up with is below, but I don't believe it is fully vectorized. Is there a better way?
%% Make sample dataset
n = 1000;
data = rand(n,1);
labels = char(randsample(97:122,n,true)'); % n-by-1 char array of letters a-z
%% Get the mean for each label
[uniLab,~,labIdx] = unique(labels,'stable'); % 'stable' (first-occurrence order), chosen for speed
mu = arrayfun(@(x) mean(data(labIdx==x)),1:numel(uniLab)); % one logical mask per unique label

Accepted Answer

Walter Roberson on 17 June 2018
2 Comments
Walter Roberson on 17 June 2018
The last step of your code can be replaced by
accumarray(labIdx, data, [], @mean)
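To make the substitution concrete, here is a minimal sketch that runs both formulations on the same data and compares the results. It assumes base MATLAB only, so the labels are generated with `randi` instead of `randsample`; the names `muLoop` and `muAcc` and the fixed seed are illustrative and not part of the original post.

```matlab
% Sample data: n random values, each tagged with a letter a-z
rng(0);                                   % fixed seed, for reproducibility only
n = 1000;
data = rand(n,1);
labels = char(randi([97 122], n, 1));     % n-by-1 char array (assumption: randi instead of randsample)

% Map each label to an integer group index
[uniLab, ~, labIdx] = unique(labels, 'stable');

% Original formulation: one logical mask per unique label
muLoop = arrayfun(@(x) mean(data(labIdx == x)), (1:numel(uniLab))');

% accumarray formulation: groups data by labIdx and applies @mean to each group
muAcc = accumarray(labIdx, data, [], @mean);

% The two results should agree up to floating-point rounding
maxDiff = max(abs(muLoop - muAcc));
```

Note that `accumarray` returns a column vector with one entry per group, in the same order as `uniLab`, whereas `arrayfun` over `1:numel(uniLab)` returns a row vector.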
Burke Rosen on 18 June 2018
Edited: Burke Rosen on 18 June 2018
This yields a ~25% speed increase at n = 1e3 and ~5% at n = 1e5 (500 trials per algorithm, randomized order). Thank you.
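For anyone who wants to repeat a comparison like this, one possible sketch uses `timeit`. This is not the 500-trial randomized benchmark described above, whose code was not posted; the handle names `fLoop` and `fAcc` are hypothetical, and the timings will vary by machine and release.

```matlab
% Build the same kind of sample data as in the question (base MATLAB only)
n = 1e3;
data = rand(n,1);
labels = char(randi([97 122], n, 1));     % assumption: randi instead of randsample
[uniLab, ~, labIdx] = unique(labels, 'stable');
nLab = numel(uniLab);

% Wrap each formulation in a zero-argument handle, as timeit requires
fLoop = @() arrayfun(@(x) mean(data(labIdx == x)), 1:nLab);
fAcc  = @() accumarray(labIdx, data, [], @mean);

% timeit calls each handle several times and returns a typical run time in seconds
tLoop = timeit(fLoop);
tAcc  = timeit(fAcc);
fprintf('arrayfun: %.3g s, accumarray: %.3g s\n', tLoop, tAcc);
```

Both handles here time only the last step; the call to `unique` is excluded, so the ratio may differ from a benchmark that times the whole procedure.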


More Answers (1)

Burke Rosen on 17 June 2018
Thank you for that tip, @Walter.
After further review:
1. The way I wrote the sample dataset, labels is actually a character array, not a cell array; one has to apply cellstr to it to get a cell array.
2. mu = grpstats(data,labels,'mean') is compact, easy to read, and maybe 1 or 2 percent faster than my formulation, if one adds the cellstr.
3. My solution is 5x faster than grpstats if labels is a character array rather than a cell array.
4. My guess is that unique operates much faster on character arrays than on cell arrays, and that the runtime of the loop (or arrayfun) over the unique labels is negligible compared to that of unique itself.
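The points above can be checked with a short sketch along these lines. It assumes base MATLAB for everything except `grpstats`, which needs the Statistics and Machine Learning Toolbox and is therefore guarded; the variable names (`labelsChar`, `labelsCell`, `tChar`, `tCell`) are illustrative, and the timings are whatever your machine reports, not the figures quoted above.

```matlab
rng(0);                                   % fixed seed, for reproducibility only
n = 1000;
data = rand(n,1);
labelsChar = char(randi([97 122], n, 1)); % n-by-1 char array, as the question actually builds
labelsCell = cellstr(labelsChar);         % n-by-1 cell array of character vectors

% Both label types yield the same group indices
[~, ~, idxChar] = unique(labelsChar, 'stable');
[~, ~, idxCell] = unique(labelsCell, 'stable');
sameGrouping = isequal(idxChar, idxCell);

% Time unique alone on each label type, to test the guess in point 4
tChar = timeit(@() unique(labelsChar, 'stable'));
tCell = timeit(@() unique(labelsCell, 'stable'));
fprintf('unique on char: %.3g s, on cellstr: %.3g s\n', tChar, tCell);

% grpstats is a toolbox function; skip this part if it is not installed
if exist('grpstats', 'file')
    muGrp = grpstats(data, labelsCell, 'mean');
    muAcc = accumarray(idxCell, data, [], @mean);
    % Compare sorted values so the check does not depend on group order
    maxDiff = max(abs(sort(muGrp) - sort(muAcc)));
    fprintf('max difference between grpstats and accumarray: %.3g\n', maxDiff);
end
```

If `tCell` comes out much larger than `tChar`, that would be consistent with the guess that `unique` dominates the runtime when the labels are stored as a cell array.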

Release

R2017a
