groupby of one column

Question

0 votes

I have a file called msg.txt which has three columns:

message_no instance_no delay
1  1228
1  1240
1  3304
1  5320
1  7324
2  1232  
2  3308
3  1236
3  3300
2  328
2  1080
4  1228
4  3304
2  5320
5  1232
5  3308
6  1236
6  3300
3  1076
3  3328
7  1228
7  3304
3  5320
8  1232
8  3308
4  328
9  1236
4  1072
9  3300
10  1228
10  3304
4  5320
11  1232
11  3308
5  1324
12  1236
5  1248
12  3300
13  1228
13  3304

Now i want group all message by column1 i.e by there message_no and plot the graph between instance_no and delay of the corresponding message_no. Suppose if we consider message_no=2, then it has 13 instances and delays, and i have plot the graph between instance_no and delays of message_no=2.

Thank You, Venkata

0 commentaires
Afficher -2 commentaires plus anciens Masquer -2 commentaires plus anciens

Connectez-vous pour commenter.

Connectez-vous pour répondre à cette question.

Follow Question

Answer 1

neuromechanist le 14 Juil 2020

Modifié(e) : neuromechanist le 25 Fév 2021

5 votes

This is about eight years late. But for anybody who may stumble upon this question, look in to findgroups function. It creates groups based on the content of table column (or any variable). Then you can use the results with splitapply to apply any aggreagate fucntion such as mean, max, sum, etc.

2 commentaires
Afficher Aucune Masquer Aucune

frankovaT le 3 Fév 2021

I needed it thanks

neuromechanist le 3 Fév 2021

You are welcome. I appreciate it if you can upvote it, so others will find the answer easier.

Connectez-vous pour commenter.

Answer 2

Geoff le 8 Mai 2012

Ouvrir dans MATLAB Online

1 vote

Selection is different from grouping. If you just want instance_no and delays where message_no=2, you can do this (assuming you have already read the file into a matrix called data):

xy = data(data(:,1)==2, [2 3]);

But to get all the groups....

I don't know any fancy grouping functions off-hand, but if you don't have loads of data, why not do the same thing. It's not particularly efficient but it's easy:

messages = unique(data(:,1));
xys = arrayfun( @(m) data(data(:,1)==m, [2 3]), messages, 'UniformOutput', 0 );

If you want to be a bit more efficient (maybe you have lots of data), you could sort by message number first and then partition the data set:

sdata = sortrows(data);
endidx = find(diff(sdata(:,1)) ~= 0);
r = [1 endidx'; endidx' size(sdata,1)];
idx = arrayfun( @(b) r(1,b):r(2,b), 1:size(r,2), 'UniformOutput', 0 );
xys = cellfun( @(ii) data(ii,[2 3]), idx, 'UniformOutput', 0 );

The above finds the indices of the last number in each set (by detecting when the value changes). It then builds the 'range' matrix r whos first row is the start-index and second row is the end-index of each set. Then an cell-array of index ranges for each set, idx, is created and that is used to pull the required two columns out of the data.

0 commentaires
Afficher -2 commentaires plus anciens Masquer -2 commentaires plus anciens

Connectez-vous pour commenter.

groupby of one column

0 commentaires
Afficher -2 commentaires plus anciens Masquer -2 commentaires plus anciens

Réponses (2)

2 commentaires
Afficher Aucune Masquer Aucune

0 commentaires
Afficher -2 commentaires plus anciens Masquer -2 commentaires plus anciens

Catégories

Tags

Community Treasure Hunt

groupby of one column

0 commentaires Afficher -2 commentaires plus anciens Masquer -2 commentaires plus anciens

Réponses (2)

2 commentaires Afficher Aucune Masquer Aucune

0 commentaires Afficher -2 commentaires plus anciens Masquer -2 commentaires plus anciens

Catégories

Tags

Voir également

Community Treasure Hunt

0 commentaires
Afficher -2 commentaires plus anciens Masquer -2 commentaires plus anciens

2 commentaires
Afficher Aucune Masquer Aucune

0 commentaires
Afficher -2 commentaires plus anciens Masquer -2 commentaires plus anciens