Removing row from a matrix if value in row < previous value

Question

Alexander Seal le 16 Nov 2016

1
Lien

Utiliser le lien direct vers cette question

https://fr.mathworks.com/matlabcentral/answers/312532-removing-row-from-a-matrix-if-value-in-row-previous-value

Modifié(e) : Alexander Seal le 16 Nov 2016

I have data that when sorted based on column 1, column 6 should be in ascending order, so n should >n-1. However the program that outputs this data creates incorrect values for column 6 that are much lower than they should be for a few data points, then they return to normal. I want to remove these values (or make n = n-1 if n<n-1)

At the moment I'm doing this with an if command in excel after sorting the file, but with 100's of files this is incredibly tedious.

I've tried the below, choosing a threshold value for the "incorrect" data to be below, however the threshold changes between files so this is no use.

datas = sortrows(data,-1) %sorts data by descending distance data
rowremove = datas(:,6)<=athres %removes row if area data is <= threshold - corrects for "bad" area data
datas(rowremove,:) = []

Is this possible in matlab?

2 commentaires
Afficher AucuneMasquer Aucune

Jan le 16 Nov 2016

Modifié(e) : Jan le 16 Nov 2016

Is what possible?

How do you identify the bad data securely? If a threshold is not sufficient, n>n-1 might be. But what happens if several bad values are neighboring? Is the n>n-1 property guaranteed even then?

Alexander Seal le 16 Nov 2016

Yes that is an issue unfortunately, there is some neighboring bad data. The way the excel if function works is to append the "good" data to a new column, and check the original data's column against the new columns preceding data.

Connectez-vous pour commenter.

Connectez-vous pour répondre à cette question.

Answer 1

dpb le 16 Nov 2016

1
Lien

Utiliser le lien direct vers cette réponse

https://fr.mathworks.com/matlabcentral/answers/312532-removing-row-from-a-matrix-if-value-in-row-previous-value#answer_243527

Ouvrir dans MATLAB Online

Jan's point is valid; if the bad data are excessively corrupted removing the offending rows may only create a new set of offending values. If, otoh, the overall slope is large enough and the erroneous values not too far of, then perhaps

   ix=[true; diff(x(:,6))>0];
   x=x(ix,:);

may work. If the above causes the issue that you then have a new set of offenders, then you'll likely have to use the above to

locate the first of each offending section
search from that point to the next
replace/remove those sections before processing next

A sample dataset (relatively short) but showing typical result would be helpful, probably. (Only need the two columns; the additional are immaterial to the problem of selection/retention/disposal).

1 commentaire
Afficher -1 commentaires plus anciensMasquer -1 commentaires plus anciens

Alexander Seal le 16 Nov 2016

Modifié(e) : Alexander Seal le 16 Nov 2016

565_1.csv

thank you for the reply, attached is a sample of the "worst offending" data sets, with multiple "bad" data neighboring

Columns A and F are relevant, data is sorted by column A descending, which in theory should lead to column F ascending (the large steps in data are expected). Note instances of sudden drops in order of magnitude (see rows 270-274). Column G is the "fixed" data using '=if(f3<g2,TRUE=g2+(g2-g1), FALSE=f3)'

Connectez-vous pour commenter.