How to compare array's values with each other?

Question

0 votes

In an array (a) with indexes from 1 to m, I want to compare the values of this array one by one with each other, and if the distance (Difference) between two values is more than a value (z), for example, the difference between a(i) and a(j) at indexes i and j is more than z, I want to save these two indexes i and j and represent them in the output. I wrote these codes:

if abs(a(i)-a(j))> z
   disp(i);
   disp(j);
   fprintf('result is between %10.6f and %10.6f',i,j);
end

but there is an error in if line:

Subscript indices must either be real positive integers or logicals.

How can I define indexes for matlab. Is a for loop (for i=1:m) needed for passing the array, If a loop is necessary, should I put fprintf out of the loop because it will repeat. For saving and representing the indexes i and j in the output, I'm looking for better functions besides disp or fprintf.

0 commentaires
Afficher -2 commentaires plus anciens Masquer -2 commentaires plus anciens

Connectez-vous pour commenter.

Connectez-vous pour répondre à cette question.

Follow Question

Answer 1

Guillaume le 31 Juil 2019

Ouvrir dans MATLAB Online

1 vote

It's unclear how you get your error if your i and j were just created with a for i = 1:m and for j=1:m. They're clearly something else for you to get that error.

Anyway, assuming a is a vector and assuming matlab>=R2016b, this is very straightforward:

distance = abs(a - a.')

will create a m x m matrix of the distance between a(i) and a(j) for all i and j.

finding the i and j of the elements for which distance is greater than z is also easy:

[i, j] = find(distance > z)

which you could store in a 2 column matrix if you wanted:

pairs = [i, j]

5 commentaires
Afficher 3 commentaires plus anciens Masquer 3 commentaires plus anciens

phdcomputer Eng le 31 Juil 2019

Modifié(e) : phdcomputer Eng le 31 Juil 2019

Ouvrir dans MATLAB Online

Thank you very much

I'm very grateful for your help.

I wrote a program to select best features of a data, and the aim of my question about comparing an array's elements was to find best threshold for cuting features.

first I loaded lung dataset, beacause this data had 2 class labels I seprated them and then I computed the hamming distances between each of features, then I sorted them ascendingly and saved the results in the array (A), I calculated z value and with your help I could find the best range of threshold for feature selection.

I wanted to ask your opinions about the results of pairs matrix.

close all;
clc
load lung.mat
data=lung; 
[n,m]=size(data);
l=1;
t=1;
data1=[];
data2=[];
    if data(i,m)==1
        data1(l,:)=data(i,:);
        l=l+1;
    else
        data2(t,:)=data(i,:); 
        t=t+1;
    end
end
if t>l
    data1(l:t-1,:)=0;
else
    data2(t:l-1,:)=0;
end
for i=1: m                       
   thisCol1=data1(:,i);      
   thisCol2=data2(:,i);
   a(i)=fHammingDist(thisCol1,thisCol2);
end
[A,indA1]=sort(a,'descend'); 
z=sum(A)/(m-1);
distance=bsxfun(@minus,A,A.');
[i,j]=find(distance>z);
pairs=[i,j];

according to pairs results in the output:

I wanted to set the threshold between a(i) and a(j) that difference between them is greater than z. but I don't know how to find the thereshold among these values.I'll be very grateful to have your grateful opinions.

Thanks

Guillaume le 31 Juil 2019

Modifié(e) : Guillaume le 31 Juil 2019

Ouvrir dans MATLAB Online

I'm really confused as to what you're trying to do. Why are you calculating distances between your hamming distances? On the other hand, it's not my field, so maybe it makes sense to calculate the distance of distances.

I also don't understand why you're sorting your hamming distances, thereby separating their ordering from the feature vectors ordering. The z calculation and the distance of distances calculation doesn't depend of the order, so why?

----

Unrelated to this, the data1 and data2 separation can be done more simply with just:

data1 = data(data(:, end) == 1, :);
data2 = data(data(:, end) ~= 1, :);
data1(end+1:size(data2, 1), :) = 0;  %will add rows to data1 if shorter than data2, otherwise does nothing
data2(end+1:size(data1, 1), :) = 0;  %will add rows to data1 if shorter than data2, otherwise does nothing

And it would be wiser to use cell arrays instead of numbered variables (numbered variables are always a bad idea), particularly if in the future you have more than 2 classes:

data{1} = data(data(:, end) == 1, :);
data{2} = data(data(:, end) ~= 1, :);
data{1}(end+1:size(data2, 1), :) = 0;  %will add rows to data{1} if shorter than data2, otherwise does nothing
data{2}(end+1:size(data1, 1), :) = 0;  %will add rows to data{2} if shorter than data2, otherwise does nothing

phdcomputer Eng le 9 Août 2019

@Guillaume Thank you very much for your valuable and helpful tips, I'm very grateful for your attention.

I calculated the distance between the part of each feature that belongs to class 1 and another part of the feature that belongs to class 2 .

My aim was selecting the most discriminative features among all of the features of the data by sorting these distance values descendingly and then cut the greater values so just keep this count of features and discard the rest of them.

For this purpose, when I plot the sorted distances (A) it's very complicated to find the best threshold for cutting the features through observation.

I wanted to use the z value in this way that if the distance of two computed values (the elements of A , for example i & j ) are greater than z , so the program keeps the number of two features.

By using this approach I obtained the above results of i & j , but it seems meaningless,I think i and j must be continuous for example 10 & 11 , that we can select 10 features of the data.

Your valuable advices will help me a lot.

Thanks greatly

Guillaume le 9 Août 2019

"My aim was selecting the most discriminative features among all of the features of the data by sorting these distance values descendingly and then cut the greater values so just keep this count of features and discard the rest of them."

As I said, this is not my field. If most discriminative features is equivalent to pair of features with the largest hamming distance between them, then that part makes sense.

What I don't understand is what you do after, if you have hamming distance a(i) between feature V(m) and V(n), and hamming distance a(j) between feature V(x) and V(y), what does a(i)-a(j) mean (which is what you calculate with your distance)?

phdcomputer Eng le 11 Août 2019

Modifié(e) : phdcomputer Eng le 11 Août 2019

Thanks. I'm very grateful for your attention.

rst I tried to find the best point for cutting the best features by plotting the sorted distances (A) , but it's complicated because in some parts of the figure , values are changing gradually but in some points the decrease is suddenly, by the way sometimes I can't choose which points is better as threshold.for example in multiple points, the values have sudden drop.

as you said if we suppose a(i) and a(j) are disances.

a(i)-a(j) is the difference of two points in the plot(A) figure and we can put this condition that if the difference of two points in the figure is more than a computed value for example z , we keep the points i and j and we can cut i number of features.

my purpose of threshold is the point that the distance values are decreasing after that point impressive.

Thank you very much

Connectez-vous pour commenter.

Answer 2

Jon le 31 Juil 2019

Ouvrir dans MATLAB Online

1 vote

Staying close to what you have started here, you could put your code into a double loop, for example

% assign threshold
z = 10; % or what ever your threshold is
% find number of elements to loop through
N = length(a)
% preallocate array to hold results
% elements of D will be set to true (1) when
% a(i) and a(j) are further apart than threshold
D = zeros(N,N)
for i = 1:N
    for j = 1:N
        D(i,j)=abs(a(i)-a(j))> z
    end
end
% display indices of elements whose absolute difference exceeds threshold, z
[idxI, idxJ] = find(D)
disp(idxI)
disp(idxJ)

5 commentaires
Afficher 3 commentaires plus anciens Masquer 3 commentaires plus anciens

Guillaume le 31 Juil 2019

For real matrix, I don't think there's any difference in performance between the two, so you can indeed use either.

However, since the OP never specified that the vectors were pure real, and since the original code would have worked with complex numbers, I used the plain transpose so as not to change the meaning of the distance formula.

By default, I tend to use .' so that the code works the same with real or complex numbers, when all is meant is changing the direction of a vector.

I'm not a mathematician, maybe it makes sense that the shorter ' is a conjugate tranpose. if the design had been up to me, I would have swapped the meaning of the two so that ' was a plain transpose and .' a conjugate transpose.

Jon le 31 Juil 2019

@Guillaume - Thanks for the explanation.

Connectez-vous pour commenter.

How to compare array's values with each other?

0 commentaires
Afficher -2 commentaires plus anciens Masquer -2 commentaires plus anciens

Réponse acceptée

5 commentaires
Afficher 3 commentaires plus anciens Masquer 3 commentaires plus anciens

Plus de réponses (1)

5 commentaires
Afficher 3 commentaires plus anciens Masquer 3 commentaires plus anciens

Catégories

Tags

Community Treasure Hunt

How to compare array's values with each other?

0 commentaires Afficher -2 commentaires plus anciens Masquer -2 commentaires plus anciens

Réponse acceptée

5 commentaires Afficher 3 commentaires plus anciens Masquer 3 commentaires plus anciens

Plus de réponses (1)

5 commentaires Afficher 3 commentaires plus anciens Masquer 3 commentaires plus anciens

Catégories

Tags

Voir également

Community Treasure Hunt

0 commentaires
Afficher -2 commentaires plus anciens Masquer -2 commentaires plus anciens

5 commentaires
Afficher 3 commentaires plus anciens Masquer 3 commentaires plus anciens

5 commentaires
Afficher 3 commentaires plus anciens Masquer 3 commentaires plus anciens