parfor and for loops - different results

Question

0 votes

Hello all, I have a strange issue that I can't quite figure out. I tried to speed up my code by introducing parrallel processing, I was successful in making my code run faster but the final output of my code changed.

The purpose of my code is to evaluate the auROC curve based on two inputs. The part of my code where I introduced parallel processing was a step where I shuffled my data randomly 1000 times, and calculated 1000 auROC curves. I tried to run this through a parfor loop:

p = randperm(size(binned_raw,1),1000);
    parfor ii = 1:1000
        binned_raw_shift = circshift(binned_raw,p(ii),1);
        [AUROC, TPR, FPR] = get_ROC(binned_raw_shift, binned_behavior);
        shuffled_raw(ii,:) = AUROC;
    end
    

As you can see in my code I first generate 1000 random numbers that are less than or equal to the length of my input, and with each loop in the parfor loop I perform a circular shift on my input (binned_raw), and run it through a function I wrote. I collect all 1000 outputs in my matrix shuffled_raw, and I use that as a threshold to determine if any of my original auROC curves were significantly greater than my randomly shuffled data.

The issue I am having is that introducing the parfor loop at this step changes the results compared to when I only use a for loop. I do not exactly see why it should change anything because any indexing and saving of outputs should be controlled by the value of ii. I should note, the changes I observed were that when I used the parfor loop, the results we much more liberal than when I used a normal for loop, stating that a much higher percent of my original data was significantly greater than the shuffled data. Please if anyone can help me figure out why this problem is occurring I would be very greatful. Thanks a bunch.

8 commentaires
Afficher 6 commentaires plus anciens Masquer 6 commentaires plus anciens

Dana le 26 Août 2020

The point is, if you run this script twice even just using the for loop both times (no parfor), you'll likely get different answers. So it has nothing to do with the parfor, it's that the initial random draw will be different every time you run the script (unless you use rng to seed the random number generator).

Connor Johnson le 26 Août 2020

I disagree that the issue stems from this, as shuffling the data 1000 times and recalculating should not change the overall trend of the random data. I can run this code over and over again with the normal for loop and get the same results.

I should be able to randomly generate 1000 numbers to shift my data (within range of the actual length of my data) and get similar results no matter what the numbers are, as long as they are not repeating.

However, I could still be wrong and I will run some tests. I feel strongly like this is not the case because I can run the for loop version of my code multiple times and get the same results, and run the parfor multiple times and get the same results. However the for and parfor results are very different.

Connectez-vous pour commenter.

Connectez-vous pour répondre à cette question.

Follow Question

Answer 1

Matt J le 26 Août 2020

Ouvrir dans MATLAB Online

0 votes

Because randperm returns a random result, the sequence p(ii) will be different in two consecutive runs,e.g.,

>> p = randperm(8,4)
p =
     6     4     7     3
>> p = randperm(8,4)
p =
     8     7     5     4

That would explain why the parfor and the for-loop versions don't give the same output.

2 commentaires
Afficher Aucune Masquer Aucune

Matt J le 26 Août 2020

Ouvrir dans MATLAB Online

The code for the comparison should look like this:

p = randperm(size(binned_raw,1),1000); %generate only once
    for ii = 1:1000
        binned_raw_shift = circshift(binned_raw,p(ii),1);
        [AUROC, TPR, FPR] = get_ROC(binned_raw_shift, binned_behavior);
        shuffled_rawFOR(ii,:) = AUROC;
    end
    parfor ii = 1:1000
        binned_raw_shift = circshift(binned_raw,p(ii),1);
        [AUROC, TPR, FPR] = get_ROC(binned_raw_shift, binned_behavior);
        shuffled_rawPARFOR(ii,:) = AUROC;
    end
    
    
 Difference = max(abs(shuffled_rawFOR - shuffled_rawPARFOR),'all')   

Connor Johnson le 26 Août 2020

I will run this and share, lets see.

Connectez-vous pour commenter.

parfor and for loops - different results

8 commentaires
Afficher 6 commentaires plus anciens Masquer 6 commentaires plus anciens

Réponses (1)

2 commentaires
Afficher Aucune Masquer Aucune

Catégories

Tags

Community Treasure Hunt

parfor and for loops - different results

8 commentaires Afficher 6 commentaires plus anciens Masquer 6 commentaires plus anciens

Réponses (1)

2 commentaires Afficher Aucune Masquer Aucune

Catégories

Tags

Voir également

Community Treasure Hunt

8 commentaires
Afficher 6 commentaires plus anciens Masquer 6 commentaires plus anciens

2 commentaires
Afficher Aucune Masquer Aucune