blazing fast performance with parfor except it pukes on itself with big files

Robert Scott on 2 Aug 2021
Commented: Robert Scott on 2 Aug 2021
This is a general question, in case anyone has seen this before. I'm not aiming to pick apart complex code here.
I wrote a data handling program that is meant to be extremely fast. A lot of care went into getting it to run as fast as possible.
Over the course of several versions I have made 100x gains in run time.
I started using parfor, which gave me a 4x speedup, which was wonderful. However, when I ask parfor to chew on a large version of my test files, it just pukes and dies. It locks up my computer, and that is the end of it until I force a reboot.
My smaller test files are identical except that they have fewer lines to parse.
When I use a 10,000-line test CSV I get great performance.
When I use a 50,000-line CSV it just falls down dead.
Has anyone seen this before? I have a fairly powerful machine to run this on, so I don't think it's a resource problem. However, looking at Task Manager, I do see 90% memory usage.
Any general thoughts?
  5 comments
Walter Roberson on 2 Aug 2021
Yes, using parfor with tall arrays is specifically documented.
Could you give an approximate file size that you need to deal with? You mentioned 50000 lines, but about how many columns? Is each column 4 hex digits representing 16 bits?
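A minimal sketch of the tall-array route mentioned above, assuming the CSV has a header row and numeric columns. The file name, the column names `c1` to `c5`, and the random data are hypothetical stand-ins, since the real file layout is not given in this thread. The point is that a datastore-backed tall table is read in chunks, so the whole file never has to sit in memory at once.

```matlab
% Build a small stand-in CSV (hypothetical data and column names).
csvFile = fullfile(tempdir, "parfor_demo.csv");
T = array2table(rand(1000, 5), "VariableNames", {'c1','c2','c3','c4','c5'});
writetable(T, csvFile);

% A datastore describes the file without loading it into memory.
ds = tabularTextDatastore(csvFile, "ReadVariableNames", true);

% A tall table defers evaluation; operations on it are queued, not executed.
tt = tall(ds);

% gather triggers the chunked read and evaluates both results in one pass.
[nRows, meanC1] = gather(height(tt), mean(tt.c1));
fprintf("rows: %d, mean of c1: %.4f\n", nRows, meanC1);

% Remove the temporary stand-in file.
delete(csvFile);
```

If Parallel Computing Toolbox is installed, evaluating a tall array may start a parallel pool by default; calling `mapreducer(0)` beforehand keeps the evaluation in the local session.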
Robert Scott on 2 Aug 2021
The true answer is that the row count can be in the millions.
The column count will be in the several hundreds.
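A rough back-of-the-envelope estimate shows why sizes like these can exhaust memory. Every number below is an assumption for illustration: 2 million rows, 300 columns, and 4 workers are not stated in the thread, and the `uint16` case only applies if each value really is 16 bits, which was asked above but not confirmed. The estimate also assumes the whole array is used as a broadcast variable in the parfor loop, in which case each worker receives its own copy.

```matlab
% All sizes below are assumptions chosen to match "millions" of rows
% and "several hundred" columns; adjust them to the real data.
nRows    = 2e6;   % assumed row count
nCols    = 300;   % assumed column count
nWorkers = 4;     % assumed parallel pool size

bytesDouble = nRows * nCols * 8;  % double precision: 8 bytes per value
bytesUint16 = nRows * nCols * 2;  % uint16: 2 bytes per value

% A broadcast variable is sent to every worker, so the client copy
% plus one copy per worker must all fit in memory at the same time.
totalDouble = bytesDouble * (nWorkers + 1);
totalUint16 = bytesUint16 * (nWorkers + 1);

fprintf("double: %.2f GiB per copy, %.2f GiB total\n", ...
    bytesDouble / 2^30, totalDouble / 2^30);
fprintf("uint16: %.2f GiB per copy, %.2f GiB total\n", ...
    bytesUint16 / 2^30, totalUint16 / 2^30);
```

Under these assumptions a single double-precision copy is about 4.8 GB, so a handful of worker copies is enough to fill the RAM of a well-equipped desktop. Storing the values in a narrower type, or giving each worker only its own slice of the data, reduces that footprint.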


Answers (0)

Version

R2020b
