TreeBagger using obscene amount of memory when run in parallel
Afficher commentaires plus anciens
Hi,
Im experiencing issues when running TreeBagger on a cluster. I run this code on a large cluster with 64 processors and 128 GB of memory. However, when I try to use TreeBagger on my dataset (~200 MB in size) with 5000 trees, matlab errors out after a few hours with OUT of MEMORY issues.
Here are my steps:
1. send a batch job to the cluster via distributed computing toolbox and open a matlabpool with 32 workers.
2. options = statset('UseParallel', 'Always');
3. B= TreeBagger(ntrees, tsp, tsp_label, 'Fboot', fboot, 'Options', options); where ntrees = 5000 and fboot=0.5.
I dont understand why TreeBagger is using so much memory (>128GB). When I run this same job locally on my 16GB computer, the memory use does not exceed 16GB. Am I doing something improperly?
Thanks for your help!
Réponse acceptée
Plus de réponses (0)
Catégories
En savoir plus sur Classification Ensembles dans Centre d'aide et File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!