parallel code execution on MATLAB cluster

As I run a code on a cluster using spmd, sometimes a worker gets disconnected and the execution stops. In another instance, the job became 'queued' after running for multiple hours and then eventually the execution stopped. What could be potential reasons for these?

1 commentaire

Are you using Linux? Could you cofirm the maximum process is sufficient?
ulimit -a

Connectez-vous pour commenter.

Réponses (0)

Catégories

En savoir plus sur MATLAB Parallel Server dans Centre d'aide et File Exchange

Commenté :

le 11 Jan 2018

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by