GPU backslash performance much slower than CPU

Meme Young on 27 Dec 2020

Edited: Matt J on 26 May 2022

test Gpu backslash (2).zip

I am doing numerical power flow calculations by modifying the functions of MATPOWER, an open-source toolbox. By modifying its function newtonpf.m, the computation can be run on the GPU. However, I found that the GPU is much, much slower than the CPU. When calculating MATPOWER's built-in case3012wp, the matrices in newtonpf.m are:

A: 5725 * 5725 sparse double, b: 5725 * 1 double.

In the 1st iteration of newtonpf(), A \ b generally takes around 0.01 sec on my MSI-GL65 (i7-10750H + RTX 2070super).

But if A and b are converted to GPU arrays, A \ b takes the following times, depending on the type of A:

full double, 0.8 sec

sparse double, 4 sec

full single, 0.1 sec

(sparse single is not supported)

So why is there such a difference in performance? I thought a GPU could do things much faster than a CPU.

Files are attached as follows. Atest is sparse and Agpu is a sparse gpu array. All are doubles.
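
For anyone who wants to reproduce this kind of comparison, here is a minimal timing sketch. It assumes Parallel Computing Toolbox and a supported GPU. The matrix is a synthetic stand-in (random sparse plus a diagonal shift), not the case3012wp Jacobian from the attached files, so absolute times will differ from the ones quoted above. timeit and gputimeit are used rather than tic/toc so that GPU work is only measured once it has actually finished.

```matlab
% Synthetic stand-in for the 5725-by-5725 sparse system (an assumption,
% not the MATPOWER Jacobian from the attached files).
rng(0);
n = 5725;
A = sprandn(n, n, 5/n) + 10*speye(n);   % random sparse plus a diagonal shift
b = randn(n, 1);

tCpu = timeit(@() A \ b);               % sparse double on the CPU
fprintf('CPU sparse double: %.4f s\n', tCpu);

if canUseGPU()
    Ag = gpuArray(A);                   % sparse double on the GPU
    Af = gpuArray(full(A));             % full double on the GPU
    As = gpuArray(single(full(A)));     % full single on the GPU
    bg = gpuArray(b);
    bs = gpuArray(single(b));

    % gputimeit waits for the GPU to finish before stopping the clock.
    fprintf('GPU sparse double: %.4f s\n', gputimeit(@() Ag \ bg));
    fprintf('GPU full double:   %.4f s\n', gputimeit(@() Af \ bg));
    fprintf('GPU full single:   %.4f s\n', gputimeit(@() As \ bs));
else
    disp('No usable GPU found; skipping GPU timings.');
end
```

To time the real system, replace A and b with the matrices loaded from the attached files.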

9 comments (7 older comments not shown)

kant on 26 May 2022

I also have this problem with my MATLAB code. Has the problem been solved?

Matt J on 26 May 2022

Edited: Matt J on 26 May 2022

@kant It has been concluded that this is expected behavior, but see below.

Answers (1)

Matt J on 27 Dec 2020

This thread looks relevant. It appears that sparse mldivide on the GPU is not expected to be faster.

https://www.mathworks.com/matlabcentral/answers/500526-solution-of-large-sparse-matrix-systems-using-gpu-mldivide

13 comments (11 older comments not shown)

Meme Young on 30 Dec 2020

What do you mean by a sparse solver algorithm, Mr Knight? Something like pcg()? I have tried it, and it is not as efficient as this approach: reordering with amd(), LU decomposition, and two backslashes based on the decomposition, especially when coping with the type of sparse matrix that I uploaded.
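
The reorder, factor, and two-backslash approach described here can be sketched on the CPU roughly as follows. This is only an illustration under assumptions: the test matrix is a made-up stand-in, and the variable names are not taken from newtonpf.m.

```matlab
rng(0);
n = 2000;
A = sprandn(n, n, 5/n) + 10*speye(n);   % synthetic sparse test matrix (assumption)
b = randn(n, 1);

p = amd(A);                 % fill-reducing approximate minimum degree ordering
Ap = A(p, p);               % symmetric permutation of rows and columns
[L, U, P] = lu(Ap);         % sparse LU with row pivoting: P*Ap = L*U

y = L \ (P * b(p));         % first backslash: forward substitution
xp = U \ y;                 % second backslash: back substitution, Ap*xp = b(p)

x = zeros(n, 1);
x(p) = xp;                  % undo the permutation to recover x in original order

relres = norm(A*x - b) / norm(b);
fprintf('relative residual: %.2e\n', relres);
```

If the sparsity pattern of the matrix stays the same from one Newton iteration to the next, the ordering p could be computed once and reused.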

Joss Knight on 10 Jan 2021

Edited: Joss Knight on 10 Jan 2021

Yes, PCG, GMRES, CGS, LSQR, QMR, TFQMR, BICG, BICGSTAB. Try them all, play with tolerance, iterations and preconditioning - something is likely to work. I'm not an expert in this field but this is what the sparse community tend to do.
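
As one possible starting point for that kind of experimentation, here is a minimal sketch using gmres with an incomplete-LU preconditioner on the CPU. The matrix, tolerance, restart length, and iteration limit are arbitrary choices for illustration, not recommendations. GMRES is used because the stand-in matrix is nonsymmetric, and pcg requires a symmetric positive definite matrix.

```matlab
rng(0);
n = 2000;
A = sprandn(n, n, 5/n) + 10*speye(n);   % synthetic nonsymmetric sparse matrix (assumption)
b = randn(n, 1);

% Incomplete LU with no fill-in, used as the preconditioner M = L*U.
setup.type = 'nofill';
[L, U] = ilu(A, setup);

tol = 1e-10;      % requested relative residual
restart = 30;     % GMRES restart length
maxit = 20;       % maximum number of outer iterations

[x, flag, relres, iter] = gmres(A, b, restart, tol, maxit, L, U);

% flag == 0 means gmres converged to the requested tolerance.
fprintf('flag = %d, outer/inner iterations = %d/%d\n', flag, iter(1), iter(2));
fprintf('true relative residual: %.2e\n', norm(A*x - b) / norm(b));
```

Trying bicgstab, cgs, or the other nonsymmetric solvers listed above mostly means changing the function name and dropping the restart argument. Whether any of them beats the direct solve, on the CPU or the GPU, has to be measured on the actual matrix.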
