I have to multiply a very small sized matrix ( size – 10×10 ) with a vector several times 50000 to 100000 times ( could even be more than that). This happens for 1000 different matrices (could be much more). Would there be any significant performance gain by doing this operation on CUDA.
Share
Yes, it’s an ideal task for the GPU.