I just can’t find a JCuda example for one of the simplest tasks: adding two arrays of values element-wise into a third array.
I only ever find C/C++ examples, which rely on some CUDA precompiler and are therefore unusable for me. They contain scary things such as:
VecAdd<<<blocksPerGrid, threadsPerBlock>>>(d_A, d_B, d_C, N)
Well, the JCuda libraries contain nothing similar to VecAdd.
JCuda also has a lot of sub-libraries, such as JCudpp, JCublas, … I really don’t know where to look, or what to look for.
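For anyone else puzzled by that launch line: as far as I understand it, `VecAdd<<<blocksPerGrid, threadsPerBlock>>>(...)` just means "run the VecAdd function once for every thread in a grid of that size", with each thread working out its own array index. Below is a minimal plain-Java sketch that emulates this on the CPU with two nested loops. It does not use JCuda or the GPU at all. The class name `VecAddLaunchSketch` and the helpers `blocksFor` and `launch` are made up for illustration, and the block and thread indices are passed as ordinary parameters because Java has no built-in equivalent of CUDA's `blockIdx`, `blockDim` and `threadIdx`.

```java
import java.util.Arrays;

public class VecAddLaunchSketch {

    // Stand-in for the CUDA kernel body: one call handles one element.
    // In CUDA the three index values come from built-in variables;
    // here they are plain parameters (an assumption of this sketch).
    static void vecAdd(int blockIdx, int blockDim, int threadIdx,
                       float[] a, float[] b, float[] c, int n) {
        int i = blockIdx * blockDim + threadIdx;
        if (i < n) { // the last block may contain threads past the end of the arrays
            c[i] = a[i] + b[i];
        }
    }

    // Number of blocks needed to cover n elements (integer division, rounded up).
    static int blocksFor(int n, int threadsPerBlock) {
        return (n + threadsPerBlock - 1) / threadsPerBlock;
    }

    // CPU emulation of the <<<blocksPerGrid, threadsPerBlock>>> launch:
    // a GPU runs these iterations in parallel, this sketch runs them one by one.
    static void launch(int blocksPerGrid, int threadsPerBlock,
                       float[] a, float[] b, float[] c, int n) {
        for (int block = 0; block < blocksPerGrid; block++) {
            for (int thread = 0; thread < threadsPerBlock; thread++) {
                vecAdd(block, threadsPerBlock, thread, a, b, c, n);
            }
        }
    }

    public static void main(String[] args) {
        int n = 10;
        float[] a = new float[n];
        float[] b = new float[n];
        float[] c = new float[n];
        for (int i = 0; i < n; i++) {
            a[i] = i;
            b[i] = 2 * i;
        }

        int threadsPerBlock = 4;
        int blocksPerGrid = blocksFor(n, threadsPerBlock);
        launch(blocksPerGrid, threadsPerBlock, a, b, c, n);

        System.out.println(Arrays.toString(c));
    }
}
```

The `i < n` check is there because the grid is rounded up to whole blocks, so some threads can land past the end of the arrays. In JCuda itself, I believe the counterpart of the launch line is the `cuLaunchKernel` call in the driver API bindings, which takes the grid and block sizes as ordinary arguments, while the kernel is still written in CUDA C and compiled separately.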
Please help!
I managed to assemble a working example:
Source
Output
Note
Now I will play around with AMD Aparapi.