I have a CUDA kernel that do my hard work, but I also have

Question

0

Asked: June 13, 20262026-06-13T09:45:55+00:00 2026-06-13T09:45:55+00:00

I have a CUDA kernel that do my hard work, but I also have

0

I have a CUDA kernel that do my hard work, but I also have some hard work that need to be done in the CPU (calculations with two positions of the same array) that I could not write in CUDA (because CUDA threads are not synchronous, I need to perform a hard work on a position X of an array and after do z[x] = y[x] – y[x – 1], where y is the array resultant of a CUDA kernel where each thread works on one position of this array and z is another array storing the result). So I’m doing this in the CPU.

I have several CPU threads to do the CPU side work, but each one is calling a CUDA kernel passing some data. My question is: what happens on the GPU side when multiple CPU threads are making GPU calls? Would be better if I do the CUDA kernel call once and then create multiple CPU threads to do the CPU side work?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-13T09:45:56+00:00

Kernel calls are queued and executed one by one in single stream.

However you can specify stream during kernel execution – then CUDA operations in different streams may run concurrently and operations from different streams may be interleaved.
Default stream is 0.

See: CUDA Streams and Concurrency

Things are similar when different processes use the same card.

Also remember that kernels are executed asynchronously from CPU stuff.

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I have a CUDA kernel that do my hard work, but I also have

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply