I’m confused by some comments I’ve seen about blocking and cudaMemcpy. It is my

Question

0

Asked: June 8, 20262026-06-08T07:52:59+00:00 2026-06-08T07:52:59+00:00

I’m confused by some comments I’ve seen about blocking and cudaMemcpy. It is my

0

I’m confused by some comments I’ve seen about blocking and cudaMemcpy. It is my understanding that the Fermi HW can simultaneously execute kernels and do a cudaMemcpy.

I read that Lib func cudaMemcpy() is a blocking function. Does this mean the func will block further execution until the copy has has fully completed? OR Does this mean the copy won’t start until the previous kernels have finished?

e.g. Does this code provide the same blocking operation?

SomeCudaCall<<<25,34>>>(someData);
cudaThreadSynchronize();

vs

SomeCudaCall<<<25,34>>>(someParam);
cudaMemcpy(toHere, fromHere, sizeof(int), cudaMemcpyHostToDevice);

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-08T07:53:02+00:00

Editorial Team

2026-06-08T07:53:02+00:00Added an answer on June 8, 2026 at 7:53 am

Your examples are equivalent. If you want asynchronous execution you can use streams or contexts and cudaMemcpyAsync, so that you can overlap execution with copy.

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I’m confused by some comments I’ve seen about blocking and cudaMemcpy. It is my

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply