I am working with a cuda program which I managed to assign a work

Question

0

Asked: June 14, 20262026-06-14T04:29:29+00:00 2026-06-14T04:29:29+00:00

I am working with a cuda program which I managed to assign a work

0

I am working with a cuda program which I managed to assign a work to one Stream Multiprocessor. For example, I have the works A and B and my GPU has 2 SMs (SM0 and SM1). Are there ways to assign the work A exactly to SM0 and the work B to SM1?

Can you suggest me some ways to do that?

Thanks for your help.

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-14T04:29:31+00:00

One approach would be to implement work A in (let’s say) kernelA and work B in kernelB and launch both as a 1*1 grid in separate streams, because on Fermi and Kepler GPUs such kernels can run concurrently. The reason for the 1*1 grid launch is that if you have more than one blocks then those blocks may execute on different SMs and in that case the two kernels cannot execute at the same time (i.e. only one kernel/SM)

cudaStream_t stream1, stream2;
cudaStreamCreate ( &stream1 );
cudaStreamCreate ( &stream2 );
kernelA<<<1, 512, 0, stream1>>>(...);
kernelB<<<1, 512, 0, stream2>>>(...);
...

For more details, see this NVIDIA presentation

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I am working with a cuda program which I managed to assign a work

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply