I want to port my c code to CUDA. The main computational part contains

Question

0

Asked: May 23, 20262026-05-23T09:10:56+00:00 2026-05-23T09:10:56+00:00

I want to port my c code to CUDA. The main computational part contains

0

I want to port my c code to CUDA. The main computational part contains 3 for nested loops:

for (int i=0; i< Nx;i++){
  for (int j=0;j<Ncontains[i];j++){
    for (int k=0;k< totalVoxels;k++){
          .......
   }
  }
}

How can I translate that to my CUDA kernel? With two for loops I could do something like:

int n= blockIdy.y * blockDim.y + threadIdx.y;
int i= blockIdx.x * blockDim.x + threadIdx.x;

But how can I initially this get running?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-23T09:10:57+00:00

Editorial Team

2026-05-23T09:10:57+00:00Added an answer on May 23, 2026 at 9:10 am

Many ways you can do it, One of them is:

for (int i=blockIdx.x; i< Nx; i += gridDim.x){
  for (int j=threadIdx.y; j<Ncontains[i]; j+= blockDim.y){
    for (int k=threadIdx.x; k< totalVoxels; k += blockDim.x){
          .......
   }
  }
}

The above you would call:

// nx,ny block dimensions
kernel <<< dim3(nBlocks), dim3(nx, ny) >>> (...);

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I want to port my c code to CUDA. The main computational part contains

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply