Suppose I have a two dimensional array in C++ under CUDA, stored in the

Question

Asked: June 12, 20262026-06-12T06:02:19+00:00 2026-06-12T06:02:19+00:00

Suppose I have a two dimensional array in C++ under CUDA, stored in the shared memory,
like so:

__shared__ float arr[4][4]; // C++ has a default row-major ordering

By default C++ will order the elements in arr in a row-major format.

That is it will allocate a continuous block of memory and store the elements like this (0,0), (0,1), (0,2), (0,3), (1,0), (1,1), … and so on…

Is there a way to tell the C++/CUDA compiler to arrange this in a column-major order?

You must login to add an answer.

Need An Account,

Editorial Team · Answer 1 · 2026-06-12T06:02:20+00:00

Editorial Team

Why don’t you just swap indexes you are using?

Instead of using arr[x][y] use arr[y][x].

Interesting is why you would like to do this. Maybe using cache memory could be helpful but I can’t tell for sure without details.

Hope it help.

The Archive Base Latest Questions