I have a class that I use on both host and device code, to

Question

0

Asked: June 11, 20262026-06-11T04:47:21+00:00 2026-06-11T04:47:21+00:00

I have a class that I use on both host and device code, to

0

I have a class that I use on both host and device code, to allow for easier data passing. This class has some method that manipulates the data. A simple example is:

struct Vector {
  float x, y, z;
  __host__ __device__ Vector(float _x, float _y, float _z) {
    //...
  }
};

If I implement this class on a header file, it works fine and nvcc is happy. However, if I try to implement the constructor on the source file, nvcc complains the constructor is non-inlined. Is there anyway to bypass this or that is just a limitation of the compiler?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-11T04:47:22+00:00

Up until CUDA 5.0 the CUDA compiler has had the restriction that everything required by a kernel (i.e. a __global__ function) must be in a single translation unit. For pre-Fermi devices (i.e. compute capability 1.x) the compiler also had to inline all __device__ functions. So if you have the struct defined in file a.cu and the __global__ kernel that uses the struct defined in b.cu, then when the compiler is processing b.cu it would be unable to find the __device__ function.

With CUDA 5.0 you are able to compile the two files separately and link them together. This still requires Fermi or later (2.x or later).

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I have a class that I use on both host and device code, to

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply