Consider a simple example: vector addition.
If I build a program for CL_DEVICE_TYPE_GPU and then build the same program for CL_DEVICE_TYPE_CPU, what is the difference between the two builds (other than that the “CPU program” runs on the CPU and the “GPU program” runs on the GPU)?
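To make the question concrete, the kernel I have in mind looks roughly like the sketch below. The kernel name `vec_add` and the argument names `a`, `b`, `c`, and `n` are placeholders for illustration only; the same source string would be passed to the build step for both device types.

```c
// Hypothetical vector-addition kernel (OpenCL C); all names are illustrative.
// The identical source would be built once for a CPU device and once for a GPU device.
__kernel void vec_add(__global const float *a,
                      __global const float *b,
                      __global float *c,
                      const unsigned int n)
{
    // Each work-item handles one element, selected by its global index.
    size_t i = get_global_id(0);

    // Guard against work-items past the end when the global size is rounded up.
    if (i < n) {
        c[i] = a[i] + b[i];
    }
}
```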
Thanks for your help.