I’m using OpenCL and have ATI 4850 card. It has: CL_DEVICE_MAX_COMPUTE_UNITS: 10 CL_DEVICE_MAX_WORK_ITEM_DIMENSIONS: 3

Question

Editorial Team

Asked: May 23, 2026

I’m using OpenCL and have an ATI 4850 card. It has:

CL_DEVICE_MAX_COMPUTE_UNITS: 10
CL_DEVICE_MAX_WORK_ITEM_DIMENSIONS: 3
CL_DEVICE_MAX_WORK_GROUP_SIZE: 256
CL_DEVICE_MAX_WORK_ITEM_SIZES: (256, 256, 256)
CL_DEVICE_AVAILABLE: 1
CL_DEVICE_NAME: ATI RV770

How many tasks can it execute simultaneously?

Is it CL_DEVICE_MAX_COMPUTE_UNITS * CL_DEVICE_MAX_WORK_ITEM_SIZES = 2560?

To be more specific: a single-core processor can execute only one task at any given moment, and a dual-core processor can execute two. How many tasks can my GPU execute at one moment? Or, rephrased: how many processors does my GPU have?

1 Answer

Editorial Team · Answer 1 · 2026-05-23T08:15:52+00:00

The RV770 has 10 SIMD cores, each consisting of 16 shader cores, each of which in turn consists of 5 ALUs (a VLIW5 architecture). That gives a total of 10 × 16 × 5 = 800 ALUs that can do parallel computations. I don’t think there’s a way to get all these numbers out of OpenCL. I’m also not sure what you would equate to a CPU core. Perhaps a shader core? You can read about VLIW on Wikipedia. It’s an interesting design.

If you say a CPU core is only executing one “task” at any given time, even though it has multiple ALUs working in parallel, then I guess you can say the RV770 would be working on 160 tasks (10 SIMD cores × 16 shader cores). But given how differently chips work, I think “core” and “task” become difficult to define. A CPU with hyperthreading can even execute two sets of code at the same time. With OpenCL, I don’t believe it is possible yet to execute more than one kernel at any given time, unless recent driver updates have changed that.
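To make the arithmetic behind those counts explicit, here is a minimal C sketch. The struct and function names are made up for illustration, and the figures are the RV770 hardware numbers quoted above, hard-coded rather than read from any OpenCL query.

```c
/* Hypothetical description of a VLIW-style GPU layout.
   The field names are illustrative, not part of any OpenCL API. */
struct gpu_layout {
    unsigned simd_cores;            /* 10 on the RV770 */
    unsigned shader_cores_per_simd; /* 16 on the RV770 */
    unsigned alus_per_shader_core;  /* 5 on the RV770 (VLIW5) */
};

/* RV770 figures as quoted in the answer (assumed, not queried). */
struct gpu_layout rv770_layout(void)
{
    struct gpu_layout g = { 10, 16, 5 };
    return g;
}

/* Shader cores: the closest thing to "tasks in flight" in the answer. */
unsigned total_shader_cores(struct gpu_layout g)
{
    return g.simd_cores * g.shader_cores_per_simd;
}

/* Individual ALUs across the whole chip. */
unsigned total_alus(struct gpu_layout g)
{
    return total_shader_cores(g) * g.alus_per_shader_core;
}
```

For the RV770 layout this gives 160 shader cores and 800 ALUs. The 2560 from the question (10 × 256) is a different kind of number: 256 is the limit on work-items per work-group, not a count of processors.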

Anyway, I think it is more important to present your work to the GPU in a way that gives the best performance. Unfortunately, as far as I know, there’s no way to find the best work group size other than experimenting. One help is that if the drivers support OpenCL 1.1, you can query CL_KERNEL_PREFERRED_WORK_GROUP_SIZE_MULTIPLE (through clGetKernelWorkGroupInfo) and set your work size to a multiple of that. Otherwise, going for a multiple of 64 is probably a safe bet.
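As a rough sketch of what experimenting could look like, the C helpers below list the candidate work group sizes to try and pad the global size to match. The function names are hypothetical. The multiple is assumed to come either from the CL_KERNEL_PREFERRED_WORK_GROUP_SIZE_MULTIPLE query or from the fallback of 64, and the upper limit from CL_DEVICE_MAX_WORK_GROUP_SIZE (256 on this card). No OpenCL call is made here.

```c
#include <assert.h>
#include <stddef.h>

/* Fill `out` with every multiple of `multiple` that does not exceed
   `max_group_size`, up to `out_cap` entries. Returns how many were
   written. These are candidates to benchmark, not a recommendation. */
size_t candidate_local_sizes(size_t multiple, size_t max_group_size,
                             size_t *out, size_t out_cap)
{
    assert(multiple > 0);
    size_t count = 0;
    for (size_t size = multiple;
         size <= max_group_size && count < out_cap;
         size += multiple) {
        out[count++] = size;
    }
    return count;
}

/* Round the number of work-items up to a multiple of the local size.
   OpenCL 1.x expects the global size to be evenly divisible by the
   local size, so the kernel has to skip the padding items itself. */
size_t round_up_global(size_t work_items, size_t local_size)
{
    assert(local_size > 0);
    size_t remainder = work_items % local_size;
    return remainder == 0 ? work_items
                          : work_items + (local_size - remainder);
}
```

With a multiple of 64 and a limit of 256, the candidates are 64, 128, 192 and 256. For 1000 work-items and a local size of 64, the padded global size is 1024. Each candidate would then be timed with the real kernel to see which one performs best.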
