I would like to upload two images to the GPU memory, and I’m interested

Question

Asked: May 21, 2026

I would like to upload two images to GPU memory, and I’m interested in how fast I can do this.

In fact, will it be faster to compare two bitmaps in RAM with the CPU, or to upload them to the GPU and use GPU parallelism to do it?

1 Answer

Editorial Team · Answer 1 · 2026-05-21T23:54:12+00:00

If you run the CUDA bandwidth test sample (bandwidthTest), you’ll get a benchmark for the upload speed on your own hardware.

Assuming tri-channel DDR3-1600 RAM, you’ll get something like 38 GB/s of theoretical host memory bandwidth.

Take a typical midrange card like a GTX 460 and you’ll get something like 84 GB/s of device memory bandwidth. Note that the data first has to make a hop across the bus, which is something like 8 GB/s theoretical and ~5.5 GB/s in practice for a PCIe 2.0 x16 link.
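To put those figures side by side, here is a rough bandwidth-only estimate in Python. The image size (two 1920x1080 bitmaps at 4 bytes per pixel) is an assumption chosen purely for illustration, and the model ignores transfer latency, kernel launch overhead, and the cost of the comparison arithmetic itself, so treat the results as lower bounds rather than measurements.

```python
# Bandwidth figures quoted in the answer above, in GB/s (rough, not measured).
RAM_GBPS = 38.0    # tri-channel DDR3-1600 host memory
GPU_GBPS = 84.0    # GTX 460 device memory
PCIE_GBPS = 5.5    # PCIe 2.0 x16, practical throughput


def milliseconds_to_move(num_bytes, gb_per_s):
    """Time to stream num_bytes at the given bandwidth, in milliseconds."""
    return num_bytes / (gb_per_s * 1e9) * 1e3


# Assumed workload: two 1920x1080 bitmaps, 4 bytes per pixel.
width, height, bytes_per_pixel = 1920, 1080, 4
total_bytes = 2 * width * height * bytes_per_pixel

cpu_ms = milliseconds_to_move(total_bytes, RAM_GBPS)      # read both images from RAM
upload_ms = milliseconds_to_move(total_bytes, PCIE_GBPS)  # copy both images over the bus
gpu_scan_ms = milliseconds_to_move(total_bytes, GPU_GBPS) # read both images on the GPU
gpu_once_ms = upload_ms + gpu_scan_ms

print(f"CPU compare:            {cpu_ms:.2f} ms")
print(f"Upload + GPU compare:   {gpu_once_ms:.2f} ms "
      f"(upload {upload_ms:.2f} ms, scan {gpu_scan_ms:.2f} ms)")
```

Under these assumptions the bus copy alone (about 3 ms) costs several times more than scanning both images in RAM (about 0.4 ms), which is why a one-off comparison is unlikely to benefit from the GPU.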

Note that kotlinski’s answer isn’t quite correct. You can do the comparisons in parallel and then do a parallel reduction, in which case the higher GPU device bandwidth can win out eventually.
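The compare-then-reduce pattern can be sketched on the CPU in plain Python. This is only a simulation of the structure, not GPU code: the function names are hypothetical, and on a real device each per-byte comparison would run in its own thread and each reduction pass would run in parallel.

```python
def mismatch_flags(a, b):
    """Step 1: one independent comparison per byte (one GPU thread each)."""
    return [1 if x != y else 0 for x, y in zip(a, b)]


def tree_reduce_sum(values):
    """Step 2: pairwise (tree) reduction; each pass halves the partial sums."""
    vals = list(values)
    if not vals:
        return 0
    while len(vals) > 1:
        if len(vals) % 2:
            vals.append(0)  # pad odd-length passes with a neutral element
        # All pairs in one pass are independent, so a GPU can add them at once.
        vals = [vals[i] + vals[i + 1] for i in range(0, len(vals), 2)]
    return vals[0]


# Tiny stand-ins for two bitmaps; they differ at indices 1 and 4.
image_a = bytes([10, 20, 30, 40, 50, 60, 70])
image_b = bytes([10, 21, 30, 40, 55, 60, 70])

differing = tree_reduce_sum(mismatch_flags(image_a, image_b))
print("differing bytes:", differing)
```

The reduction needs about log2(n) passes instead of n sequential additions, which is what lets the GPU combine the per-pixel results without a serial bottleneck.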

I think the answer is likely to be: uploading to the GPU and doing the comparison once is a loss. There is a possible gain if the comparison is made multiple times (for example, if the images are kept and modified on the GPU).

Edit:

The multiple comparisons refer to the case where you modify the images in GPU memory in situ. Each modification would merit another comparison (caching the previous result doesn’t cut it), without incurring the penalty of another copy across the bus.
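A rough way to see when repeated comparisons start to pay off is to charge the bus copy once and the scan cost per comparison. The sketch below reuses the bandwidth figures from this answer (38, 84, and 5.5 GB/s) and assumes the comparison is purely memory-bandwidth bound, which ignores kernel launch overhead, reading the result back, and the cost of the modifications themselves.

```python
import math

# Bandwidth figures from the answer, in GB/s (rough, not measured).
RAM_GBPS = 38.0    # host memory
GPU_GBPS = 84.0    # device memory
PCIE_GBPS = 5.5    # PCIe 2.0 x16, practical


def cpu_cost(n_comparisons):
    """Seconds per GB of image data for n comparisons done in RAM."""
    return n_comparisons / RAM_GBPS


def gpu_cost(n_comparisons):
    """Seconds per GB: one upload over the bus, then n scans in device memory."""
    return 1.0 / PCIE_GBPS + n_comparisons / GPU_GBPS


# Smallest n for which the GPU path is cheaper under this model.
break_even = math.floor((1.0 / PCIE_GBPS) / (1.0 / RAM_GBPS - 1.0 / GPU_GBPS)) + 1
print("GPU wins from comparison number", break_even)
```

Because every term scales with the number of bytes, the break-even count in this simplified model does not depend on the image size; with these figures it comes out at 13 comparisons. Real overheads would shift that number, so it is best read as an order-of-magnitude guide.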
