I’m implementing am algorithm on C/C++ to process some vectors and I thought it

Question

0

Asked: May 29, 20262026-05-29T06:13:32+00:00 2026-05-29T06:13:32+00:00

I’m implementing am algorithm on C/C++ to process some vectors and I thought it

0

I’m implementing am algorithm on C/C++ to process some vectors and I thought it could be a good idea to make it parallel since I’m working with a multicore CPU. I have some experience with GPGPU and there bad memory access can ruin the entire performance, do I need to consider any special access layout between the cores on the CPU also?

Thanks

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-29T06:13:33+00:00

There are a number of memory-related problems you can run into with a multiprocessor setup, and some of them can slow an application to a crawl.

You need to be roughly aware of the cache line size on your box and attempt 2 things:

Limit the number of data cache lines (particularly cache lines you write to) accessed in close time sequence by a single thread. Ie, avoid “dirtying” more cache lines than you must.
Avoid like the plague having two separate threads “simultaneously” access the same data cache line, with either one writing.

(The above two rules also apply to data pages, if you’re dealing with large data structures that must be paged.)

Where possible, set up separate working data structures (especially heap) for each thread, rather than sharing the data. Especially beware of having a common counter that all threads update, and (obviously) avoid locks and semaphores except at critical junctures where you absolutely need to synchronize threads.

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I’m implementing am algorithm on C/C++ to process some vectors and I thought it

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply