I want to speed up an array multiplication in C99. This is the original

Question

Editorial Team

Asked: May 16, 2026 (2026-05-16T03:34:57+00:00)

I want to speed up an array multiplication in C99.

These are the original for loops:

for (int i = 0; i < n; i++) {
    for (int j = 0; j < m; j++) {
        total[j] += w[j][i] * x[i];
    }
}

My boss asked me to try this, but it did not improve the speed:

for (int i = 0; i < n; i++) {
    float value = x[i];
    for (int j = 0; j < m; j++) {
        total[j] += w[j][i] * value;
    }
}

Do you have other ideas (apart from OpenMP, which I already use) on how I could speed up these for loops?
I am using:

gcc -DMNIST=1 -O3 -fno-strict-aliasing -std=c99 -lm -D_GNU_SOURCE -Wall -pedantic -fopenmp

Thanks!

1 Answer

Editorial Team · Answer 1 · 2026-05-16T03:34:57+00:00

One theory is that comparing against zero is cheaper than comparing j < m, so counting j down from m to 0 could in principle save a few nanoseconds per iteration. In my recent experience, however, this has made no measurable difference, so I don't think it holds for current CPUs.
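For reference, this is roughly what the count-down variant looks like when applied to your inner loop. It is only a sketch: the function name matvec_countdown and the C99 variable-length array parameter w[m][n] are my assumptions, since the question does not show how w, x and total are declared.

```c
/* Hypothetical wrapper around the loops from the question.
 * Assumes w is declared as float w[m][n] (m rows, n columns). */
void matvec_countdown(int m, int n, float w[m][n], const float *x, float *total)
{
    for (int i = 0; i < n; i++) {
        float value = x[i];
        /* Count j down from m-1 to 0, so the loop test is a compare with zero. */
        for (int j = m; j-- > 0; ) {
            total[j] += w[j][i] * value;
        }
    }
}
```

An optimizing compiler is free to rewrite the loop test itself, which may be why this change rarely shows up in measurements.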

Another issue is memory layout: if your inner loop walks through memory that is contiguous rather than spread out, you are much more likely to benefit from the fastest (lowest-level) cache in your CPU.

In your current example, the inner loop over j jumps a whole row of w on every iteration, so switching the layout of w from w[j][i] to w[i][j] may help. Aligning your values on 4- or 8-byte boundaries helps as well (but you will find that this is already the case for your arrays).
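A minimal sketch of the layout switch, assuming w is a float w[m][n] array and that you can afford to build the transposed copy once up front. The names transpose, matvec_transposed and wt are made up for this example.

```c
/* Build the transposed copy once: wt[i][j] == w[j][i]. */
void transpose(int m, int n, float w[m][n], float wt[n][m])
{
    for (int j = 0; j < m; j++) {
        for (int i = 0; i < n; i++) {
            wt[i][j] = w[j][i];
        }
    }
}

/* Same computation as in the question, but the inner loop now reads
 * wt[i][0], wt[i][1], ... which are adjacent in memory. */
void matvec_transposed(int m, int n, float wt[n][m], const float *x, float *total)
{
    for (int i = 0; i < n; i++) {
        float value = x[i];
        for (int j = 0; j < m; j++) {
            total[j] += wt[i][j] * value;
        }
    }
}
```

The transpose only pays off if w is reused for many multiplications. If it is not, it would be better to fill w in the [i][j] layout in the first place.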

Another option is loop unrolling, meaning that the inner loop processes its iterations in chunks of, say, 4, so the loop-termination test runs only a quarter as often. The optimum chunk size must be determined empirically and may also depend on the problem at hand (e.g. if you know the iteration count is always a multiple of 5, use 5).
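Unrolling by 4 could look something like the sketch below. The helper accumulate_unrolled4 and its row parameter are hypothetical: it assumes the m values multiplied by value are available as a contiguous array, as they would be after the layout switch.

```c
/* total[j] += row[j] * value for j in [0, m), four elements per iteration. */
void accumulate_unrolled4(int m, const float *row, float value, float *total)
{
    int j = 0;
    /* Main loop: one loop test per four updates. */
    for (; j + 3 < m; j += 4) {
        total[j]     += row[j]     * value;
        total[j + 1] += row[j + 1] * value;
        total[j + 2] += row[j + 2] * value;
        total[j + 3] += row[j + 3] * value;
    }
    /* Remainder loop: handles the last m % 4 elements. */
    for (; j < m; j++) {
        total[j] += row[j] * value;
    }
}
```

Since you already compile with -O3, gcc may well unroll or vectorize the simple loop by itself (there is also the -funroll-loops flag), so measure before keeping a hand-unrolled version.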
