I’m implementing a distance matrix that calculates the distance between each point and all

Question

0

Asked: May 30, 20262026-05-30T09:22:30+00:00 2026-05-30T09:22:30+00:00

I’m implementing a distance matrix that calculates the distance between each point and all

0

I’m implementing a distance matrix that calculates the distance between each point and all the other points and I have 100,000 points, so my matrix size will be 100,000 x 100,000. I implemented that using vector<vector<double> > dist. However, for this large data size it give out of memory error. The following is my code and any help will be really appreciated.

vector<vector<double> > dist(dat.size()) vector<double>(dat.size()));
size_t p,j;
ptrdiff_t i;
#pragma omp parallel for private(p,j,i) default(shared)
for(p=0;p<dat.size();++p)
{
// #pragma omp parallel for private(j,i) default(shared)
for (j = p + 1; j < dat.size(); ++j)
{
double ecl = 0.0;
for (i = 0; i < c; ++i)
{
ecl += (dat[p][i] - dat[j][i]) * (dat[p][i] - dat[j][i]);
}
ecl = sqrt(ecl);
dist[p][j] = ecl;
dist[j][p] = ecl;
}
}

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-30T09:22:31+00:00

A 100000 x 100000 matrix? A quick calculation shows why this is never going to work:

100000 x 100000 x 8 (bytes) / (1024 * 1024 * 1024) = 74.5 gigabytes...

Even if it was possible to allocate this much memory I doubt very much whether this would be an efficient approach for a real problem.

If you’re looking to do some kind of geometric processing on large data sets you may be interested in some kind of spatial tree structure: kd-trees, quadtrees, r-trees maybe?

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I’m implementing a distance matrix that calculates the distance between each point and all

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply