I am writing program to implement k-means clustering. consider a simple input with 4

Question

0

Asked: June 16, 20262026-06-16T05:09:30+00:00 2026-06-16T05:09:30+00:00

I am writing program to implement k-means clustering. consider a simple input with 4

0

I am writing program to implement k-means clustering.

consider a simple input with 4 vertices a,b,c and d with following edge costs

[vertex1] [vertex2] [edge cost]
a b 1
a c 2
a d 3
b d 4
c d 5

Now I need to make the program run until i get 2 clusters.

My doubt is, in the first step when calculate the minimum distance it is a->b (edge cost 1). Now I should consider ab as a single cluster. If that is the case, what will be the distance of ab from c and d?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-16T05:09:31+00:00

The K-means algorithm works as follows:

choose k points as initial centroids (hence, K-*);
calculate the distance from all vertices to the k centroids choosen;
assign each vertex to the closest centroid;
recalculate the position of the centroids by generating the mean between all the vertices that belong to the centroid (hence, k-means, one mean calculation for each of the k centroids);
go to step 2 and stop when, in step 3, no vertex get assigned to another centroid — or until your error condition gets satisfied.

In your case, as you have an undirected graph, it’d be better for you to generate the coordinates of each vertex considering the edge distances, and then, apply the algorithm.

If you don’t want to do this initial process, you may calculate the distance from a vertex to all other reachable vertices, but you’d have to do this for every iteration — which is quite an unnecessary overhead.

For your undirected graph:

[vertex1] [vertex2] [edge cost]
a b 1
a c 2
a d 3
b d 4
c d 5

The table of distances would be something like:

     a    b    c    d
a    0    1    2    3
b    1    0   (1)   4
c    2   (1)   0    5
d    3    4    5    0

(1) - b to c = (b to a, a to c) = 3

If this should be your table, simply apply the Dijkstra algorithm on your graph, for each vertex, and consider the resultant table your table of distances.

The table would have the minimal distances, but, if you have any other policy to calculate it, it’s totally up to you saying how to calculate it.

Notice also that, if your graph is directed, the matrix will not be symmetric, as it is, in this case.

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I am writing program to implement k-means clustering. consider a simple input with 4

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply