Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • Home
  • SEARCH
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 7919433
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: June 3, 20262026-06-03T15:51:50+00:00 2026-06-03T15:51:50+00:00

I have a clustering problem that could be summarized this way: i have N

  • 0

I have a clustering problem that could be summarized this way:

  • i have N particles in a 3D spaces
  • each particle can interact with a different number of other particles
  • each interaction has a strength
  • i don’t know the number of cluster a priori
  • i don’t have leaning samples (should be unsupervised)

Output: i’d like to get:

  • the number of clusters
  • a probability for each particle to be part of a cluster (to be able to remove particles not clearly assigned)
  • i want to call the clusterer directly from my java code.

Question:

  • what clusterer would fit best to my problem?
  • how should i format my data?
  • should i use the 3D positioning information in complement to the interaction information?
  • how can i get the result for each particle?

I’m very new to weka, but from what i could find on the Internet:

  • SOM could solve my problem
  • it is a multi-instance problem but i could find any examples showing how to create relational data. and does SOM support relational attributes?

Thanks for your help.
jeannot

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-06-03T15:51:51+00:00Added an answer on June 3, 2026 at 3:51 pm

    Weka is very “limited” when it comes to clustering. It has only very few clustering algorithms, and they are quite limited. I’m not sure if you could put in the interaction strength into any of the Weka clustering algorithms.

    You might want to have a look at ELKI. It has much more advanced clustering algorithms than Weka, and they are very flexible. For example, you can easily define your own distance function (Tutorial) and use it in any distance-based clustering algorithm.

    Choosing the appropriate clustering algorithm is nothing we can answer here. You need to try some and try different parameters. The key question you should try to answer first is: what is a useful cluster for you?

    You have started to pose some of these questions. For example, whether you want to use interaction strength only, or whether to also include positional information. But as I do not know what you want to achieve, I can’t tell you how.

    Definitely have a look at the DBSCAN and OPTICS algorithms (in particular for OPTICS, don’t use the one in Weka. It is slow, incomplete and unmaintained!). Maybe start reading their Wikipedia articles, if that makes any sense for your task. Here is why I believe they are helpful for you:

    • They do not need to know the number of clusters (unlike k-means and EM clustering)
    • They need a “minimum points” parameter, which is essentially a “minimum cluster size”; it controls how fine-grained the result becomes. Increase it to get fewer and larger clusters.
    • They can use arbitrary distance or similarity functions (for example, interaction strength). For DBSCAN you need to set a threshold to consider significant, for OPTICS this is not necessary.

    Next I would probably use the interaction-strength data with OPTICS and try the Xi-extraction of clusters, if they make any sense for your use case. (Weka doesn’t have the Xi extraction). Or maybe look at the OPTICS plot first, to see if your similarity and MinPts parameter actually produce the “valleys” you need for OPTICS.
    DBSCAN is faster, but you need to fix the distance threshold. If your data set is very large, you might want to start with OPTICS on a sample, then decide on a few epsilon-values and run DBSCAN on the full dataset with these values.

    Still, start reading here to see if that makes sense for your task:

    https://en.wikipedia.org/wiki/DBSCAN#Basic_idea

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

I have hit a real problem. I need to do some Kmeans clustering for
have a problem. At first look at this HTML <div id=map style=background-image: url(map.png); width:
I have a function that performs a hierarchical clustering on a list of input
I have what feels like a simple problem, but I can't seem to find
We have an error that we can't seem to find and don't have the
Important note: I've asked this question already on ServerFault: https://serverfault.com/questions/349065/clustering-tcp-servers-so-can-send-data-to-all-clients , but I'd also
I am using gmap3 script to show clustering markers, but i have some questions
have written this little class, which generates a UUID every time an object of
I have a question about k-means clustering in R. Actually i'm doing everything according
I have several web-servers and need them to use Quartz. The clustering feature of

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.