Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 8512941
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: June 11, 20262026-06-11T04:25:06+00:00 2026-06-11T04:25:06+00:00

I have a directed graph (actually it is a hypergraph, but its ok to

  • 0

I have a directed graph (actually it is a hypergraph, but its ok to ignore that for the moment).

From this graph I pick various subgraphs and I’m looking for a function that ranks various such subsets by their ‘cluster quality’.

‘cluster quality’ should be high when lots of links exist between the members of the subset

‘cluster quality’ should be low when lots of links exist from many members of the subset to or from outside the subset.

My question is:

  • What is the correct term for ‘cluster quality’.?

  • What are the relevant algorithms / functions that do exist in this context?

  • What implementations exist on the JVM. Scala preferred, but anything callable from java is fine?

Background: The idea is to extract words from source code (class & method names or pieces thereof) and find those that might describe the application best by finding those that are used by ‘good clusters’ thus possibly representing some knowledge concept in the code.

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-06-11T04:25:07+00:00Added an answer on June 11, 2026 at 4:25 am

    In regards to algorithms/functions that are relevant to cluster analysis, there are several. Clustering in graphs is closely related to graph partitioning, which has recently been an active field of study, especially with the emergence of online social networks like Facebook and Twitter whose underlying structure is naturally represented by a (social) graph.

    That being said, in my experience, two measures of clustering come to mind. One is modularity, which basically compares the sub-graph (cluster) to what a sub-graph would look like if the edges were distributed randomly.

    Another is conductance, which measures how fast a random walk on the cluster candidate will converge to some uniform distribution.

    Another, more loose measure, is looking at clustering coefficient, which measures the number of triangles (3-cycles) in a graph versus the number of possible triangles that could exist.

    All in all, there’s lots of algorithms (and academic papers) pertaining to this topic, the three I mention above are more general use cases.

    Regarding an implementation on the JVM, there are no libraries I am aware of that come with such algorithms as part of it, but popular graph libraries for Scala are Graph for Scala (to be incorporated into the Scala Extended Core Library in the future) and Cassovary, released by Twitter.

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

I have an unweighted directed graph that may or may not have cycles in
I have a d3 force-directed graph that contains a group of nodes: var node
Suppose that you have a directed graph with nonnegative, integer edge lengths that are
I have a directed cyclic graph with values at edges but no values at
I have defined a C#-class, that shall be the elements of a directed graph
I have a directed graph described by A -> B meaning that there exists
I have a data structure that represents a directed graph and I'm looking for
I have a data structure that represents a directed graph, and I want to
I have a directed acyclic graph I'm trying to implement but I'm not sure
I am implementing this algorithm for a directed graph. But the interesting thing about

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.