Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 562551
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: May 13, 20262026-05-13T12:34:00+00:00 2026-05-13T12:34:00+00:00

Assume a group of data points, such as one plotted here (this graph isn’t

  • 0

Assume a group of data points, such as one plotted here (this graph isn’t specific to my problem, but just used as a suitable example):

Inspecting the scatter graph visually, it’s fairly obvious the data points form two ‘groups’, with some random points that do not obviously belong to either.

I’m looking for an algorithm, that would allow me to:

  • start with a data set of two or more dimensions.
  • detect such groups from the dataset without prior knowledge on how many (or if any) might be there
  • once the groups have been detected, ‘ask’ the model of groups, if a new sample point seems to fit to any of the groups
  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-05-13T12:34:00+00:00Added an answer on May 13, 2026 at 12:34 pm

    There are many choices, but if you are interested in the probability that a new data point belongs to a particular mixture, I would use a probabilistic approach such as Gaussian mixture modeling either estimated by maximum likelihood or Bayes.

    Maximum likelihood estimation of mixtures models is implemented in Matlab.

    Your requirement that the number of components is unknown makes your model more complex. The dominant probabilistic approach is to place a Dirichlet Process prior on the mixture distribution and estimate by some Bayesian method. For instance, see this paper on infinite Gaussian mixture models. The DP mixture model will give you inference over the number of components and the components each elements belong to, which is exactly what you want. Alternatively you could perform model selection on the number of components, but this is generally less elegant.

    There are many implementation of DP mixture models models, but they may not be as convenient. For instance, here’s a Matlab implementation.

    Your graph suggests you are an R user. In that case, if you are looking for prepacked solutions, the answer to your question lies on this Task View for cluster analysis.

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

Assume that I have a set of data pair where index 0 is the
Sometimes I come across this problem where you have a set of functions that
I hardly know how to state this question, let alone search for answers. But
I am building an ad analytics tool which assumes a data structure like this:
Assume java 1.6 and leopard. Ideally, it would also be nice to get a
(assume php5) consider <?php $foo = 'some words'; //case 1 print these are $foo;
Assume you are doing something like the following List<string> myitems = new List<string> {
Assume a table structure of MyTable(KEY, datafield1, datafield2...) . Often I want to either
Assume the following class: public class MyEnum: IEnumerator { private List<SomeObject> _myList = new
Assume that we have N erlang nodes, running same application. I want to share

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.