Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Try using DBSCAN for initial clustering #9

Open
kbarbary opened this issue Sep 4, 2015 · 0 comments
Open

Try using DBSCAN for initial clustering #9

kbarbary opened this issue Sep 4, 2015 · 0 comments
Milestone

Comments

@kbarbary
Copy link
Owner

kbarbary commented Sep 4, 2015

In the multi-ellipsoidal clustering, clusters are created by recursively splitting points into two clusters. In some (contrived) examples, this results in strange partitioning. Example:

image

In this case, it would clearly be better to bound the center points in a single ellipsoid. What probably happened here is that the points were initially split into two clusters vertically, meaning that the center mode was split into two right away and so could never be reassembled.

It might be helpful to do initial clustering using an algorithm with a non-fixed number of clusters such as DBSCAN. However DBSCAN would not split curving degeneracies into multiple clusters, so we'd still need to use the current algorithm for that.

I'm not sure how big a problem this is in "real-world" problems.

@kbarbary kbarbary changed the title Use DBSCAN for initial clustering Try using DBSCAN for initial clustering Sep 4, 2015
@kbarbary kbarbary modified the milestone: v0.2 Sep 4, 2015
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant