All Projects → vsmolyakov → DP_means

vsmolyakov / DP_means

Licence: MIT license
Dirichlet Process K-means

Programming Languages

C++
36643 projects - #6 most used programming language
julia
2034 projects
python
139335 projects - #7 most used programming language
matlab
3953 projects
CMake
9771 projects

Projects that are alternatives of or similar to DP means

Libcluster
An extensible C++ library of Hierarchical Bayesian clustering algorithms, such as Bayesian Gaussian mixture models, variational Dirichlet processes, Gaussian latent Dirichlet allocation and more.
Stars: ✭ 129 (+258.33%)
Mutual labels:  clustering, bayesian-inference
TuringBnpBenchmarks
Benchmarks of Bayesian Nonparametric models in Turing and other PPLs
Stars: ✭ 24 (-33.33%)
Mutual labels:  bayesian-inference, bayesian-nonparametric-models
Heart disease prediction
Heart Disease prediction using 5 algorithms
Stars: ✭ 43 (+19.44%)
Mutual labels:  clustering
R-stats-machine-learning
Misc Statistics and Machine Learning codes in R
Stars: ✭ 33 (-8.33%)
Mutual labels:  clustering
syncflux
SyncFlux is an Open Source InfluxDB Data synchronization and replication tool for migration purposes or HA clusters
Stars: ✭ 145 (+302.78%)
Mutual labels:  clustering
A-quantum-inspired-genetic-algorithm-for-k-means-clustering
Implementation of a Quantum inspired genetic algorithm proposed by A quantum-inspired genetic algorithm for k-means clustering paper.
Stars: ✭ 28 (-22.22%)
Mutual labels:  clustering
cgpm
Library of composable generative population models which serve as the modeling and inference backend of BayesDB.
Stars: ✭ 24 (-33.33%)
Mutual labels:  bayesian-inference
Clustering4Ever
C4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.
Stars: ✭ 126 (+250%)
Mutual labels:  clustering
hierarchical-clustering
A Python implementation of divisive and hierarchical clustering algorithms. The algorithms were tested on the Human Gene DNA Sequence dataset and dendrograms were plotted.
Stars: ✭ 62 (+72.22%)
Mutual labels:  clustering
statrethink course in pymc3
Statistical Rethinking course in pymc3
Stars: ✭ 141 (+291.67%)
Mutual labels:  bayesian-inference
kmeans-clusterer
k-means clustering in Ruby
Stars: ✭ 88 (+144.44%)
Mutual labels:  clustering
metaBMA
Bayesian Model Averaging for Random and Fixed Effects Meta-Analysis
Stars: ✭ 20 (-44.44%)
Mutual labels:  bayesian-inference
PyRate
PyRate is a program to estimate speciation, extinction, and preservation rates from fossil occurrence data using a Bayesian framework.
Stars: ✭ 58 (+61.11%)
Mutual labels:  bayesian-inference
mathematics-statistics-for-data-science
Mathematical & Statistical topics to perform statistical analysis and tests; Linear Regression, Probability Theory, Monte Carlo Simulation, Statistical Sampling, Bootstrapping, Dimensionality reduction techniques (PCA, FA, CCA), Imputation techniques, Statistical Tests (Kolmogorov Smirnov), Robust Estimators (FastMCD) and more in Python and R.
Stars: ✭ 56 (+55.56%)
Mutual labels:  clustering
northstar
Single cell type annotation guided by cell atlases, with freedom to be queer
Stars: ✭ 23 (-36.11%)
Mutual labels:  clustering
enspara
Modeling molecular ensembles with scalable data structures and parallel computing
Stars: ✭ 28 (-22.22%)
Mutual labels:  clustering
WatsonCluster
A simple C# class using Watson TCP to enable a one-to-one high availability cluster.
Stars: ✭ 18 (-50%)
Mutual labels:  clustering
PyCBC-Tutorials
Learn how to use PyCBC to analyze gravitational-wave data and do parameter inference.
Stars: ✭ 91 (+152.78%)
Mutual labels:  bayesian-inference
eris-fleet
Cluster management for Discord bots using the Eris library.
Stars: ✭ 38 (+5.56%)
Mutual labels:  clustering
hazelcast-csharp-client
Hazelcast .NET Client
Stars: ✭ 98 (+172.22%)
Mutual labels:  clustering

DP_means

Dirichlet Process K-means

Description

DP K-means is a bayesian non-parametric extension of the K-means algorithm based on small variance assymptotics (SVA) approximation of the Dirichlet Process Mixture Model.

It doesn't require prior knowledge of the number of clusters K. The cluster penalty parameter lambda is set based on the data by taking the maximum distance to the K++ means initialization. Normalized Mutual Information (NMI) is used to compare posterior cluster assignments with the ground truth.

Reference

B. Kulis and M. Jordan, "Revisiting k-means: New Algorithms via Bayesian Nonparametrics"

Dependencies

Matlab 2015a
Python 2.7
Eigen3

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].