Clustering methods in machine learning, with both theory and Python code for each algorithm. Algorithms include K-Means, K-Modes, hierarchical clustering, DBSCAN, and Gaussian mixture models (GMM). Interview questions on clustering are included at the end.

Stars: ✭ 27 (+107.69%)

Mutual labels: clustering, clustering-algorithm

Clustering

Implements "Clustering a Million Faces by Identity"

Stars: ✭ 128 (+884.62%)

Mutual labels: clustering, clustering-algorithm

Hdbscan

A high performance implementation of HDBSCAN clustering.

Stars: ✭ 2,032 (+15530.77%)

Mutual labels: clustering, clustering-algorithm

Uci Ml Api

Simple API for UCI Machine Learning Dataset Repository (search, download, analyze)

Stars: ✭ 190 (+1361.54%)

Mutual labels: clustering

dmmclust

dmmclust is a package for clustering short texts, based on Yin and Wang (2014)

Stars: ✭ 23 (+76.92%)

Mutual labels: clustering

Pqkmeans

Fast and memory-efficient clustering

Stars: ✭ 189 (+1353.85%)

Mutual labels: clustering

Vectorai

Vector AI — A platform for building vector based applications. Encode, query and analyse data using vectors.

Stars: ✭ 195 (+1400%)

Mutual labels: clustering

Machine-Learning-Algorithms

All Machine Learning Algorithms

Stars: ✭ 24 (+84.62%)

Mutual labels: clustering

Timeseries Clustering Vae

Variational Recurrent Autoencoder for timeseries clustering in pytorch

Stars: ✭ 190 (+1361.54%)

Mutual labels: clustering

scicloj.ml

A Clojure machine learning library

Stars: ✭ 152 (+1069.23%)

Mutual labels: clustering

Clustergrammer

An interactive heatmap visualization built using D3.js

Stars: ✭ 188 (+1346.15%)

Mutual labels: clustering

Dtwclust

R Package for Time Series Clustering Along with Optimizations for DTW

Stars: ✭ 185 (+1323.08%)

Mutual labels: clustering

pypmc

Clustering with variational Bayes and population Monte Carlo

Stars: ✭ 46 (+253.85%)

Mutual labels: clustering

Pottslab

Unsupervised multilabel image segmentation (color/gray/multichannel) based on the Potts model (aka piecewise constant Mumford-Shah model)

Stars: ✭ 97 (+646.15%)

Mutual labels: clustering

Clustering With Deep Learning

Generic implementation for clustering with deep learning : representation learning (DNN) + clustering

Stars: ✭ 236 (+1715.38%)

Mutual labels: clustering

Clueminer

Clueminer is a platform for interactive data-mining with special focus on clustering algorithms.

Building latest development version

Prerequisites:

git
Java 8 or newer
Maven
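
Before building, it can help to confirm that the prerequisite tools are on the PATH. The following is a minimal sketch, not part of Clueminer itself: `check_tools` is a hypothetical helper name, and it only checks that the commands exist, not that the Java version is 8 or newer.

```shell
# check_tools (hypothetical helper): report every command that is not on the PATH.
# Returns 0 when all commands are found, 1 otherwise.
check_tools() {
    local missing=0
    for tool in "$@"; do
        if ! command -v "$tool" >/dev/null 2>&1; then
            echo "missing: $tool"
            missing=1
        fi
    done
    return "$missing"
}

# Check the three prerequisites listed above.
check_tools git java mvn || echo "Install the missing tools before building."
```

Run `java -version` separately to verify the Java version.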

On Debian/Ubuntu Linux:

   apt install maven libjhdf5 git
   git clone https://github.com/deric/clueminer.git
   cd clueminer
   git submodule init
   git submodule update
   mvn clean install

Once built, you can run Clueminer via Maven:

cd modules/application
mvn nbm:cluster-app nbm:run-platform

or use the generated bin file:

bash modules/application/target/clueminer/bin/clueminer

From NetBeans

Open the clueminer directory (a Maven module named clueminer-parent).
From the dependent modules of clueminer-parent, open clueminer-app.
Execute "Build with dependencies".
Run the main application (run module clueminer-app).

How to increase heap size

In application/src/main/resources/clueminer.conf, adjust the Java options:

default_options="--branding clueminer -J-Xms24m -J-Xmx2048m"

Note: setting the maximum heap size (-J-Xmx) to a value larger than your physical RAM will cause serious performance issues!

On Linux systems you can find your total memory size (in MB) with this command:

echo $(( $(awk '/MemTotal/{print $2}' /proc/meminfo) >> 10 ))m
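
The memory check above can be turned into a suggested `default_options` line. This is a minimal sketch under stated assumptions: `suggest_xmx_mb` is a hypothetical helper, and capping the heap at half of physical RAM is an illustrative rule of thumb, not a Clueminer recommendation.

```shell
# suggest_xmx_mb (hypothetical helper): given total memory in kB
# (the unit of MemTotal in /proc/meminfo), print half of it in MB.
# ASSUMPTION: "half of physical RAM" is only an illustrative cap.
suggest_xmx_mb() {
    local total_kb=$1
    echo $(( (total_kb >> 10) / 2 ))
}

# /proc/meminfo exists only on Linux, so guard the lookup.
if [ -r /proc/meminfo ]; then
    total_kb=$(awk '/MemTotal/{print $2}' /proc/meminfo)
    echo "default_options=\"--branding clueminer -J-Xms24m -J-Xmx$(suggest_xmx_mb "$total_kb")m\""
fi
```

The printed line can be compared with the one in clueminer.conf and the value adjusted by hand.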

Forking

Click the Fork button on GitHub.
Add the upstream repository:

 $ git remote add upstream https://github.com/deric/clueminer.git

From time to time, merge with upstream:

$ git fetch upstream
$ git checkout master
$ git merge upstream/master

Benchmarks

Benchmarks of clustering algorithms are located in modules/clustering-benchmark

To run the benchmarks, build an assembly with Maven:

$ cd modules/clustering-benchmark
$ mvn assembly:assembly

and run benchmarks (might be computationally expensive):

$ java -jar target/*-jar-with-dependencies.jar

R support

JRI is needed to enable R code execution.

Debian/Ubuntu:

apt install r-cran-rjava

HDF support

The libjhdf5 library must be present on the library path.

Debian/Ubuntu:

apt install libjhdf5-jni

OpenGL support

For OpenGL visualizations you'll need native extensions

Debian/Ubuntu

jzy3d uses the JOGL library, which has native bindings:

sudo apt-get install libjogl-java

R support

Currently there are two ways to execute R code from Java:

a TCP connection to Rserve
loading the dynamic library libjri.so
- for Debian: apt install r-cran-rjava, and make sure R_HOME is set
- update the classpath: export CLASSPATH=.:/usr/lib/R/site-library/rJava/jri/
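
The libjri.so setup can be checked before launching. The sketch below is an assumption-laden example, not part of Clueminer: `has_jri` and the `JRI_DIR` variable are hypothetical names, the default directory is the one from the classpath line above, and `R RHOME` is used to fill in R_HOME only when it is unset.

```shell
# JRI_DIR (hypothetical variable): where to look for libjri.so.
# The default is the Debian path used in the classpath line above.
JRI_DIR="${JRI_DIR:-/usr/lib/R/site-library/rJava/jri}"

# has_jri (hypothetical helper): succeeds when libjri.so exists in the given directory.
has_jri() {
    [ -f "$1/libjri.so" ]
}

# If R_HOME is unset and R is installed, ask R for its home directory.
if [ -z "${R_HOME:-}" ] && command -v R >/dev/null 2>&1; then
    R_HOME="$(R RHOME)"
    export R_HOME
fi

if has_jri "$JRI_DIR"; then
    export CLASSPATH=".:$JRI_DIR/"
    echo "JRI found in $JRI_DIR (R_HOME=${R_HOME:-unset})"
else
    echo "libjri.so not found in $JRI_DIR; is r-cran-rjava installed?"
fi
```

Source the script (rather than executing it) so the exported variables persist in the current shell.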
