Nanny: A tidyverse suite for (pre-)machine learning: cluster, PCA, permute, impute, rotate, redundancy, triangular, smart-subset, abundant and variable features.
Watset: Automatic Induction of Synsets from a Graph of Synonyms
Pyclustering: pyclustering is a Python, C++ data mining library.
Minisom: 🔴 MiniSom is a minimalistic implementation of Self-Organizing Maps
Agoo: A High Performance HTTP Server for Ruby
Depth clustering: 🚕 Fast and robust clustering of point clouds generated with a Velodyne sensor.
Machine Learning Octave: 🤖 MATLAB/Octave examples of popular machine learning algorithms, with code examples and the mathematics explained
Elki: ELKI Data Mining Toolkit
Eliasdb: EliasDB is a graph-based database.
Smile: Statistical Machine Intelligence & Learning Engine
Talisman: Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
Cilantro: A lean C++ library for working with point cloud data
Superpoint graph: Large-scale Point Cloud Semantic Segmentation with Superpoint Graphs
Clustergcn: A PyTorch implementation of "Cluster-GCN: An Efficient Algorithm for Training Deep and Large Graph Convolutional Networks" (KDD 2019).
Lopq: Training of Locally Optimized Product Quantization (LOPQ) models for approximate nearest neighbor search of high-dimensional data in Python and Spark.
Dtaidistance: Time series distances: Dynamic Time Warping (DTW)
Tensorflow Book: Accompanying source code for Machine Learning with TensorFlow. Refer to the book for step-by-step explanations.
Vsearch: Versatile open-source tool for microbiome analysis
Shaman: Small, lightweight, API-driven DNS server.
Akkatecture: A CQRS and event sourcing framework for .NET Core using Akka.NET
Moa: MOA is an open source framework for Big Data stream mining. It includes a collection of machine learning algorithms (classification, regression, clustering, outlier detection, concept drift detection and recommender systems) and tools for evaluation.
Cdp: Code for our ECCV 2018 work.
Stats Maths With Python: General statistics, mathematical programming, and numerical/scientific computing scripts and notebooks in Python
Dogvscat: Sample Docker Swarm cluster stack of tools
Protoactor Go: Proto Actor - Ultra fast distributed actors for Go, C# and Java/Kotlin
Self Label: Self-labelling via simultaneous clustering and representation learning. (ICLR 2020)
Cdhit: Automatically exported from code.google.com/p/cdhit
Malheur: A Tool for Automatic Analysis of Malware Behavior
Elasticluster: Create clusters of VMs on the cloud and configure them with Ansible.
R: All Algorithms implemented in R
Gcn clustering: Code for CVPR'19 paper Linkage-based Face Clustering via GCN
Dedupe: 🆔 A Python library for accurate and scalable fuzzy matching, record deduplication and entity resolution.
Dagsfm: Distributed and Graph-based Structure from Motion
L2c: Learning to Cluster. A deep clustering strategy.
Alink: Alink is a machine learning algorithm platform based on Flink, developed by the PAI team of the Alibaba computing platform.
Pycaret: An open-source, low-code machine learning library in Python
NNS: Nonlinear Nonparametric Statistics
dti-clustering: (NeurIPS 2020 oral) Code for "Deep Transformation-Invariant Clustering" paper
micca: MICrobial Community Analysis
watset-java: An implementation of the Watset clustering algorithm in Java.
TEXTOIR: TEXTOIR is a flexible toolkit for open intent detection and discovery. (ACL 2021)
MVGL: TCyb 2018: Graph learning for multiview clustering
scrapyr: A simple & tiny Scrapy clustering solution, considered a drop-in replacement for scrapyd
Clustering-in-Python: Clustering methods in machine learning, with both theory and Python code for each algorithm. Algorithms include K-Means, K-Modes, Hierarchical, DBSCAN, and Gaussian Mixture Model (GMM). Interview questions on clustering are included at the end.
M3C: Monte Carlo Reference-based Consensus Clustering