EgoSplittingA NetworkX implementation of "Ego-splitting Framework: from Non-Overlapping to Overlapping Clusters" (KDD 2017).
Stars: ✭ 78 (+225%)
LabelPropagationA NetworkX implementation of Label Propagation from a "Near Linear Time Algorithm to Detect Community Structures in Large-Scale Networks" (Physical Review E 2008).
Stars: ✭ 101 (+320.83%)
ClusterTransformerTopic clustering library built on Transformer embeddings and cosine similarity metrics.Compatible with all BERT base transformers from huggingface.
Stars: ✭ 36 (+50%)
VectoraiVector AI — A platform for building vector based applications. Encode, query and analyse data using vectors.
Stars: ✭ 195 (+712.5%)
event-embedding-multitask*SEM 2018: Learning Distributed Event Representations with a Multi-Task Approach
Stars: ✭ 22 (-8.33%)
M-NMFAn implementation of "Community Preserving Network Embedding" (AAAI 2017)
Stars: ✭ 119 (+395.83%)
watchmanWatchman: An open-source social-media event-detection system
Stars: ✭ 18 (-25%)
lannisterA lightweight MQTT broker w/ full spec,Clustering,WebSocket,SSL written in Java
Stars: ✭ 20 (-16.67%)
napari-clusters-plotterA plugin to use with napari for clustering objects according to their properties.
Stars: ✭ 18 (-25%)
reachLoad embeddings and featurize your sentences.
Stars: ✭ 17 (-29.17%)
elixir clusterDistributed Elixir Cluster on Render with libcluster and Mix Releases
Stars: ✭ 15 (-37.5%)
CODERCODER: Knowledge infused cross-lingual medical term embedding for term normalization. [JBI, ACL-BioNLP 2022]
Stars: ✭ 24 (+0%)
treecutFind nodes in hierarchical clustering that are statistically significant
Stars: ✭ 26 (+8.33%)
magento-clusterHighly Available and Auto-scalable Magento Cluster
Stars: ✭ 21 (-12.5%)
binary-decompilationExtracting high level semantic information from binary code
Stars: ✭ 55 (+129.17%)
Clustering-in-PythonClustering methods in Machine Learning includes both theory and python code of each algorithm. Algorithms include K Mean, K Mode, Hierarchical, DB Scan and Gaussian Mixture Model GMM. Interview questions on clustering are also added in the end.
Stars: ✭ 27 (+12.5%)
zio-entityZio-Entity, a distributed, high performance, functional event sourcing library
Stars: ✭ 68 (+183.33%)
ruimteholR package to Embed All the Things! using StarSpace
Stars: ✭ 95 (+295.83%)
scclustevalSingle Cell Cluster Evaluation
Stars: ✭ 57 (+137.5%)
pytorch kmeansImplementation of the k-means algorithm in PyTorch that works for large datasets
Stars: ✭ 38 (+58.33%)
newerhoodsA Data Clinic project that aggregates NYC Open Data at the tract-level and uses Machine Learning techniques to re-imagine neighborhood boundaries.
Stars: ✭ 36 (+50%)
FredA fast, scalable and light-weight C++ Fréchet distance library, exposed to python and focused on (k,l)-clustering of polygonal curves.
Stars: ✭ 13 (-45.83%)
LIUMScripts for LIUM SpkDiarization tools
Stars: ✭ 28 (+16.67%)
revolverREVOLVER - Repeated Evolution in Cancer
Stars: ✭ 52 (+116.67%)
ClusteringImplements "Clustering a Million Faces by Identity"
Stars: ✭ 128 (+433.33%)
dropClustVersion 2.1.0 released
Stars: ✭ 19 (-20.83%)
scrapyra simple & tiny scrapy clustering solution, considered a drop-in replacement for scrapyd
Stars: ✭ 50 (+108.33%)
entity-embedPyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.
Stars: ✭ 96 (+300%)
dbscan-python[New Version] Theoretically Efficient and Practical Parallel DBSCAN
Stars: ✭ 18 (-25%)
graphgroveA framework for building (and incrementally growing) graph-based data structures used in hierarchical or DAG-structured clustering and nearest neighbor search
Stars: ✭ 29 (+20.83%)
kmeansK-Means clustering
Stars: ✭ 51 (+112.5%)
BabelNet-Sememe-PredictionCode and data of the AAAI-20 paper "Towards Building a Multilingual Sememe Knowledge Base: Predicting Sememes for BabelNet Synsets"
Stars: ✭ 18 (-25%)
relation-networkTensorflow Implementation of Relation Networks for the bAbI QA Task, detailed in "A Simple Neural Network Module for Relational Reasoning," [https://arxiv.org/abs/1706.01427] by Santoro et. al.
Stars: ✭ 45 (+87.5%)
ML2017FALLMachine Learning (EE 5184) in NTU
Stars: ✭ 66 (+175%)
Python-Machine-Learning-FundamentalsD-Lab's 6 hour introduction to machine learning in Python. Learn how to perform classification, regression, clustering, and do model selection using scikit-learn and TPOT.
Stars: ✭ 46 (+91.67%)
genieclustGenie++ Fast and Robust Hierarchical Clustering with Noise Point Detection - for Python and R
Stars: ✭ 34 (+41.67%)
Linux-adminShell scripts to automate download of GitHub traffic statistics, cluster administration, and create an animated GIF.
Stars: ✭ 23 (-4.17%)
Unsupervised-Learning-in-RWorkshop (6 hours): Clustering (Hdbscan, LCA, Hopach), dimension reduction (UMAP, GLRM), and anomaly detection (isolation forests).
Stars: ✭ 34 (+41.67%)
genieGenie: A Fast and Robust Hierarchical Clustering Algorithm (this R package has now been superseded by genieclust)
Stars: ✭ 21 (-12.5%)
realtimemap-dotnetA showcase for Proto.Actor - an ultra-fast distributed actors solution for Go, C#, and Java/Kotlin.
Stars: ✭ 47 (+95.83%)
SparseLSHA Locality Sensitive Hashing (LSH) library with an emphasis on large, highly-dimensional datasets.
Stars: ✭ 127 (+429.17%)
M3CMonte Carlo Reference-based Consensus Clustering
Stars: ✭ 24 (+0%)
hpdbscanHighly parallel DBSCAN (HPDBSCAN)
Stars: ✭ 19 (-20.83%)
ex unitedEasily spawn Elixir nodes (supervising, Mix configured, easy asserted / refuted) within ExUnit tests
Stars: ✭ 40 (+66.67%)
LinearCorexFast, linear version of CorEx for covariance estimation, dimensionality reduction, and subspace clustering with very under-sampled, high-dimensional data
Stars: ✭ 39 (+62.5%)
hmmA Hidden Markov Model implemented in Javascript
Stars: ✭ 29 (+20.83%)
text2textText2Text: Cross-lingual natural language processing and generation toolkit
Stars: ✭ 188 (+683.33%)
IR2VecImplementation of IR2Vec, published in ACM TACO
Stars: ✭ 28 (+16.67%)
morphoclusterSource code for the MorphoCluster application described in Schroeder et al. 2020
Stars: ✭ 13 (-45.83%)