topometryA comprehensive dimensional reduction framework to recover the latent topology from high-dimensional data.
Stars: ✭ 64 (+255.56%)
watchmanWatchman: An open-source social-media event-detection system
Stars: ✭ 18 (+0%)
opensvcThe OpenSVC node agent
Stars: ✭ 27 (+50%)
impfuzzyFuzzy Hash calculated from import API of PE files
Stars: ✭ 67 (+272.22%)
dtw-pythonPython port of R's Comprehensive Dynamic Time Warp algorithms package
Stars: ✭ 139 (+672.22%)
FredA fast, scalable and light-weight C++ Fréchet distance library, exposed to python and focused on (k,l)-clustering of polygonal curves.
Stars: ✭ 13 (-27.78%)
Machine-learningThis repository will contain all the stuffs required for beginners in ML and DL do follow and star this repo for regular updates
Stars: ✭ 27 (+50%)
G-SimCLRThis is the code base for paper "G-SimCLR : Self-Supervised Contrastive Learning with Guided Projection via Pseudo Labelling" by Souradip Chakraborty, Aritra Roy Gosthipaty and Sayak Paul.
Stars: ✭ 69 (+283.33%)
AnnA Anki neuronal AppendixUsing machine learning on your anki collection to enhance the scheduling via semantic clustering and semantic similarity
Stars: ✭ 39 (+116.67%)
DBSCANSDJava implementation for DBSCANSD, a trajectory clustering algorithm.
Stars: ✭ 35 (+94.44%)
boxtreeQuad/octree building for FMMs in Python and OpenCL
Stars: ✭ 52 (+188.89%)
torch DCECPytorch Deep Clustering with Convolutional Autoencoders implementation
Stars: ✭ 73 (+305.56%)
Python-Machine-Learning-FundamentalsD-Lab's 6 hour introduction to machine learning in Python. Learn how to perform classification, regression, clustering, and do model selection using scikit-learn and TPOT.
Stars: ✭ 46 (+155.56%)
RAE基于tensorflow搭建的神经网络recursive autuencode,用于实现句子聚类
Stars: ✭ 12 (-33.33%)
nlp-ltNatural Language Processing for Lithuanian language
Stars: ✭ 17 (-5.56%)
scarfToolkit for highly memory efficient analysis of single-cell RNA-Seq, scATAC-Seq and CITE-Seq data. Analyze atlas scale datasets with millions of cells on laptop.
Stars: ✭ 54 (+200%)
hotspot3d3D hotspot mutation proximity analysis tool
Stars: ✭ 43 (+138.89%)
TrajSuiteTrajSuite is a cross-platform Java application that provides a suite of trajectory data-mining and visualisation features.
Stars: ✭ 15 (-16.67%)
RATTLEReference-free reconstruction and error correction of transcriptomes from Nanopore long-read sequencing
Stars: ✭ 35 (+94.44%)
nbodykitAnalysis kit for large-scale structure datasets, the massively parallel way
Stars: ✭ 93 (+416.67%)
VOSviewer-OnlineVOSviewer Online is a tool for network visualization. It is a web-based version of VOSviewer, a popular tool for constructing and visualizing bibliometric networks.
Stars: ✭ 44 (+144.44%)
Clustering-DatasetsThis repository contains the collection of UCI (real-life) datasets and Synthetic (artificial) datasets (with cluster labels and MATLAB files) ready to use with clustering algorithms.
Stars: ✭ 189 (+950%)
LabelPropagationA NetworkX implementation of Label Propagation from a "Near Linear Time Algorithm to Detect Community Structures in Large-Scale Networks" (Physical Review E 2008).
Stars: ✭ 101 (+461.11%)
DigitalCellSorterDigital Cell Sorter (DCS): single cell RNA-seq analysis toolkit. Documentation:
Stars: ✭ 19 (+5.56%)
ZeitlineA polylinear timeline with clustering, centred on interactions. — Doc and demo https://octree-gva.github.io/Zeitline/
Stars: ✭ 15 (-16.67%)
clustering-pythonDifferent clustering approaches applied on different problemsets
Stars: ✭ 36 (+100%)
kmeansK-Means clustering
Stars: ✭ 51 (+183.33%)
ClusterTransformerTopic clustering library built on Transformer embeddings and cosine similarity metrics.Compatible with all BERT base transformers from huggingface.
Stars: ✭ 36 (+100%)
FSDAFlexible Statistics and Data Analysis (FSDA) extends MATLAB for a robust analysis of data sets affected by different sources of heterogeneity. It is open source software licensed under the European Union Public Licence (EUPL). FSDA is a joint project by the University of Parma and the Joint Research Centre of the European Commission.
Stars: ✭ 53 (+194.44%)
clopeElixir implementation of CLOPE: A Fast and Effective Clustering Algorithm for Transactional Data
Stars: ✭ 18 (+0%)
audio noise clusteringhttps://dodiku.github.io/audio_noise_clustering/results/ ==> An experiment with a variety of clustering (and clustering-like) techniques to reduce noise on an audio speech recording.
Stars: ✭ 24 (+33.33%)
py-lbgPython Implementation for Linde-Buzo-Gray / Generalized Lloyd Algorithm for vector quantization.
Stars: ✭ 22 (+22.22%)
scikit-cmeansFlexible, extensible fuzzy c-means clustering in python.
Stars: ✭ 18 (+0%)
LinearCorexFast, linear version of CorEx for covariance estimation, dimensionality reduction, and subspace clustering with very under-sampled, high-dimensional data
Stars: ✭ 39 (+116.67%)
CoronaDashCOVID-19 spread shiny dashboard with a forecasting model, countries' trajectories graphs, and cluster analysis tools
Stars: ✭ 20 (+11.11%)
product-quantization🙃Implementation of vector quantization algorithms, codes for Norm-Explicit Quantization: Improving Vector Quantization for Maximum Inner Product Search.
Stars: ✭ 40 (+122.22%)
tika-similarityTika-Similarity uses the Tika-Python package (Python port of Apache Tika) to compute file similarity based on Metadata features.
Stars: ✭ 92 (+411.11%)
clustersCluster analysis library for Golang
Stars: ✭ 68 (+277.78%)
northstarSingle cell type annotation guided by cell atlases, with freedom to be queer
Stars: ✭ 23 (+27.78%)
hierarchical-clusteringA Python implementation of divisive and hierarchical clustering algorithms. The algorithms were tested on the Human Gene DNA Sequence dataset and dendrograms were plotted.
Stars: ✭ 62 (+244.44%)
scalapackScaLAPACK development repository
Stars: ✭ 57 (+216.67%)
GrouProxFedGroup, A Clustered Federated Learning framework based on Tensorflow
Stars: ✭ 20 (+11.11%)
mathematics-statistics-for-data-scienceMathematical & Statistical topics to perform statistical analysis and tests; Linear Regression, Probability Theory, Monte Carlo Simulation, Statistical Sampling, Bootstrapping, Dimensionality reduction techniques (PCA, FA, CCA), Imputation techniques, Statistical Tests (Kolmogorov Smirnov), Robust Estimators (FastMCD) and more in Python and R.
Stars: ✭ 56 (+211.11%)
color clothcolor_cloth gets the main colors and its proportions from a cloth image ignoring the background, it uses the EM algorithm from OpenCV library, the algorithm needs an image with an item in the center of the picture.
Stars: ✭ 20 (+11.11%)
syncfluxSyncFlux is an Open Source InfluxDB Data synchronization and replication tool for migration purposes or HA clusters
Stars: ✭ 145 (+705.56%)
morphoclusterSource code for the MorphoCluster application described in Schroeder et al. 2020
Stars: ✭ 13 (-27.78%)
kmeansA simple implementation of K-means (and Bisecting K-means) clustering algorithm in Python
Stars: ✭ 18 (+0%)
pyclustertendA python package to assess cluster tendency
Stars: ✭ 38 (+111.11%)
Machine-Learning-SpecializationProject work and Assignments for Machine learning specialization course on Coursera by University of washington
Stars: ✭ 27 (+50%)
magento-clusterHighly Available and Auto-scalable Magento Cluster
Stars: ✭ 21 (+16.67%)
scclustevalSingle Cell Cluster Evaluation
Stars: ✭ 57 (+216.67%)