SparseLSHA Locality Sensitive Hashing (LSH) library with an emphasis on large, highly-dimensional datasets.
Stars: ✭ 127 (+337.93%)
opensvcThe OpenSVC node agent
Stars: ✭ 27 (-6.9%)
kmeansA simple implementation of K-means (and Bisecting K-means) clustering algorithm in Python
Stars: ✭ 18 (-37.93%)
dtw-pythonPython port of R's Comprehensive Dynamic Time Warp algorithms package
Stars: ✭ 139 (+379.31%)
kmeansK-Means clustering
Stars: ✭ 51 (+75.86%)
Machine-learningThis repository will contain all the stuffs required for beginners in ML and DL do follow and star this repo for regular updates
Stars: ✭ 27 (-6.9%)
T-CorExImplementation of linear CorEx and temporal CorEx.
Stars: ✭ 31 (+6.9%)
AnnA Anki neuronal AppendixUsing machine learning on your anki collection to enhance the scheduling via semantic clustering and semantic similarity
Stars: ✭ 39 (+34.48%)
hmmA Hidden Markov Model implemented in Javascript
Stars: ✭ 29 (+0%)
torch DCECPytorch Deep Clustering with Convolutional Autoencoders implementation
Stars: ✭ 73 (+151.72%)
impfuzzyFuzzy Hash calculated from import API of PE files
Stars: ✭ 67 (+131.03%)
RAE基于tensorflow搭建的神经网络recursive autuencode,用于实现句子聚类
Stars: ✭ 12 (-58.62%)
scarfToolkit for highly memory efficient analysis of single-cell RNA-Seq, scATAC-Seq and CITE-Seq data. Analyze atlas scale datasets with millions of cells on laptop.
Stars: ✭ 54 (+86.21%)
G-SimCLRThis is the code base for paper "G-SimCLR : Self-Supervised Contrastive Learning with Guided Projection via Pseudo Labelling" by Souradip Chakraborty, Aritra Roy Gosthipaty and Sayak Paul.
Stars: ✭ 69 (+137.93%)
TrajSuiteTrajSuite is a cross-platform Java application that provides a suite of trajectory data-mining and visualisation features.
Stars: ✭ 15 (-48.28%)
peeling-onionsA repository to store Deep Web (onion domain) crawler, scraper, and NLP tools for Tor network.
Stars: ✭ 18 (-37.93%)
nbodykitAnalysis kit for large-scale structure datasets, the massively parallel way
Stars: ✭ 93 (+220.69%)
VOSviewer-OnlineVOSviewer Online is a tool for network visualization. It is a web-based version of VOSviewer, a popular tool for constructing and visualizing bibliometric networks.
Stars: ✭ 44 (+51.72%)
Python-Machine-Learning-FundamentalsD-Lab's 6 hour introduction to machine learning in Python. Learn how to perform classification, regression, clustering, and do model selection using scikit-learn and TPOT.
Stars: ✭ 46 (+58.62%)
LabelPropagationA NetworkX implementation of Label Propagation from a "Near Linear Time Algorithm to Detect Community Structures in Large-Scale Networks" (Physical Review E 2008).
Stars: ✭ 101 (+248.28%)
nlp-ltNatural Language Processing for Lithuanian language
Stars: ✭ 17 (-41.38%)
tsamA python-based time series aggregation module (tsam) which can be used to reduce the number of time steps using typical periods or by decreasing the temporal resolution
Stars: ✭ 112 (+286.21%)
SpectreA computational toolkit in R for the integration, exploration, and analysis of high-dimensional single-cell cytometry and imaging data.
Stars: ✭ 31 (+6.9%)
goudaGolang Utilities for Data Analysis
Stars: ✭ 18 (-37.93%)
RATTLEReference-free reconstruction and error correction of transcriptomes from Nanopore long-read sequencing
Stars: ✭ 35 (+20.69%)
ClusterTransformerTopic clustering library built on Transformer embeddings and cosine similarity metrics.Compatible with all BERT base transformers from huggingface.
Stars: ✭ 36 (+24.14%)
lannisterA lightweight MQTT broker w/ full spec,Clustering,WebSocket,SSL written in Java
Stars: ✭ 20 (-31.03%)
FSDAFlexible Statistics and Data Analysis (FSDA) extends MATLAB for a robust analysis of data sets affected by different sources of heterogeneity. It is open source software licensed under the European Union Public Licence (EUPL). FSDA is a joint project by the University of Parma and the Joint Research Centre of the European Commission.
Stars: ✭ 53 (+82.76%)
Clustering-DatasetsThis repository contains the collection of UCI (real-life) datasets and Synthetic (artificial) datasets (with cluster labels and MATLAB files) ready to use with clustering algorithms.
Stars: ✭ 189 (+551.72%)
audio noise clusteringhttps://dodiku.github.io/audio_noise_clustering/results/ ==> An experiment with a variety of clustering (and clustering-like) techniques to reduce noise on an audio speech recording.
Stars: ✭ 24 (-17.24%)
hclustHierarchical clustering in JavaScript
Stars: ✭ 39 (+34.48%)
scikit-cmeansFlexible, extensible fuzzy c-means clustering in python.
Stars: ✭ 18 (-37.93%)
ZeitlineA polylinear timeline with clustering, centred on interactions. — Doc and demo https://octree-gva.github.io/Zeitline/
Stars: ✭ 15 (-48.28%)
CoronaDashCOVID-19 spread shiny dashboard with a forecasting model, countries' trajectories graphs, and cluster analysis tools
Stars: ✭ 20 (-31.03%)
Linux-adminShell scripts to automate download of GitHub traffic statistics, cluster administration, and create an animated GIF.
Stars: ✭ 23 (-20.69%)
ensparaModeling molecular ensembles with scalable data structures and parallel computing
Stars: ✭ 28 (-3.45%)
DigitalCellSorterDigital Cell Sorter (DCS): single cell RNA-seq analysis toolkit. Documentation:
Stars: ✭ 19 (-34.48%)
Apartment-Interest-PredictionPredict people interest in renting specific NYC apartments. The challenge combines structured data, geolocalization, time data, free text and images.
Stars: ✭ 17 (-41.38%)
mathematics-statistics-for-data-scienceMathematical & Statistical topics to perform statistical analysis and tests; Linear Regression, Probability Theory, Monte Carlo Simulation, Statistical Sampling, Bootstrapping, Dimensionality reduction techniques (PCA, FA, CCA), Imputation techniques, Statistical Tests (Kolmogorov Smirnov), Robust Estimators (FastMCD) and more in Python and R.
Stars: ✭ 56 (+93.1%)
dbscan-python[New Version] Theoretically Efficient and Practical Parallel DBSCAN
Stars: ✭ 18 (-37.93%)
syncfluxSyncFlux is an Open Source InfluxDB Data synchronization and replication tool for migration purposes or HA clusters
Stars: ✭ 145 (+400%)
dbscanDBSCAN Clustering Algorithm C# Implementation
Stars: ✭ 38 (+31.03%)
clopeElixir implementation of CLOPE: A Fast and Effective Clustering Algorithm for Transactional Data
Stars: ✭ 18 (-37.93%)
GrouProxFedGroup, A Clustered Federated Learning framework based on Tensorflow
Stars: ✭ 20 (-31.03%)
Clustering4EverC4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.
Stars: ✭ 126 (+334.48%)
genieGenie: A Fast and Robust Hierarchical Clustering Algorithm (this R package has now been superseded by genieclust)
Stars: ✭ 21 (-27.59%)
LIUMScripts for LIUM SpkDiarization tools
Stars: ✭ 28 (-3.45%)
magento-clusterHighly Available and Auto-scalable Magento Cluster
Stars: ✭ 21 (-27.59%)
pyclustertendA python package to assess cluster tendency
Stars: ✭ 38 (+31.03%)