scarfToolkit for highly memory efficient analysis of single-cell RNA-Seq, scATAC-Seq and CITE-Seq data. Analyze atlas scale datasets with millions of cells on laptop.
Stars: ✭ 54 (+42.11%)
Clustering4EverC4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.
Stars: ✭ 126 (+231.58%)
MoosefsMooseFS – Open Source, Petabyte, Fault-Tolerant, Highly Performing, Scalable Network Distributed File System (Software-Defined Storage)
Stars: ✭ 1,025 (+2597.37%)
ClusterAnalysis.jlCluster Algorithms from Scratch with Julia Lang. (K-Means and DBSCAN)
Stars: ✭ 22 (-42.11%)
color clothcolor_cloth gets the main colors and its proportions from a cloth image ignoring the background, it uses the EM algorithm from OpenCV library, the algorithm needs an image with an item in the center of the picture.
Stars: ✭ 20 (-47.37%)
kmeansA simple implementation of K-means (and Bisecting K-means) clustering algorithm in Python
Stars: ✭ 18 (-52.63%)
couchdb-mangoMirror of Apache CouchDB Mango
Stars: ✭ 34 (-10.53%)
SpectreA computational toolkit in R for the integration, exploration, and analysis of high-dimensional single-cell cytometry and imaging data.
Stars: ✭ 31 (-18.42%)
DBSCANSDJava implementation for DBSCANSD, a trajectory clustering algorithm.
Stars: ✭ 35 (-7.89%)
Sampled-MinHashingA method to mine beyond-pairwise relationships using Min-Hashing for large-scale pattern discovery
Stars: ✭ 24 (-36.84%)
G-SimCLRThis is the code base for paper "G-SimCLR : Self-Supervised Contrastive Learning with Guided Projection via Pseudo Labelling" by Souradip Chakraborty, Aritra Roy Gosthipaty and Sayak Paul.
Stars: ✭ 69 (+81.58%)
big dataA collection of tutorials on Hadoop, MapReduce, Spark, Docker
Stars: ✭ 34 (-10.53%)
SparseLSHA Locality Sensitive Hashing (LSH) library with an emphasis on large, highly-dimensional datasets.
Stars: ✭ 127 (+234.21%)
hotspot3d3D hotspot mutation proximity analysis tool
Stars: ✭ 43 (+13.16%)
clusterdockclusterdock is a framework for creating Docker-based container clusters
Stars: ✭ 26 (-31.58%)
watchmanWatchman: An open-source social-media event-detection system
Stars: ✭ 18 (-52.63%)
clustersCluster analysis library for Golang
Stars: ✭ 68 (+78.95%)
genieGenie: A Fast and Robust Hierarchical Clustering Algorithm (this R package has now been superseded by genieclust)
Stars: ✭ 21 (-44.74%)
influxdb-haHigh-availability and horizontal scalability for InfluxDB
Stars: ✭ 45 (+18.42%)
falconMirror of Apache Falcon
Stars: ✭ 95 (+150%)
T-CorExImplementation of linear CorEx and temporal CorEx.
Stars: ✭ 31 (-18.42%)
FredA fast, scalable and light-weight C++ Fréchet distance library, exposed to python and focused on (k,l)-clustering of polygonal curves.
Stars: ✭ 13 (-65.79%)
wranglerWrangler Transform: A DMD system for transforming Big Data
Stars: ✭ 63 (+65.79%)
tf-example-modelsTensorFlow-based implementation of (Gaussian) Mixture Model and some other examples.
Stars: ✭ 42 (+10.53%)
impfuzzyFuzzy Hash calculated from import API of PE files
Stars: ✭ 67 (+76.32%)
big-data-liteSamples to the Oracle Big Data Lite VM
Stars: ✭ 41 (+7.89%)
predictionioPredictionIO, a machine learning server for developers and ML engineers.
Stars: ✭ 12,510 (+32821.05%)
hclustHierarchical clustering in JavaScript
Stars: ✭ 39 (+2.63%)
protoactor-goProto Actor - Ultra fast distributed actors for Go, C# and Java/Kotlin
Stars: ✭ 4,138 (+10789.47%)
Linux-adminShell scripts to automate download of GitHub traffic statistics, cluster administration, and create an animated GIF.
Stars: ✭ 23 (-39.47%)
DigitalCellSorterDigital Cell Sorter (DCS): single cell RNA-seq analysis toolkit. Documentation:
Stars: ✭ 19 (-50%)
check-engineData validation library for PySpark 3.0.0
Stars: ✭ 29 (-23.68%)
QuestionClusteringClasificador de preguntas escrito en python 3 que fue implementado en el siguiente vídeo: https://youtu.be/qnlW1m6lPoY
Stars: ✭ 15 (-60.53%)
LinearCorexFast, linear version of CorEx for covariance estimation, dimensionality reduction, and subspace clustering with very under-sampled, high-dimensional data
Stars: ✭ 39 (+2.63%)
mousetrapProcess and Analyze Mouse-Tracking Data
Stars: ✭ 33 (-13.16%)
opendcCollaborative Datacenter Simulation and Exploration for Everybody
Stars: ✭ 40 (+5.26%)
classifai🔥 One of the most comprehensive open-source data annotation platform.
Stars: ✭ 99 (+160.53%)
clopeElixir implementation of CLOPE: A Fast and Effective Clustering Algorithm for Transactional Data
Stars: ✭ 18 (-52.63%)
subsemblesubsemble R package for ensemble learning on subsets of data
Stars: ✭ 40 (+5.26%)
nlp-ltNatural Language Processing for Lithuanian language
Stars: ✭ 17 (-55.26%)
hyper-enginePython library for Bayesian hyper-parameters optimization
Stars: ✭ 80 (+110.53%)
hmmA Hidden Markov Model implemented in Javascript
Stars: ✭ 29 (-23.68%)
SparsifiedKMeansKMeans for big data using preconditioning and sparsification, Matlab implementation. Aka k-means
Stars: ✭ 50 (+31.58%)
PyPHLAWDPython version of PHLAWD
Stars: ✭ 16 (-57.89%)
RATTLEReference-free reconstruction and error correction of transcriptomes from Nanopore long-read sequencing
Stars: ✭ 35 (-7.89%)