UmapUniform Manifold Approximation and Projection
Stars: ✭ 5,268 (+5564.52%)
ReductionWrappersR wrappers to connect Python dimensional reduction tools and single cell data objects (Seurat, SingleCellExperiment, etc...)
Stars: ✭ 31 (-66.67%)
topometryA comprehensive dimensional reduction framework to recover the latent topology from high-dimensional data.
Stars: ✭ 64 (-31.18%)
ParametricUMAP paperParametric UMAP embeddings for representation and semisupervised learning. From the paper "Parametric UMAP: learning embeddings with deep neural networks for representation and semi-supervised learning" (Sainburg, McInnes, Gentner, 2020).
Stars: ✭ 132 (+41.94%)
Unsupervised-Learning-in-RWorkshop (6 hours): Clustering (Hdbscan, LCA, Hopach), dimension reduction (UMAP, GLRM), and anomaly detection (isolation forests).
Stars: ✭ 34 (-63.44%)
dbMAPA fast, accurate, and modularized dimensionality reduction approach based on diffusion harmonics and graph layouts. Escalates to millions of samples on a personal laptop. Adds high-dimensional big data intrinsic structure to your clustering and data visualization workflow.
Stars: ✭ 39 (-58.06%)
bhtsneParallel Barnes-Hut t-SNE implementation written in Rust.
Stars: ✭ 43 (-53.76%)
SpectreA computational toolkit in R for the integration, exploration, and analysis of high-dimensional single-cell cytometry and imaging data.
Stars: ✭ 31 (-66.67%)
OpenPHParallel reduction of boundary matrices for Persistent Homology with CUDA
Stars: ✭ 14 (-84.95%)
timecorrEstimate dynamic high-order correlations in multivariate timeseries data
Stars: ✭ 30 (-67.74%)
Embeddings2Imagecreate "Karpathy's style" 2d images out of your image embeddings
Stars: ✭ 52 (-44.09%)
adenineADENINE: A Data ExploratioN PipelINE
Stars: ✭ 15 (-83.87%)
pymdeMinimum-distortion embedding with PyTorch
Stars: ✭ 420 (+351.61%)
tadasetsSynthetic data sets apt for Topological Data Analysis
Stars: ✭ 20 (-78.49%)
walkletsA lightweight implementation of Walklets from "Don't Walk Skip! Online Learning of Multi-scale Network Embeddings" (ASONAM 2017).
Stars: ✭ 94 (+1.08%)
TDAstatsR pipeline for computing persistent homology in topological data analysis. See https://doi.org/10.21105/joss.00860 for more details.
Stars: ✭ 26 (-72.04%)
scHPFSingle-cell Hierarchical Poisson Factorization
Stars: ✭ 52 (-44.09%)
antzANTz immersive 3D data visualization engine
Stars: ✭ 25 (-73.12%)
tldrTLDR is an unsupervised dimensionality reduction method that combines neighborhood embedding learning with the simplicity and effectiveness of recent self-supervised learning losses
Stars: ✭ 95 (+2.15%)
tmaptopological data analysis of population-scale microbiomes
Stars: ✭ 24 (-74.19%)
50-days-of-Statistics-for-Data-ScienceThis repository consist of a 50-day program. All the statistics required for the complete understanding of data science will be uploaded in this repository.
Stars: ✭ 19 (-79.57%)
lfdaLocal Fisher Discriminant Analysis in R
Stars: ✭ 74 (-20.43%)
umap-javaA Uniform Manifold Approximation and Projection (UMAP) library for Java, developed by Tag.bio in collaboration with Real Time Genomics.
Stars: ✭ 16 (-82.8%)
enstopEnsemble topic modelling with pLSA
Stars: ✭ 104 (+11.83%)
AnnA Anki neuronal AppendixUsing machine learning on your anki collection to enhance the scheduling via semantic clustering and semantic similarity
Stars: ✭ 39 (-58.06%)
playing with vaeComparing FC VAE / FCN VAE / PCA / UMAP on MNIST / FMNIST
Stars: ✭ 53 (-43.01%)
BEERBEER: Batch EffEct Remover for single-cell data
Stars: ✭ 19 (-79.57%)
mosesStreaming, Memory-Limited, r-truncated SVD Revisited!
Stars: ✭ 19 (-79.57%)
topological-autoencodersCode for the paper "Topological Autoencoders" by Michael Moor, Max Horn, Bastian Rieck, and Karsten Borgwardt.
Stars: ✭ 82 (-11.83%)
scarfToolkit for highly memory efficient analysis of single-cell RNA-Seq, scATAC-Seq and CITE-Seq data. Analyze atlas scale datasets with millions of cells on laptop.
Stars: ✭ 54 (-41.94%)
uapcaUncertainty-aware principal component analysis.
Stars: ✭ 16 (-82.8%)
federated pcaFederated Principal Component Analysis Revisited!
Stars: ✭ 30 (-67.74%)
Machine LearningA repository of resources for understanding the concepts of machine learning/deep learning.
Stars: ✭ 29 (-68.82%)
DRComparisonComparison of dimensionality reduction methods
Stars: ✭ 29 (-68.82%)
sefA Python Library for Similarity-based Dimensionality Reduction
Stars: ✭ 24 (-74.19%)
partitionA fast and flexible framework for data reduction in R
Stars: ✭ 33 (-64.52%)
mathematics-statistics-for-data-scienceMathematical & Statistical topics to perform statistical analysis and tests; Linear Regression, Probability Theory, Monte Carlo Simulation, Statistical Sampling, Bootstrapping, Dimensionality reduction techniques (PCA, FA, CCA), Imputation techniques, Statistical Tests (Kolmogorov Smirnov), Robust Estimators (FastMCD) and more in Python and R.
Stars: ✭ 56 (-39.78%)
ezancestryEasy genetic ancestry predictions in Python
Stars: ✭ 38 (-59.14%)
findpeaksThe detection of peaks and valleys in a 1d-vector or 2d-array (image)
Stars: ✭ 121 (+30.11%)
twpca🕝 Time-warped principal components analysis (twPCA)
Stars: ✭ 118 (+26.88%)
Data-ScienceUsing Kaggle Data and Real World Data for Data Science and prediction in Python, R, Excel, Power BI, and Tableau.
Stars: ✭ 15 (-83.87%)
Machine Learning With PythonPractice and tutorial-style notebooks covering wide variety of machine learning techniques
Stars: ✭ 2,197 (+2262.37%)
Awesome Single CellCommunity-curated list of software packages and data resources for single-cell, including RNA-seq, ATAC-seq, etc.
Stars: ✭ 1,937 (+1982.8%)
HARRecognize one of six human activities such as standing, sitting, and walking using a Softmax Classifier trained on mobile phone sensor data.
Stars: ✭ 18 (-80.65%)
Competitive-Feature-LearningOnline feature-extraction and classification algorithm that learns representations of input patterns.
Stars: ✭ 32 (-65.59%)
tGPLVMtGPLVM: A Nonparametric, Generative Model for Manifold Learning with scRNA-seq experimental data
Stars: ✭ 16 (-82.8%)
NIDS-Intrusion-DetectionSimple Implementation of Network Intrusion Detection System. KddCup'99 Data set is used for this project. kdd_cup_10_percent is used for training test. correct set is used for test. PCA is used for dimension reduction. SVM and KNN supervised algorithms are the classification algorithms of project. Accuracy : %83.5 For SVM , %80 For KNN
Stars: ✭ 45 (-51.61%)
dmlR package for Distance Metric Learning
Stars: ✭ 58 (-37.63%)
PersistenceA topological data analysis library for Haskell
Stars: ✭ 26 (-72.04%)
AlephA library for exploring persistent homology
Stars: ✭ 81 (-12.9%)