Awesome Single CellCommunity-curated list of software packages and data resources for single-cell, including RNA-seq, ATAC-seq, etc.
Stars: ✭ 1,937 (+3625%)
NIDS-Intrusion-DetectionSimple Implementation of Network Intrusion Detection System. KddCup'99 Data set is used for this project. kdd_cup_10_percent is used for training test. correct set is used for test. PCA is used for dimension reduction. SVM and KNN supervised algorithms are the classification algorithms of project. Accuracy : %83.5 For SVM , %80 For KNN
Stars: ✭ 45 (-13.46%)
TangramSpatial alignment of single cell transcriptomic data.
Stars: ✭ 149 (+186.54%)
pypmcClustering with variational Bayes and population Monte Carlo
Stars: ✭ 46 (-11.54%)
pymdeMinimum-distortion embedding with PyTorch
Stars: ✭ 420 (+707.69%)
Machine LearningA repository of resources for understanding the concepts of machine learning/deep learning.
Stars: ✭ 29 (-44.23%)
Competitive-Feature-LearningOnline feature-extraction and classification algorithm that learns representations of input patterns.
Stars: ✭ 32 (-38.46%)
BEERBEER: Batch EffEct Remover for single-cell data
Stars: ✭ 19 (-63.46%)
SpectreA computational toolkit in R for the integration, exploration, and analysis of high-dimensional single-cell cytometry and imaging data.
Stars: ✭ 31 (-40.38%)
scGEAToolboxscGEAToolbox: Matlab toolbox for single-cell gene expression analyses
Stars: ✭ 15 (-71.15%)
SierraDiscover differential transcript usage from polyA-captured single cell RNA-seq data
Stars: ✭ 37 (-28.85%)
dropEstPipeline for initial analysis of droplet-based single-cell RNA-seq data
Stars: ✭ 71 (+36.54%)
federated pcaFederated Principal Component Analysis Revisited!
Stars: ✭ 30 (-42.31%)
cardelinoClone identification from single-cell data
Stars: ✭ 49 (-5.77%)
celltypistA tool for semi-automatic cell type annotation
Stars: ✭ 92 (+76.92%)
UmapUniform Manifold Approximation and Projection
Stars: ✭ 5,268 (+10030.77%)
sefA Python Library for Similarity-based Dimensionality Reduction
Stars: ✭ 24 (-53.85%)
Unsupervised-Learning-in-RWorkshop (6 hours): Clustering (Hdbscan, LCA, Hopach), dimension reduction (UMAP, GLRM), and anomaly detection (isolation forests).
Stars: ✭ 34 (-34.62%)
50-days-of-Statistics-for-Data-ScienceThis repository consist of a 50-day program. All the statistics required for the complete understanding of data science will be uploaded in this repository.
Stars: ✭ 19 (-63.46%)
UMAP.jlUniform Manifold Approximation and Projection (UMAP) implementation in Julia
Stars: ✭ 93 (+78.85%)
mathematics-statistics-for-data-scienceMathematical & Statistical topics to perform statistical analysis and tests; Linear Regression, Probability Theory, Monte Carlo Simulation, Statistical Sampling, Bootstrapping, Dimensionality reduction techniques (PCA, FA, CCA), Imputation techniques, Statistical Tests (Kolmogorov Smirnov), Robust Estimators (FastMCD) and more in Python and R.
Stars: ✭ 56 (+7.69%)
timecorrEstimate dynamic high-order correlations in multivariate timeseries data
Stars: ✭ 30 (-42.31%)
mosesStreaming, Memory-Limited, r-truncated SVD Revisited!
Stars: ✭ 19 (-63.46%)
variational-bayes-csScalable sparse Bayesian learning for large CS recovery problems
Stars: ✭ 17 (-67.31%)
ALRAImputation method for scRNA-seq based on low-rank approximation
Stars: ✭ 48 (-7.69%)
SHARPSHARP: Single-cell RNA-seq Hyper-fast and Accurate processing via ensemble Random Projection
Stars: ✭ 14 (-73.08%)
twpca🕝 Time-warped principal components analysis (twPCA)
Stars: ✭ 118 (+126.92%)
enstopEnsemble topic modelling with pLSA
Stars: ✭ 104 (+100%)
Data-ScienceUsing Kaggle Data and Real World Data for Data Science and prediction in Python, R, Excel, Power BI, and Tableau.
Stars: ✭ 15 (-71.15%)
symphonyEfficient and precise single-cell reference atlas mapping with Symphony
Stars: ✭ 69 (+32.69%)
Machine Learning With PythonPractice and tutorial-style notebooks covering wide variety of machine learning techniques
Stars: ✭ 2,197 (+4125%)
cTPnetcTPnet Package
Stars: ✭ 18 (-65.38%)
DRComparisonComparison of dimensionality reduction methods
Stars: ✭ 29 (-44.23%)
HARRecognize one of six human activities such as standing, sitting, and walking using a Softmax Classifier trained on mobile phone sensor data.
Stars: ✭ 18 (-65.38%)
ARM-gradientLow-variance, efficient and unbiased gradient estimation for optimizing models with binary latent variables. (ICLR 2019)
Stars: ✭ 26 (-50%)
tGPLVMtGPLVM: A Nonparametric, Generative Model for Manifold Learning with scRNA-seq experimental data
Stars: ✭ 16 (-69.23%)
vireoDemultiplexing pooled scRNA-seq data with or without genotype reference
Stars: ✭ 34 (-34.62%)
dbMAPA fast, accurate, and modularized dimensionality reduction approach based on diffusion harmonics and graph layouts. Escalates to millions of samples on a personal laptop. Adds high-dimensional big data intrinsic structure to your clustering and data visualization workflow.
Stars: ✭ 39 (-25%)
ReactiveMP.jlJulia package for automatic Bayesian inference on a factor graph with reactive message passing
Stars: ✭ 58 (+11.54%)
dmlR package for Distance Metric Learning
Stars: ✭ 58 (+11.54%)
partitionA fast and flexible framework for data reduction in R
Stars: ✭ 33 (-36.54%)
bhtsneParallel Barnes-Hut t-SNE implementation written in Rust.
Stars: ✭ 43 (-17.31%)
ParametricUMAP paperParametric UMAP embeddings for representation and semisupervised learning. From the paper "Parametric UMAP: learning embeddings with deep neural networks for representation and semi-supervised learning" (Sainburg, McInnes, Gentner, 2020).
Stars: ✭ 132 (+153.85%)
northstarSingle cell type annotation guided by cell atlases, with freedom to be queer
Stars: ✭ 23 (-55.77%)
adenineADENINE: A Data ExploratioN PipelINE
Stars: ✭ 15 (-71.15%)
lfdaLocal Fisher Discriminant Analysis in R
Stars: ✭ 74 (+42.31%)
walkletsA lightweight implementation of Walklets from "Don't Walk Skip! Online Learning of Multi-scale Network Embeddings" (ASONAM 2017).
Stars: ✭ 94 (+80.77%)
ezancestryEasy genetic ancestry predictions in Python
Stars: ✭ 38 (-26.92%)
scarfToolkit for highly memory efficient analysis of single-cell RNA-Seq, scATAC-Seq and CITE-Seq data. Analyze atlas scale datasets with millions of cells on laptop.
Stars: ✭ 54 (+3.85%)
cerebraA tool for fast and accurate summarizing of variant calling format (VCF) files
Stars: ✭ 55 (+5.77%)
ReductionWrappersR wrappers to connect Python dimensional reduction tools and single cell data objects (Seurat, SingleCellExperiment, etc...)
Stars: ✭ 31 (-40.38%)
tldrTLDR is an unsupervised dimensionality reduction method that combines neighborhood embedding learning with the simplicity and effectiveness of recent self-supervised learning losses
Stars: ✭ 95 (+82.69%)
topometryA comprehensive dimensional reduction framework to recover the latent topology from high-dimensional data.
Stars: ✭ 64 (+23.08%)
uapcaUncertainty-aware principal component analysis.
Stars: ✭ 16 (-69.23%)
scAlignA deep learning-based tool for alignment and integration of single cell genomic data across multiple datasets, species, conditions, batches
Stars: ✭ 32 (-38.46%)