All Projects → simslab → scHPF

simslab / scHPF

Licence: BSD-2-Clause license
Single-cell Hierarchical Poisson Factorization

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to scHPF

Machine Learning
A repository of resources for understanding the concepts of machine learning/deep learning.
Stars: ✭ 29 (-44.23%)
Mutual labels:  dimensionality-reduction
BEER
BEER: Batch EffEct Remover for single-cell data
Stars: ✭ 19 (-63.46%)
Mutual labels:  scrna-seq
50-days-of-Statistics-for-Data-Science
This repository consist of a 50-day program. All the statistics required for the complete understanding of data science will be uploaded in this repository.
Stars: ✭ 19 (-63.46%)
Mutual labels:  dimensionality-reduction
federated pca
Federated Principal Component Analysis Revisited!
Stars: ✭ 30 (-42.31%)
Mutual labels:  dimensionality-reduction
moses
Streaming, Memory-Limited, r-truncated SVD Revisited!
Stars: ✭ 19 (-63.46%)
Mutual labels:  dimensionality-reduction
celltypist
A tool for semi-automatic cell type annotation
Stars: ✭ 92 (+76.92%)
Mutual labels:  scrna-seq
sef
A Python Library for Similarity-based Dimensionality Reduction
Stars: ✭ 24 (-53.85%)
Mutual labels:  dimensionality-reduction
ReactiveMP.jl
Julia package for automatic Bayesian inference on a factor graph with reactive message passing
Stars: ✭ 58 (+11.54%)
Mutual labels:  variational-bayes
ParametricUMAP paper
Parametric UMAP embeddings for representation and semisupervised learning. From the paper "Parametric UMAP: learning embeddings with deep neural networks for representation and semi-supervised learning" (Sainburg, McInnes, Gentner, 2020).
Stars: ✭ 132 (+153.85%)
Mutual labels:  dimensionality-reduction
lfda
Local Fisher Discriminant Analysis in R
Stars: ✭ 74 (+42.31%)
Mutual labels:  dimensionality-reduction
SHARP
SHARP: Single-cell RNA-seq Hyper-fast and Accurate processing via ensemble Random Projection
Stars: ✭ 14 (-73.08%)
Mutual labels:  scrna-seq
scarf
Toolkit for highly memory efficient analysis of single-cell RNA-Seq, scATAC-Seq and CITE-Seq data. Analyze atlas scale datasets with millions of cells on laptop.
Stars: ✭ 54 (+3.85%)
Mutual labels:  scrna-seq
enstop
Ensemble topic modelling with pLSA
Stars: ✭ 104 (+100%)
Mutual labels:  dimensionality-reduction
symphony
Efficient and precise single-cell reference atlas mapping with Symphony
Stars: ✭ 69 (+32.69%)
Mutual labels:  scrna-seq
cTPnet
cTPnet Package
Stars: ✭ 18 (-65.38%)
Mutual labels:  scrna-seq
DRComparison
Comparison of dimensionality reduction methods
Stars: ✭ 29 (-44.23%)
Mutual labels:  dimensionality-reduction
ARM-gradient
Low-variance, efficient and unbiased gradient estimation for optimizing models with binary latent variables. (ICLR 2019)
Stars: ✭ 26 (-50%)
Mutual labels:  variational-bayes
ReductionWrappers
R wrappers to connect Python dimensional reduction tools and single cell data objects (Seurat, SingleCellExperiment, etc...)
Stars: ✭ 31 (-40.38%)
Mutual labels:  dimensionality-reduction
tldr
TLDR is an unsupervised dimensionality reduction method that combines neighborhood embedding learning with the simplicity and effectiveness of recent self-supervised learning losses
Stars: ✭ 95 (+82.69%)
Mutual labels:  dimensionality-reduction
topometry
A comprehensive dimensional reduction framework to recover the latent topology from high-dimensional data.
Stars: ✭ 64 (+23.08%)
Mutual labels:  dimensionality-reduction

Single-cell Hierarchical Poisson Factorization

About

scHPF is a tool for de novo discovery of both discrete and continuous expression patterns in single-cell RNA-sequencing (scRNA-seq). We find that scHPF’s sparse low-dimensional representations, non-negativity, and explicit modeling of variable sparsity across genes and cells produce highly interpretable factors.

Installation

Environment & Dependencies

scHPF requires Python >= 3.6 and the packages:

The easiest way to setup an environment for scHPF is with the Anaconda Python distribution in Miniconda or anaconda:

conda create -n schpf_p37 python=3.7 scikit-learn numba=0.50 pandas

# for newer anaconda versions
conda activate schpf_p37
# XOR older anaconda verstions
source activate schpf_p37

# Optional, for using loom files as input to preprocessing
pip install -U loompy

Installing from source

Once you have set up the environment, clone this repository and install.

git clone [email protected]:simslab/scHPF.git
cd scHPF
pip install .

Testing your installation

This step important because not all micro-versions of numba play nicely with all micro versions of Python or numpy, and sometimes issues vary across machines. Testing will catch some but not all such issues. From the scHPF base directory do:

conda install pytest
pytest

Please get in touch if tests fail, or if you get segmentation faults or very long train times that and no automatic parallelization, and I'm happy to try to help.

Quick Start: Command Line Interface

  1. Prepare your data.

  2. Train a model.

  3. Get gene and cell scores

API

scHPF has a scikit-learn like API. Trained models are stored in a serialized joblib format.

Help and support

If you have any questions/errors/issues, please open an issue and I be happy to to provide whatever help and guidance I can.

Contributing

Contributions to scHPF are welcome. Please get in touch if you would like to discuss/check it's something I've already done but haven't pushed to master yet. To contribute, please fork scHPF, make your changes, and submit a pull request.

References

Hanna Mendes Levitin, Jinzhou Yuan, Yim Ling Cheng, Francisco JR Ruiz, Erin C Bush, Jeffrey N Bruce, Peter Canoll, Antonio Iavarone, Anna Lasorella, David M Blei, Peter A Sims. "De novo gene signature identification from single‐cell RNA‐seq with hierarchical Poisson factorization." Molecular Systems Biology, 2019. [Open access article]

Peter A. Szabo*, Hanna Mendes Levitin*, Michelle Miron, Mark E. Snyder, Takashi Senda, Jinzhou Yuan, Yim Ling Cheng, Erin C. Bush, Pranay Dogra, Puspa Thapa, Donna L. Farber, Peter A. Sims. "Single-cell transcriptomics of human T cells reveals tissue and activation signatures in health and disease." Nature Communications, 2019. [Open access article] * Co-first authors

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].