All Projects β†’ KevinMenden β†’ scaden

KevinMenden / scaden

Licence: MIT license
Deep Learning based cell composition analysis with Scaden.

Programming Languages

python
139335 projects - #7 most used programming language
Dockerfile
14818 projects

Projects that are alternatives of or similar to scaden

EWCE
Expression Weighted Celltype Enrichment. See the package website for up-to-date instructions on usage.
Stars: ✭ 30 (-50.82%)
Mutual labels:  deconvolution, single-cell-rna-seq
alevin-fry
🐟 πŸ”¬πŸ¦€ alevin-fry is an efficient and flexible tool for processing single-cell sequencing data, currently focused on single-cell transcriptomics and feature barcoding.
Stars: ✭ 78 (+27.87%)
Mutual labels:  rna-seq, single-cell-rna-seq
scTCRseq
Processing of single cell RNAseq data for the recovery of TCRs in python
Stars: ✭ 22 (-63.93%)
Mutual labels:  rna-seq, single-cell-rna-seq
CellO
CellO: Gene expression-based hierarchical cell type classification using the Cell Ontology
Stars: ✭ 34 (-44.26%)
Mutual labels:  rna-seq, single-cell-rna-seq
OrchestratingSingleCellAnalysis-release
An online companion to the OSCA manuscript demonstrating Bioconductor resources and workflows for single-cell RNA-seq analysis.
Stars: ✭ 35 (-42.62%)
Mutual labels:  rna-seq, single-cell-rna-seq
dropClust
Version 2.1.0 released
Stars: ✭ 19 (-68.85%)
Mutual labels:  rna-seq
scCODA
A Bayesian model for compositional single-cell data analysis
Stars: ✭ 109 (+78.69%)
Mutual labels:  single-cell-rna-seq
gene-oracle
Feature extraction algorithm for genomic data
Stars: ✭ 13 (-78.69%)
Mutual labels:  rna-seq
MetaOmGraph
MetaOmGraph: a workbench for interactive exploratory data analysis of large expression datasets
Stars: ✭ 30 (-50.82%)
Mutual labels:  rna-seq
Taiji
All-in-one analysis pipeline
Stars: ✭ 28 (-54.1%)
Mutual labels:  single-cell-rna-seq
ORNA
Fast in-silico normalization algorithm for NGS data
Stars: ✭ 21 (-65.57%)
Mutual labels:  rna-seq
MINTIE
Method for Identifying Novel Transcripts and Isoforms using Equivalence classes, in cancer and rare disease.
Stars: ✭ 24 (-60.66%)
Mutual labels:  rna-seq
cellSNP
Pileup biallelic SNPs from single-cell and bulk RNA-seq data
Stars: ✭ 42 (-31.15%)
Mutual labels:  rna-seq
cellrank
CellRank for directed single-cell fate mapping
Stars: ✭ 222 (+263.93%)
Mutual labels:  single-cell-rna-seq
CoNekT
CoNekT (short for Co-expression Network Toolkit) is a platform to browse co-expression data and enable cross-species comparisons.
Stars: ✭ 17 (-72.13%)
Mutual labels:  rna-seq
CNN-DICOM-Segmentation
DICOM Image Segmentation with CNNs in Tensorflow
Stars: ✭ 87 (+42.62%)
Mutual labels:  deconvolution
NGS
Next-Gen Sequencing tools from the Horvath Lab
Stars: ✭ 30 (-50.82%)
Mutual labels:  rna-seq
RNASeq
RNASeq pipeline
Stars: ✭ 30 (-50.82%)
Mutual labels:  rna-seq
velodyn
Dynamical systems methods for RNA velocity analysis
Stars: ✭ 16 (-73.77%)
Mutual labels:  rna-seq
CellNet
CellNet: network biology applied to stem cell engineering
Stars: ✭ 39 (-36.07%)
Mutual labels:  rna-seq

Single-cell assisted deconvolutional network

Scaden

Scaden version MIT Install with pip Downloads Docker Scaden CI

Scaden is a deep-learning based algorithm for cell type deconvolution of bulk RNA-seq samples. It was developed at the DZNE TΓΌbingen and the ZMNH in Hamburg. The method is published in Science Advances: Deep-learning based cell composition analysis from tissue expression profiles

A complete documentation is available here

Figure1

Scaden overview. a) Generation of artificial bulk samples with known cell type composition from scRNA-seq data. b) Training of Scaden model ensemble on simulated training data. c) Scaden ensemble architecture. d) A trained Scaden model can be used to deconvolve complex bulk mixtures.

Installation guide

Scaden can be easily installed on a Linux system, and should also work on Mac. There are currently two options for installing Scaden, either using Bioconda or via pip.

pip

To install Scaden via pip, simply run the following command:

pip install scaden

GPU

If you want to make use of your GPU, you will have to additionally install tensorflow-gpu.

For pip:

pip install tensorflow-gpu

For conda:

conda install tensorflow-gpu

Docker

If you don't want to install Scaden at all, but rather use a Docker container, we provide that as well. For every release, we provide two version - one for CPU and one for GPU usage. To pull the CPU container, use this command:

docker pull ghcr.io/kevinmenden/scaden/scaden

For the GPU container:

docker pull ghcr.io/kevinmenden/scaden/scaden-gpu

Webtool (beta)

Additionally, we now proivde a web tool:

https://scaden.ims.bio

It contains pre-generated training datasets for several tissues, and all you need to do is to upload your expression data. Please note that this is still in preview.

Usage

We provide a detailed instructions for how to use Scaden at our Documentation page

A deconvolution workflow with Scaden consists of four major steps:

  • data simulation
  • data processing
  • training
  • prediction

If training data is already available, you can start at the data processing step. Otherwise you will first have to process scRNA-seq datasets and perform data simulation to generate a training dataset. As an example workflow, you can use Scaden's function scaden example to generate example data and go through the whole pipeline.

First, make an example data directory and generate the example data:

mkdir example_data
scaden example --out example_data/

This generates the files "example_counts.txt", "example_celltypes.txt" and "example_bulk_data.txt" in the "example_data" directory. Next, you can generate training data:

scaden simulate --data example_data/ -n 100 --pattern "*_counts.txt

This generates 100 samples of training data in your current working directory. The file you need for your next step is called "data.h5ad". Now you need to perform the preprocessing using the training data and the bulk data file:

scaden process data.h5ad example_data/example_bulk_data.txt

As a result, you should now have a file called "processed.h5ad" in your directory. Now you can perform training. The following command performs training for 5000 steps per model and saves the trained weights to the "model" directory, which will be created:

scaden train processed.h5ad --steps 5000 --model_dir model

Finally, you can use the trained model to perform prediction:

scaden predict --model_dir model example_data/example_bulk_data.txt

Now you should have a file called "scaden_predictions.txt" in your working directory, which contains your estimated cell compositions.

1. System requirements

Scaden was developed and tested on Linux (Ubuntu 16.04 and 18.04). It was not tested on Windows or Mac, but should also be usable on these systems when installing with Pip or Bioconda. Scaden does not require any special hardware (e.g. GPU), however we recommend to have at least 16 GB of memory.

Scaden requires Python 3. All package dependencies should be handled automatically when installing with pip or conda.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].