All Projects → kharchenkolab → dropEst

kharchenkolab / dropEst

Licence: GPL-3.0 License
Pipeline for initial analysis of droplet-based single-cell RNA-seq data

Programming Languages

C++
36643 projects - #6 most used programming language
r
7636 projects
CMake
9771 projects

Projects that are alternatives of or similar to dropEst

scarf
Toolkit for highly memory efficient analysis of single-cell RNA-Seq, scATAC-Seq and CITE-Seq data. Analyze atlas scale datasets with millions of cells on laptop.
Stars: ✭ 54 (-23.94%)
Mutual labels:  scrna-seq, single-cell-rna-seq
skippa
SciKIt-learn Pipeline in PAndas
Stars: ✭ 33 (-53.52%)
Mutual labels:  pipeline, preprocessing
SeqTools
A python library to manipulate and transform indexable data (lists, arrays, ...)
Stars: ✭ 42 (-40.85%)
Mutual labels:  pipeline, preprocessing
SPLiT-Seq demultiplexing
An unofficial demultiplexing strategy for SPLiT-seq RNA-Seq data
Stars: ✭ 20 (-71.83%)
Mutual labels:  scrna-seq, single-cell-rna-seq
sparklanes
A lightweight data processing framework for Apache Spark
Stars: ✭ 17 (-76.06%)
Mutual labels:  pipeline, preprocessing
scarches
Reference mapping for single-cell genomics
Stars: ✭ 175 (+146.48%)
Mutual labels:  scrna-seq
bifrost
A stream processing framework for high-throughput applications.
Stars: ✭ 48 (-32.39%)
Mutual labels:  pipeline
metagraf
metaGraf is a opinionated specification for describing a software component and what its requirements are from the runtime environment. The mg command, turns metaGraf specifications into Kubernetes resources, supporting CI, CD and GitOps software delivery.
Stars: ✭ 15 (-78.87%)
Mutual labels:  pipeline
text-normalizer
Normalize text string
Stars: ✭ 12 (-83.1%)
Mutual labels:  preprocessing
JT1078Gateway
基于Pipeline实现的JT1078Gateway支持TCP/UDP,目前只支持http-flv、ws-flv、hls三种拉流方式
Stars: ✭ 50 (-29.58%)
Mutual labels:  pipeline
fastq utils
Validation and manipulation of FASTQ files, scRNA-seq barcode pre-processing and UMI quantification.
Stars: ✭ 25 (-64.79%)
Mutual labels:  scrna-seq
babel
Deep learning model for single-cell inference of multi-omic profiles from a single input modality.
Stars: ✭ 20 (-71.83%)
Mutual labels:  scrna-seq
prose
A python framework to process FITS images. Built for Astronomy.
Stars: ✭ 21 (-70.42%)
Mutual labels:  pipeline
image-processing-pipeline
An image build orchestrator for the modern web
Stars: ✭ 43 (-39.44%)
Mutual labels:  pipeline
bistro
A library to build and execute typed scientific workflows
Stars: ✭ 43 (-39.44%)
Mutual labels:  pipeline
pipelines-as-code
Pipelines as Code
Stars: ✭ 37 (-47.89%)
Mutual labels:  pipeline
StackedDAE
Stacked Denoising AutoEncoder based on TensorFlow
Stars: ✭ 23 (-67.61%)
Mutual labels:  single-cell-rna-seq
elasticsearch-ingest-attachment-plugin-example
Example of how to use ElasticSearch ingest-attachment plugin using JavaScript
Stars: ✭ 19 (-73.24%)
Mutual labels:  pipeline
tweets-preprocessor
Repo containing the Twitter preprocessor module, developed by the AUTH OSWinds team
Stars: ✭ 26 (-63.38%)
Mutual labels:  preprocessing
etl
M-Lab ingestion pipeline
Stars: ✭ 15 (-78.87%)
Mutual labels:  pipeline

dropEst - Pipeline

Pipeline for estimating molecular count matrices for droplet-based single-cell RNA-seq measurements. If you use the pipeline in your research, please cite the corresponding paper. To reproduce results from the paper, please see this repository.

Documentation

For detailed explanations, please see the documentation

Particularly:

If you have problems with installation, please look at the Troubleshooting page and open an issue if there is nothing.

News

[0.8.6] - 2019-08-01

  • Added support for Drop-seq and CEL-Seq2

See Changelog for the full list.

General processing steps

  1. dropTag: extraction of cell barcodes and UMIs from the library. Result: demultiplexed .fastq.gz files, which should be aligned to the reference.
  2. Alignment of the demultiplexed files to reference genome. Result: .bam files with the alignment.
  3. dropEst: building count matrix and estimation of some statistics, necessary for quality control. Result: .rds file with the count matrix and statistics. Optionally: count matrix in MatrixMarket format.
  4. dropReport - Generating report on library quality.
  5. dropEstR - R pacakge for UMI count corrections and cell quality classification

Examples

Complete examples of the pipeline can be found at EXAMPLES.md.

Here are results of processing of neurons_900 10x dataset.

Supported protocols

  • 10x
  • CEL-Seq2
  • Drop-seq
  • iCLIP
  • inDrop (v1-3)
  • Seq-Well
  • SPLiT-seq

Citation

If you find this pipeline useful for your research, please consider citing the paper:

Petukhov, V., Guo, J., Baryawno, N., Severe, N., Scadden, D. T., Samsonova, M. G., & Kharchenko, P. V. (2018). dropEst: pipeline for accurate estimation of molecular counts in droplet-based single-cell RNA-seq experiments. Genome biology, 19(1), 78. doi:10.1186/s13059-018-1449-6

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].