All Projects → theislab → scarches

theislab / scarches

Licence: BSD-3-Clause License
Reference mapping for single-cell genomics

Programming Languages

Jupyter Notebook
11667 projects
python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to scarches

scAlign
A deep learning-based tool for alignment and integration of single cell genomic data across multiple datasets, species, conditions, batches
Stars: ✭ 32 (-81.71%)
Mutual labels:  scrna-seq, single-cell, single-cell-genomics, human-cell-atlas
single cell portal
Tutorials, workflows, and convenience scripts for Single Cell Portal
Stars: ✭ 41 (-76.57%)
Mutual labels:  rna-seq-analysis, single-cell-genomics, human-cell-atlas
scarf
Toolkit for highly memory efficient analysis of single-cell RNA-Seq, scATAC-Seq and CITE-Seq data. Analyze atlas scale datasets with millions of cells on laptop.
Stars: ✭ 54 (-69.14%)
Mutual labels:  scrna-seq, single-cell-genomics, multiomics
fastq utils
Validation and manipulation of FASTQ files, scRNA-seq barcode pre-processing and UMI quantification.
Stars: ✭ 25 (-85.71%)
Mutual labels:  scrna-seq, single-cell
Awesome Single Cell
Community-curated list of software packages and data resources for single-cell, including RNA-seq, ATAC-seq, etc.
Stars: ✭ 1,937 (+1006.86%)
Mutual labels:  data-integration, single-cell
babel
Deep learning model for single-cell inference of multi-omic profiles from a single input modality.
Stars: ✭ 20 (-88.57%)
Mutual labels:  scrna-seq, single-cell
cardelino
Clone identification from single-cell data
Stars: ✭ 49 (-72%)
Mutual labels:  scrna-seq, single-cell
SPLiT-Seq demultiplexing
An unofficial demultiplexing strategy for SPLiT-seq RNA-Seq data
Stars: ✭ 20 (-88.57%)
Mutual labels:  scrna-seq, single-cell
celltypist
A tool for semi-automatic cell type annotation
Stars: ✭ 92 (-47.43%)
Mutual labels:  scrna-seq, single-cell
scGEAToolbox
scGEAToolbox: Matlab toolbox for single-cell gene expression analyses
Stars: ✭ 15 (-91.43%)
Mutual labels:  scrna-seq, rna-seq-analysis
Harmony
Harmony framework for connecting scRNA-seq data from discrete time points
Stars: ✭ 36 (-79.43%)
Mutual labels:  scrna-seq, batch-correction
single-cell-papers-with-code
Papers with code for single cell related papers
Stars: ✭ 20 (-88.57%)
Mutual labels:  scrna-seq, single-cell
EDS
💡 💾 💽 A simple, intuitive and Efficient single cell binary Data Storage format
Stars: ✭ 16 (-90.86%)
Mutual labels:  single-cell
EWCE
Expression Weighted Celltype Enrichment. See the package website for up-to-date instructions on usage.
Stars: ✭ 30 (-82.86%)
Mutual labels:  single-cell
SpiceMix
spatial transcriptome, single cell
Stars: ✭ 19 (-89.14%)
Mutual labels:  single-cell
ideal
Interactive Differential Expression AnaLysis - DE made accessible and reproducible
Stars: ✭ 24 (-86.29%)
Mutual labels:  rna-seq-analysis
data-product-batch
Template to deploy a Data Product for Batch data processing into a Data Landing Zone of the Data Management & Analytics Scenario (former Enterprise-Scale Analytics). The Data Product template can be used by cross-functional teams to ingest, provide and create new data assets within the platform.
Stars: ✭ 27 (-84.57%)
Mutual labels:  data-integration
alevin-fry
🐟 🔬🦀 alevin-fry is an efficient and flexible tool for processing single-cell sequencing data, currently focused on single-cell transcriptomics and feature barcoding.
Stars: ✭ 78 (-55.43%)
Mutual labels:  single-cell
Multimodal-Future-Prediction
The official repository for the CVPR 2019 paper "Overcoming Limitations of Mixture Density Networks: A Sampling and Fitting Framework for Multimodal Future Prediction"
Stars: ✭ 38 (-78.29%)
Mutual labels:  multimodal-deep-learning
MERINGUE
characterizing spatial gene expression heterogeneity in spatially resolved single-cell transcriptomics data with nonuniform cellular densities
Stars: ✭ 33 (-81.14%)
Mutual labels:  single-cell

PyPI PyPIDownloads Docs travis

scArches - single-cell architecture surgery

scArches is a package to integrate newly produced single-cell datasets into integrated reference atlases. Our method can facilitate large collaborative projects with decentralized training and integration of multiple datasets by different groups.

Updates

  • (6.02.2022) We have added expiMap to scArches code base. expiMap alows interpretable reference mapping. See tutorials here here .

What can you do with scArches?

  • Construct single or multi-modal (CITE-seq) reference atlases and share the trained model and the data (if possible).
  • Download a pre-trained model for your atlas of interest, update it with new datasets and share with your collaborators.
  • Project and integrate query datasets on the top of a reference and use latent representation for downstream tasks, e.g.:diff testing, clustering, classification

What are the different models?

scArches is an algorithm to map to project query on the top of reference datasets and applies to different models. Here we provide a short explanation and hints on when to use which model. Our models are divided into three categories:

Unsupervised

This class of algorithms require no cell type labels, meaning that you can create a reference and project a query without having access to cell type labels. We implemented two algorithms:

  • scVI (Lopez et al., 2018): Requires access to raw counts values for data integration and assumes

count distribution on the data (NB, ZINB, Poisson).

  • trVAE (Lotfollahi et al.,2020): It supports both normalized log transformed or count data as input and applies additional MMD loss to have better merging in the latent space.
Supervised and Semi-supervised

This class of algorithms assumes the user has access to cell type labels when creating the reference data and usually perform better integration compared to. unsupervised methods. However, query data still can be unlabeled. In addition to integration, you can classify your query cells using these methods.

  • scANVI (Xu et al., 2019): It needs cell type labels for reference data. Your query data can be either unlabeled or labeled. In the case of unlabeled query data, you can use this method to also classify your query cells using reference labels.
  • scGen (Lotfollahi et al., 2019): This method requires cell-type labels for both reference building and query mapping. The query mapping for this method solely relies on the integrated reference and requre no fine-tuning.
Bioligically informed
  • expiMap (Lotfollahi, Rybakov et al., 2022): This method takes prior knowledge from gene sets databases or users allowing to analyze your query data in the context of known gene programs.
Multi-modal

These algorithms can be used to construct multi-modal references atlas and map query data from either modality on the top of the reference.

  • totalVI (Gayoso al., 2019): This model can be used to build multi-modal CITE-seq reference atalses. Query datasets can be either from sc-RNAseq or CITE-seq. In addition to integrating query with reference, one can use this model to impute the Proteins in the query datasets.

Usage and installation

See here for documentation and tutorials.

Support and contribute

If you have a question or new architecture or a model that could be integrated into our pipeline, you can post an issue or reach us by email.

Reference

If scArches is useful in your research, please consider citing following paper:

@article{lotfollahi2021mapping,
  title={Mapping single-cell data to reference atlases by transfer learning},
  author={Lotfollahi, Mohammad and Naghipourfar, Mohsen and Luecken, Malte D and Khajavi,
  Matin and B{\"u}ttner, Maren and Wagenstetter, Marco and Avsec, {\v{Z}}iga and Gayoso,
  Adam and Yosef, Nir and Interlandi, Marta and others},
  journal={Nature Biotechnology},
  pages={1--10},
  year={2021},
  publisher={Nature Publishing Group}}
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].