All Projects → COMBINE-lab → Salmon

COMBINE-lab / Salmon

Licence: gpl-3.0
🐟 🍣 🍱 Highly-accurate & wicked fast transcript-level quantification from RNA-seq reads using selective alignment

Projects that are alternatives of or similar to Salmon

Grakel
A scikit-learn compatible library for graph kernels
Stars: ✭ 330 (-27.63%)
Mutual labels:  bioinformatics
Jcvi
Python library to facilitate genome assembly, annotation, and comparative genomics
Stars: ✭ 404 (-11.4%)
Mutual labels:  bioinformatics
Circosjs
d3 library to build circular graphs
Stars: ✭ 436 (-4.39%)
Mutual labels:  bioinformatics
Megahit
Ultra-fast and memory-efficient (meta-)genome assembler
Stars: ✭ 343 (-24.78%)
Mutual labels:  bioinformatics
Bowtie2
A fast and sensitive gapped read aligner
Stars: ✭ 365 (-19.96%)
Mutual labels:  bioinformatics
Sambamba
Tools for working with SAM/BAM data
Stars: ✭ 409 (-10.31%)
Mutual labels:  bioinformatics
Dash Bio
Open-source bioinformatics components for Dash
Stars: ✭ 329 (-27.85%)
Mutual labels:  bioinformatics
Vsearch
Versatile open-source tool for microbiome analysis
Stars: ✭ 444 (-2.63%)
Mutual labels:  bioinformatics
Jbrowse
A modern genome browser built with JavaScript and HTML5.
Stars: ✭ 393 (-13.82%)
Mutual labels:  bioinformatics
Biojava
📖🔬☕️ BioJava is an open-source project dedicated to providing a Java library for processing biological data.
Stars: ✭ 434 (-4.82%)
Mutual labels:  bioinformatics
Deeppurpose
A Deep Learning Toolkit for DTI, Drug Property, PPI, DDI, Protein Function Prediction (Bioinformatics)
Stars: ✭ 342 (-25%)
Mutual labels:  bioinformatics
Nanopolish
Signal-level algorithms for MinION data
Stars: ✭ 367 (-19.52%)
Mutual labels:  bioinformatics
Rush
A cross-platform command-line tool for executing jobs in parallel
Stars: ✭ 421 (-7.68%)
Mutual labels:  bioinformatics
Cutadapt
Cutadapt removes adapter sequences from sequencing reads
Stars: ✭ 340 (-25.44%)
Mutual labels:  bioinformatics
Wdl
Workflow Description Language - Specification and Implementations
Stars: ✭ 438 (-3.95%)
Mutual labels:  bioinformatics
Biopandas
Working with molecular structures in pandas DataFrames
Stars: ✭ 329 (-27.85%)
Mutual labels:  bioinformatics
Bwa Mem2
The next version of bwa-mem
Stars: ✭ 408 (-10.53%)
Mutual labels:  bioinformatics
Deeptools
Tools to process and analyze deep sequencing data.
Stars: ✭ 448 (-1.75%)
Mutual labels:  bioinformatics
Mmseqs2
MMseqs2: ultra fast and sensitive search and clustering suite
Stars: ✭ 441 (-3.29%)
Mutual labels:  bioinformatics
Containers
Bioinformatics containers
Stars: ✭ 435 (-4.61%)
Mutual labels:  bioinformatics
salmon logo

Build Status Documentation Status install with bioconda

Try out alevin (salmon's single-cell processing module)! Get started with the tutorial

Try out the new alevin-fry framework for single-cell analysis!

Help guide the development of Salmon, take our survey

Pre-computed decoy transcriptomes

tl;dr: fast is good but fast and accurate is better ! Although the precomputed decoys (<=v.14.2) are still compatible with the latest major release (v1.0.0). We recommend updating your index using the full genome, as it can give significantly higher accuracy. For more information, please check our extensive benchmarking comparing different alignment methods and their performance on RNA-seq quantification in the latest revised preprint manuscript. Please use the tutorial for a step-by-step guide on how to efficiently index the reference transcriptome and genome for accurate gentrome based RNA-seq quantification.

Specifically, there are 3 possible ways in which the salmon index can be created:

  • cDNA-only index : salmon_index - https://combine-lab.github.io/salmon/getting_started/. This method will result in the smallest index and require the least resources to build, but will be the most prone to possible spurious alignments.

  • SA mashmap index: salmon_partial_sa_index - (regions of genome that have high sequence similarity to the transcriptome) - Details can be found in this README and using this script. While running mashmap can require considerable resources, the resulting decoy files are fairly small. This will result in an index bigger than the cDNA-only index, but still mucch smaller than the full genome index below. It will confer many, though not all, of the benefits of using the entire genome as a decoy sequence.

  • SAF genome index: salmon_sa_index - (the full genome is used as decoy) - The tutorial for creating such an index can be found here. This will result in the largest index, but likely does the best job in avoiding spurious alignments to annotated transcripts.

Facing problems with Indexing ?, Check if anyone else already had this problem in the issues section or fill the index generation request form

What is Salmon?

Salmon is a wicked-fast program to produce a highly-accurate, transcript-level quantification estimates from RNA-seq data. Salmon achieves its accuracy and speed via a number of different innovations, including the use of selective-alignment (accurate but fast-to-compute proxies for traditional read alignments), and massively-parallel stochastic collapsed variational inference. The result is a versatile tool that fits nicely into many different pipelines. For example, you can choose to make use of our selective-alignment algorithm by providing Salmon with raw sequencing reads, or, if it is more convenient, you can provide Salmon with regular alignments (e.g. an unsorted BAM file with alignments to the transcriptome produced with your favorite aligner), and it will use the same wicked-fast, state-of-the-art inference algorithm to estimate transcript-level abundances for your experiment.

Give salmon a try! You can find the latest binary releases here.

The current version number of the master branch of Salmon can be found here

Documentation

The documentation for Salmon is available on ReadTheDocs, check it out here.

Salmon is, and will continue to be, freely and actively supported on a best-effort basis. If you need industrial-grade technical support, please consider the options at oceangenomics.com/support.

Chat live about Salmon

You can chat with the Salmon developers and other users via Gitter!

Join the chat at https://gitter.im/COMBINE-lab/salmon

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].