smangul1 / rop

License: GPL-3.0
The Read Origin Protocol (ROP) is a computational protocol that aims to discover the source of all reads, including those originating from repeat sequences, recombinant B and T cell receptors, and microbial communities.

Read Origin Protocol

The Read Origin Protocol (ROP) is a computational protocol that aims to discover the source of all reads, including those originating from complex RNA molecules, recombinant antibodies, and microbial communities.

Created by Serghei Mangul, Kevin Hsieh, Linus Chen, and Harry Taegyun Yang at the University of California, Los Angeles (UCLA). Released under the terms of the General Public License, version 3.0 (GPLv3).

Installation

To install ROP, first clone this repository, then run

./install.sh

from the repository's directory. This will download dependencies and databases. The default installation will generally suffice, but the following options are available:

  • -c|--clean: Just remove installed tools.
    • To reinstall, use -r|--reinstall instead.
    • Overrides conflicting options.
  • -f|--force: Unlink databases.
    • Use with caution.
  • -n|--native: Use native Python.
    • Miniconda will not be downloaded.
    • You may use environment.yml to set up your Python environment.
  • -l|--link LINK: Link databases instead of downloading.
    • Useful if you previously downloaded an ROP database.
    • A symlink will be created in the current directory.
    • Overrides conflicting options.
  • -d|--db-dest DB_DEST (default: .): Change database download location.
    • Useful for managing space.
    • A symlink will be created in the current directory.
  • -o|--organism ORGANISM (default: human): Organism to download databases for.
    • Exactly one of the following: human, mouse.
  • -r|--reinstall: Reinstall tools, even if they're already present.
  • -s|--select-db SELECT_DB (default: all): Database(s) to download for the specified organism.
    • A comma-separated list of one or more of the following: basic, repeats, microbiome (which may be subdivided into metaphlan, viral, fungi, protozoa).
    • -s all selects everything.
  • -h|--help: Displays usage information.
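
As an illustration of how these options combine, the sketch below assembles an install.sh command line for a mouse installation that downloads only the basic and repeats databases to a separate location. The build_install_cmd helper and the /data/rop_db path are hypothetical and exist only for this example; the flags are the ones listed above. The helper prints the command instead of running it, so it can be inspected first.

```shell
# Hypothetical helper (not part of ROP): print an install.sh command line
# built from the documented -o, -s and -d options, without executing it.
build_install_cmd() {
    local organism="$1"   # human or mouse
    local select_db="$2"  # e.g. all, or a comma-separated list such as basic,repeats
    local db_dest="$3"    # directory that will hold the downloaded databases

    # The option list above allows exactly one of: human, mouse.
    case "$organism" in
        human|mouse) ;;
        *) echo "organism must be human or mouse" >&2; return 1 ;;
    esac

    # Arguments are printed unquoted, so avoid paths containing spaces here.
    printf './install.sh -o %s -s %s -d %s\n' "$organism" "$select_db" "$db_dest"
}

# /data/rop_db is an assumed path; substitute a location with enough free space.
build_install_cmd mouse basic,repeats /data/rop_db
```

Once the printed command looks right, it can be pasted into a shell opened in the repository's directory.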

Usage

To use ROP, run

rop.sh unmapped_reads output_dir

Unless otherwise specified using an option, unmapped_reads must be a .fastq/.fq file, and output_dir must not exist (it will be created). Results will be written to output_dir, with one subdirectory for every stage of the pipeline. The following options are available:

  • -o|--organism (default: human): Organism to run ROP for.
    • Exactly one of the following: human, mouse.
  • -s|--steps (default: all except lowq): Select the analysis modes to use.
    • A comma-separated list of one or more of the following: lowq, rdna, reference, repeats, circrna, immune, microbiome (which may be subdivided into metaphlan, bacteria, viral, fungi, protozoa).
    • circrna and bacteria are not available in the current release.
    • -s all selects everything.
  • -a|--fasta: Input unmapped reads in .fasta format instead of .fastq format. Forcibly disables low-quality read filtering.
  • -b|--bam: Input unmapped reads in .bam format instead of .fastq format.
  • -z|--gzip: gunzip the input file.
  • -d|--dev: Keep intermediate FASTA files.
    • Consumes extra space.
  • -f|--force: Overwrite the analysis destination directory.
  • -i|--ignore-extensions: Ignore incorrect .fastq/.fq/.fasta/.fa file extensions. Does not ignore incorrect .gz/.bam file extensions.
  • -m|--max: Use a liberal threshold when remapping to reference.
    • May account for more reads.
  • -x|--commands: Print all commands (diagnostic mode).
  • -h|--help: Displays usage information.
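
The input-format options above (-a, -b, -z) depend on the file type, so a wrapper script may want to pick them from the file name. The following is a minimal sketch of that idea. The rop_input_flags helper is hypothetical and not part of ROP, and it only covers the extensions named in this README.

```shell
# Hypothetical helper (not part of ROP): choose the input-format flags that
# match a file name, following the option list above.
rop_input_flags() {
    case "$1" in
        *.bam)          echo "-b"  ;;  # BAM input
        *.fastq.gz)     echo "-z"  ;;  # gzipped FASTQ
        *.fasta.gz)     echo "-az" ;;  # gzipped FASTA
        *.fasta|*.fa)   echo "-a"  ;;  # plain FASTA
        *.fastq|*.fq)   echo ""    ;;  # default format, no flag needed
        *) echo "unrecognized extension: $1" >&2; return 1 ;;
    esac
}

# Print (rather than run) the resulting command for one input file.
input="example/example.fasta.gz"
flags="$(rop_input_flags "$input")"
echo "rop.sh${flags:+ $flags} $input ropout"
```

Because -a disables low-quality read filtering, FASTQ input is preferable when both formats are available.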

A small example file is included in the repository in various formats. To try it out, run one of the following commands from the repository directory:

rop.sh -b example/example.bam ropout
rop.sh example/example.fastq ropout
rop.sh -z example/example.fastq.gz ropout
rop.sh -a example/example.fasta ropout
rop.sh -az example/example.fasta.gz ropout

Then, browse to the ropout directory to see the analysis results!
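
Since output_dir must not already exist, running a second example command with the same ropout directory will be refused unless -f is given. One way around this, sketched below, is to give each example its own output directory. The print_example_runs helper and the ropout_* directory names are illustrative assumptions; the flags and file names are the ones listed above. The commands are printed rather than executed.

```shell
# Hypothetical helper (not part of ROP): print one rop.sh command per example
# file, each writing to a distinct output directory.
print_example_runs() {
    local spec flags ext outdir
    # Each entry is "flags:extension"; the FASTQ entry needs no flags.
    for spec in "-b:bam" ":fastq" "-z:fastq.gz" "-a:fasta" "-az:fasta.gz"; do
        flags="${spec%%:*}"                        # text before the first colon
        ext="${spec#*:}"                           # text after the first colon
        outdir="ropout_$(echo "$ext" | tr . _)"    # e.g. ropout_fastq_gz
        echo "rop.sh${flags:+ $flags} example/example.$ext $outdir"
    done
}

print_example_runs
```

Alternatively, keep a single ropout directory and add -f to each command to overwrite the previous results.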
