All Projects → jordanlab → STing

jordanlab / STing

Licence: other
Ultrafast sequence typing and gene detection from NGS raw reads

Programming Languages

C++
36643 projects - #6 most used programming language
c
50402 projects - #5 most used programming language
javascript
184084 projects - #8 most used programming language
HTML
75241 projects
CMake
9771 projects
r
7636 projects

Projects that are alternatives of or similar to STing

bac-genomics-scripts
Collection of scripts for bacterial genomics
Stars: ✭ 39 (+160%)
Mutual labels:  genomics, ngs, computational-biology, mlst
Htsjdk
A Java API for high-throughput sequencing data (HTS) formats.
Stars: ✭ 220 (+1366.67%)
Mutual labels:  genomics, ngs, dna
Galaxy
Data intensive science for everyone.
Stars: ✭ 812 (+5313.33%)
Mutual labels:  genomics, ngs, dna
Gatk
Official code repository for GATK versions 4 and up
Stars: ✭ 1,002 (+6580%)
Mutual labels:  genomics, ngs, dna
Deepvariant
DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data.
Stars: ✭ 2,404 (+15926.67%)
Mutual labels:  genomics, ngs, dna
catch
A package for designing compact and comprehensive capture probe sets.
Stars: ✭ 55 (+266.67%)
Mutual labels:  genomics, ngs, dna
Jvarkit
Java utilities for Bioinformatics
Stars: ✭ 313 (+1986.67%)
Mutual labels:  genomics, ngs
Deeptools
Tools to process and analyze deep sequencing data.
Stars: ✭ 448 (+2886.67%)
Mutual labels:  genomics, ngs
Vg
tools for working with genome variation graphs
Stars: ✭ 710 (+4633.33%)
Mutual labels:  genomics, dna
dna-traits
A fast 23andMe genome text file parser, now superseded by arv
Stars: ✭ 64 (+326.67%)
Mutual labels:  genomics, dna
Sns
Analysis pipelines for sequencing data
Stars: ✭ 43 (+186.67%)
Mutual labels:  genomics, dna
Genomics
A collection of scripts and notes related to genomics and bioinformatics
Stars: ✭ 101 (+573.33%)
Mutual labels:  genomics, dna
Pyfaidx
Efficient pythonic random access to fasta subsequences
Stars: ✭ 307 (+1946.67%)
Mutual labels:  genomics, dna
Bio.jl
[DEPRECATED] Bioinformatics and Computational Biology Infrastructure for Julia
Stars: ✭ 257 (+1613.33%)
Mutual labels:  genomics, dna
Nucleus
Python and C++ code for reading and writing genomics data.
Stars: ✭ 657 (+4280%)
Mutual labels:  genomics, dna
mccortex
De novo genome assembly and multisample variant calling
Stars: ✭ 105 (+600%)
Mutual labels:  genomics, kmer
Biopython
Official git repository for Biopython (originally converted from CVS)
Stars: ✭ 2,936 (+19473.33%)
Mutual labels:  genomics, dna
awesome-genetics
A curated list of awesome bioinformatics software.
Stars: ✭ 60 (+300%)
Mutual labels:  genomics, dna
tiptoft
Predict plasmids from uncorrected long read data
Stars: ✭ 27 (+80%)
Mutual labels:  genomics, kmer
Ngless
NGLess: NGS with less work
Stars: ✭ 115 (+666.67%)
Mutual labels:  genomics, ngs

STing

Table of contents:

Documentation

Documentation Status
See the full documentation at: https://sting.readthedocs.io/

Requirements

  • Linux OS
  • gcc >= 4.8 and gcc <= 5.3
    You may use gcc >= 5.3 to compile STing but you will see multiple warning messages. Those warning messages will not interfere with the binary generation.
  • autotols (Ubuntu: autotools)
    • Autoconf >= 2.69
    • Automake >= 1.15.1
    • Libtool >= 2.4.6
  • autotools developer headers (Ubuntu: autotools-dev)
  • zlib >= 1.2.8 (Ubuntu: zlib)
  • zlib developer headers (Ubuntu: zlib1g-dev)

Installation

Using the pre-built static binaries

wget https://github.com/jordanlab/STing/releases/download/v1.0.1/STing_v1.0.1.tar.gz
tar xfv STing_v1.0.1.tar.gz
wget https://github.com/jordanlab/STing/raw/master/scripts/db_util.py -O STing/db_util.py
chmod +x STing/db_util.py
export PATH=$PWD/STing:$PATH

From source:

./autogen.sh
./configure
make
make install

By default, make install will install all the files in /usr/local/bin, /usr/local/lib etc. You can specify an installation prefix other than /usr/local using the --prefix options from ./configure, for instance ./configure --prefix=$HOME. Please check all the available options of ./configure by executing ./configure --help.

Quickstart guide

Pre-built MLST databases

Pre-built MLST databases are provided here and updated weekly: https://github.com/jordanlab/STing_datasets

We recommend building a new database each time STing is run, or daily if under continual use, to ensure the most up to date definitions as possible

Typing

Preparing directory:

mkdir STing_demo
cd STing_demo

Downloading the Neisseria spp. MLST database from PubMLST.org and build a STing index from it:

db_util.py fetch --query "Neisseria spp." --out_dir my_dbs --build_index

Downloading a WGS sample to analyze:

wget ftp://ftp.sra.ebi.ac.uk/vol1/fastq/ERR026/ERR026529/ERR026529_1.fastq.gz ftp://ftp.sra.ebi.ac.uk/vol1/fastq/ERR026/ERR026529/ERR026529_2.fastq.gz

Run STing typer:

typer -x my_dbs/neisseria_spp/db/index -1 ERR026529_1.fastq.gz -2 ERR026529_2.fastq.gz -s ERR026529 -c -a -d -t ERR026529.depth.tsv -y -o ERR026529.results.tsv --sensitive

STing applications

STing has three applications:

  • indexer : Creates the indexes (databases) required to execute a locus-based typing analysis and detect genes.

  • typer : Predicts STs of a read sample based on an index built from a species-specific locus-based typing scheme (allelic profile table and sequences of observed alleles).

  • detector : Detects genes in a read sample based on an index built from sequences of a set of genes of interest.

To explore the usage and available options for each tool, run the corresponding application using the option -h or --help:

indexer -h
typer -h
detector -h
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].