
MariaNattestad / Assemblytics

Licence: MIT license
Assemblytics is a bioinformatics tool to detect and analyze structural variants from a genome assembly by comparing it to a reference genome.

Programming Languages

JavaScript, Python, PHP, CSS, R, Perl

Assemblytics: a web analytics tool for the detection of variants from an assembly

Assemblytics is available online at http://assemblytics.com

Please cite our paper in Bioinformatics: http://www.ncbi.nlm.nih.gov/pubmed/27318204

The preprint remains freely available on bioRxiv: https://www.biorxiv.org/content/10.1101/044925v1

There are three ways to use Assemblytics:

  1. Use the hosted online version at http://assemblytics.com. This is the easiest option.
  2. Run it from the command-line. For this you need only the contents of the scripts/ directory. See instructions below.
  3. Run the full web app from a local server. See instructions below.

Important: Use only contigs, not scaffolds, from the assembly. This prevents false positives that arise when the number of Ns in a scaffolded sequence does not exactly match the corresponding distance in the reference.
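If your assembly is only available as scaffolds, one way to get back to contigs is to break each scaffold wherever it contains a run of Ns. The following is a minimal sketch of that idea, not part of Assemblytics itself: the function name `split_scaffold`, the `min_gap` parameter, and the `_contigN` naming scheme are all hypothetical choices made for illustration.

```python
import re


def split_scaffold(name, sequence, min_gap=1):
    """Split one scaffold into contigs at runs of N (upper or lower case).

    name      -- scaffold identifier (hypothetical naming scheme below)
    sequence  -- scaffold sequence as a plain string
    min_gap   -- minimum number of consecutive Ns treated as a gap
    """
    gap_pattern = re.compile(r"[Nn]{%d,}" % min_gap)
    # Drop empty pieces produced by leading or trailing gaps.
    pieces = [piece for piece in gap_pattern.split(sequence) if piece]
    return [
        (f"{name}_contig{i}", piece)
        for i, piece in enumerate(pieces, start=1)
    ]


if __name__ == "__main__":
    for contig_name, contig_seq in split_scaffold("scaffold1", "ACGTNNNNNGGCCNTTAA"):
        print(contig_name, contig_seq)
```

Raising `min_gap` keeps isolated ambiguous bases inside a contig and splits only at longer gaps; which threshold is appropriate depends on how your assembler marks gaps.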

Dependencies

  • R
    • ggplot2
    • plyr
    • RColorBrewer
    • scales
  • Python
    • argparse
    • numpy
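Before running the scripts, it can help to confirm that the dependencies listed above are visible from your environment. The sketch below is one hedged way to do that from Python. It assumes that R is invoked through an `Rscript` executable on your PATH (an assumption, not something stated above), and it does not check the R packages themselves. The function name `check_dependencies` is hypothetical.

```python
import importlib.util
import shutil


def check_dependencies(python_modules=("argparse", "numpy"),
                       executables=("Rscript",)):
    """Return a dict mapping each dependency to True (found) or False.

    The executable name "Rscript" is an assumption about how R is
    installed; adjust it to match your system.
    """
    status = {}
    for module in python_modules:
        # find_spec returns None when a top-level module cannot be found.
        status[f"python:{module}"] = importlib.util.find_spec(module) is not None
    for exe in executables:
        # shutil.which returns None when the executable is not on PATH.
        status[f"exe:{exe}"] = shutil.which(exe) is not None
    return status


if __name__ == "__main__":
    for dependency, found in check_dependencies().items():
        print(f"{dependency}: {'ok' if found else 'MISSING'}")
```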

Command-line instructions

If you prefer to run Assemblytics from the command line, the scripts/ directory contains all the code you need, from unique anchor filtering and variant calling to creating the output plots and summary tables.

To run Assemblytics on the command-line, keep all the scripts together inside the scripts/ directory, either in your PATH or anywhere else you like, and make them all executable:

chmod a+x scripts/Assemblytics*

Keeping the scripts together in the same folder will allow the main Assemblytics script to call all the other scripts that do filtering, analysis, indexing, and plotting.

Follow the instructions at http://assemblytics.com for how to prepare your data and get a delta file for Assemblytics.

Then run Assemblytics:

scripts/Assemblytics <delta_file> <output_prefix> <unique_anchor_length> <min_variant_size> <max_variant_size>
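If you want to call Assemblytics from a pipeline, the five positional arguments above can be assembled in Python and passed to `subprocess`. This is a minimal sketch, assuming the script lives at `scripts/Assemblytics` relative to the working directory. The helper names and the argument values in the example (`sample.delta`, `sample_out`, 10000, 50, 10000) are placeholders for illustration, not recommended settings.

```python
import subprocess


def build_assemblytics_command(delta_file, output_prefix,
                               unique_anchor_length,
                               min_variant_size, max_variant_size,
                               script="scripts/Assemblytics"):
    """Build the argument list in the positional order shown above."""
    if min_variant_size > max_variant_size:
        raise ValueError("min_variant_size must not exceed max_variant_size")
    return [
        script,
        str(delta_file),
        str(output_prefix),
        str(unique_anchor_length),
        str(min_variant_size),
        str(max_variant_size),
    ]


def run_assemblytics(*args, **kwargs):
    """Run Assemblytics and raise CalledProcessError on a non-zero exit."""
    command = build_assemblytics_command(*args, **kwargs)
    return subprocess.run(command, check=True)


if __name__ == "__main__":
    # Placeholder values; substitute your own delta file and thresholds.
    cmd = build_assemblytics_command("sample.delta", "sample_out", 10000, 50, 10000)
    print(" ".join(cmd))
```

Passing the arguments as a list rather than a single shell string avoids quoting problems when file names contain spaces.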

Local web app instructions

The whole web application can be downloaded and run locally. This gives you the graphical user interface and the interactive dot plot, which is available only in the web version and not from the command line.

Notes for installation:

  • Use a local server such as Apache and follow that server's setup instructions.
  • Clone this repository into a folder called assemblytics, so that the .htaccess file points the server correctly to the public/ folder, where index.php and the other pages and web app resources are located.
  • Make sure to open up permissions on user_uploads and user_data so the web server can read and write there.
  • The repository does not include the examples, as some of them are very large files.