All Projects → compbiol → CAMSA

compbiol / CAMSA

Licence: MIT License
CAMSA: a tool for Comparative Analysis and Merging of Scaffold Assemblies

Programming Languages

python
139335 projects - #7 most used programming language
javascript
184084 projects - #8 most used programming language
HTML
75241 projects
powershell
5483 projects
CSS
56736 projects
Batchfile
5799 projects
shell
77523 projects

Projects that are alternatives of or similar to CAMSA

downpore
Suite of tools for use in genome assembly and consensus. Work in progress.
Stars: ✭ 32 (+77.78%)
Mutual labels:  genome-assembly, genome-scaffolding
MGSE
Mapping-based Genome Size Estimation (MGSE) performs an estimation of a genome size based on a read mapping to an existing genome sequence assembly.
Stars: ✭ 22 (+22.22%)
Mutual labels:  genomics, genome-assembly
mccortex
De novo genome assembly and multisample variant calling
Stars: ✭ 105 (+483.33%)
Mutual labels:  genomics, genome-assembly
indelope
find large indels (in the blind spot between GATK/freebayes and SV callers)
Stars: ✭ 38 (+111.11%)
Mutual labels:  genomics, genome-assembly
haslr
A fast tool for hybrid genome assembly of long and short reads
Stars: ✭ 68 (+277.78%)
Mutual labels:  genomics, genome-assembly
LRSDAY
LRSDAY: Long-read Sequencing Data Analysis for Yeasts
Stars: ✭ 26 (+44.44%)
Mutual labels:  genomics, genome-assembly
berokka
🍊 💫 Trim, circularise and orient long read bacterial genome assemblies
Stars: ✭ 23 (+27.78%)
Mutual labels:  genomics, genome-assembly
instaGRAAL
Large genome reassembly based on Hi-C data, continuation of GRAAL
Stars: ✭ 32 (+77.78%)
Mutual labels:  genomics, genome-assembly
redundans
Redundans is a pipeline that assists an assembly of heterozygous/polymorphic genomes.
Stars: ✭ 90 (+400%)
Mutual labels:  genomics, genome-assembly
mcscan
Command-line program to wrap dagchainer and combine pairwise results into multi-alignments in column format
Stars: ✭ 18 (+0%)
Mutual labels:  genomics, comparative-genomics
bystro
Bystro genetic analysis (annotation, filtering, statistics)
Stars: ✭ 31 (+72.22%)
Mutual labels:  genomics
FALDO
Feature Annotation Location Description Ontology
Stars: ✭ 28 (+55.56%)
Mutual labels:  genomics
netSmooth
netSmooth: A Network smoothing based method for Single Cell RNA-seq imputation
Stars: ✭ 23 (+27.78%)
Mutual labels:  genomics
Ontologies
Home of the Genomic Feature and Variation Ontology (GFVO)
Stars: ✭ 16 (-11.11%)
Mutual labels:  genomics
smartas
📓Notebook of Climente-González et al. (2017), The Functional Impact of Alternative Splicing in Cancer.
Stars: ✭ 13 (-27.78%)
Mutual labels:  genomics
chromap
Fast alignment and preprocessing of chromatin profiles
Stars: ✭ 93 (+416.67%)
Mutual labels:  genomics
dropClust
Version 2.1.0 released
Stars: ✭ 19 (+5.56%)
Mutual labels:  genomics
staramr
Scans genome contigs against the ResFinder, PlasmidFinder, and PointFinder databases.
Stars: ✭ 52 (+188.89%)
Mutual labels:  genomics
companion
This repository has been archived, currently maintained version is at https://github.com/iii-companion/companion
Stars: ✭ 21 (+16.67%)
Mutual labels:  genomics
switchde
Inference of switch-like differential expression along single-cell trajectories
Stars: ✭ 19 (+5.56%)
Mutual labels:  genomics

CAMSA

MIT licensed Build Status Build status

CAMSA - is a tool for Comparative Analysis and Merging of Scaffold Assemblies, distributed both as a standalone software package and as Python library under the MIT license.

Main CAMSA features:

  1. works with any number of scaffold assemblies in de-novo non-progressive fashion
  2. allows to simultaniously work with scaffold assemblies obtained from any in silico and in vitro techniques, supporting multiple existing formats via built-in converters
  3. creates an extensive report with several comparative quality metrics (both on assembly level and on the level of individual assembly points)
  4. constructs a merged combined scaffold assembly
  5. provides an interactive framework for a visual comparative analysis of the given assemblies

CAMSA is developed using Python programming language and is compatible with both Python2 (2.7+) and Python3 (3.5+). This means CAMSA can be run on any modern operating system properly.

Note: only Linux and MacOS were tested extensively with respect to CAMSA installation and usage. If you want to use CAMSA on windows, please try to do so on your own (we have continuous integration setup on Windows platform as well, but no as detailed as on MacOS/Linux), and if you run into any problems - contact us.

Installation:

The simplest way to install CAMSA is by using pip. Open your terminal and execute the following command:

pip install camsa

For more details and other ways to install CAMSA please refer to the installation wiki page.

Input

CAMSA expect as an input a set of different assemblies on the same set of scaffolds. Eash assembly must be represented as a set of assemblies points in TSF files, using the following format:

origin    seq1    seq1_or    seq2    seq2_or    gap_size    cw
A1        s1      +          s2      -          ?           ?
A1        s2      -          s3      +          ?           ?
...

CAMSA also provides a set of built-in scripts, that allow one to translate other scaffold assembly formats (i.e., FASTA, AGPv2.0, GRIMM, etc) into CAMSA format, and vice-versa (when possible). For more details please refer to the input wiki page.

Usage

CAMSA is very straightforward to use: installation process adds several executable scripts to your (python)path, which either execute CAMSA itself, or some of its utilities. Basic CAMSA usage is

run_camsa.py f1.camsa.points f2.camsa.points ... -o output_dir

This command would run CAMSA in both comparative and merging modes, producing extensive comparative assembly reports as well as a merged assembly. For more details on running CAMSA please refer to the usage wiki page.

Output

CAMSA comparative analysis results are stored in the automatically generated a set of text-based reports as well as a single comprehensive interactive report, powered by HTML, CSS, and JavaScript. This report format allows one to process the output on any operating system, easily share results with colleges and collaborators, as well as ensure reproducibility of each experiment. The output folder also contains all the required libraries for the interactive report to properly work, so internet connection is required.

For more details regarding both text-based and interactive reports please refer to the output wiki page.

Contributing / Bug reports

Please submit any information about identified bugs to the corresponding GitHub bug tracker system.

Thank you!

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].