All Projects → pgxcentre → region-plot

pgxcentre / region-plot

Licence: other
A tool to plot significant regions of GWAS

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to region-plot

manhattan generator
Manhattan plot Generator
Stars: ✭ 20 (+0%)
Mutual labels:  genomics, genetics, plot
HumanIdiogramLibrary
Resource of human chromosome schematics & images
Stars: ✭ 76 (+280%)
Mutual labels:  genomics, genetics
dee2
Digital Expression Explorer 2 (DEE2): a repository of uniformly processed RNA-seq data
Stars: ✭ 32 (+60%)
Mutual labels:  genomics, genetics
graphsim
R package: Simulate Expression data from igraph network using mvtnorm (CRAN; JOSS)
Stars: ✭ 16 (-20%)
Mutual labels:  genomics, genetics
awesome-genetics
A curated list of awesome bioinformatics software.
Stars: ✭ 60 (+200%)
Mutual labels:  genomics, genetics
genipe
Genome-wide imputation pipeline
Stars: ✭ 28 (+40%)
Mutual labels:  genomics, genetics
fwdpy11
Forward-time simulation in Python using fwdpp
Stars: ✭ 25 (+25%)
Mutual labels:  genomics, genetics
GCModeller
GCModeller: genomics CAD(Computer Assistant Design) Modeller system in .NET language
Stars: ✭ 25 (+25%)
Mutual labels:  genomics
variantkey
Numerical Encoding for Human Genetic Variants
Stars: ✭ 32 (+60%)
Mutual labels:  genomics
phastaf
Identify phage regions in bacterial genomes for masking purposes
Stars: ✭ 22 (+10%)
Mutual labels:  genomics
spark-vcf
Spark VCF data source implementation for Dataframes
Stars: ✭ 15 (-25%)
Mutual labels:  genomics
fq
Command line utility for manipulating Illumina-generated FastQ files.
Stars: ✭ 31 (+55%)
Mutual labels:  genomics
mlst check
Multilocus sequence typing by blast using the schemes from PubMLST
Stars: ✭ 22 (+10%)
Mutual labels:  genomics
GeneLab Data Processing
No description or website provided.
Stars: ✭ 32 (+60%)
Mutual labels:  genetics
Swar-Chia-Plot-Manager
This is a Cross-Platform Plot Manager for Chia Plotting that is simple, easy-to-use, and reliable.
Stars: ✭ 1,310 (+6450%)
Mutual labels:  plot
BALSAMIC
Bioinformatic Analysis pipeLine for SomAtic Mutations In Cancer
Stars: ✭ 29 (+45%)
Mutual labels:  genomics
hickit
TAD calling, phase imputation, 3D modeling and more for diploid single-cell Hi-C (Dip-C) and general Hi-C
Stars: ✭ 79 (+295%)
Mutual labels:  genomics
mustache
Multi-scale Detection of Chromatin Loops from Hi-C and Micro-C Maps using Scale-Space Representation
Stars: ✭ 38 (+90%)
Mutual labels:  genomics
OpenPlot
一款开源的地理信息标绘组件
Stars: ✭ 19 (-5%)
Mutual labels:  plot
graphest
A faithful graphing calculator
Stars: ✭ 42 (+110%)
Mutual labels:  plot

region-plot - A tool to plot significant regions of GWAS

region-plot is a bioinformatics pipeline to plot significant regions found by genome-wide association studies (GWAS). It works on both Python 2 and 3 versions.

The following figure (low resolution) is an example from Tardif et al. (2015) (doi:10.1161/CIRCGENETICS.114.000663). The difference with the original figure is the inclusion of the annotation from the HAVANA project in this example.

Dependencies

The tool requires a standard Python installation with the following packages:

  1. numpy version 1.9.1 or latest
  2. pandas version 0.17.0 or latest
  3. six version 1.9.0 or latest
  4. matplotlib version 1.4.3 or latest
  5. gepyto version 0.9.2 or latest

The tool has been tested on Linux only, but should also work on both Mac OSX and Windows.

Usage

For Linux users, make sure that the script is executable (using the chmod command).

$ launch-region-plot --help
usage: launch-region-plot [-h] [-v] [--log-level {INFO,DEBUG}]
                          [--log-file LOGFILE] --assoc FILE --genotypes FILE
                          [--imputed-sites FILE] [--genotypes-format FORMAT]
                          [--keep FILE] [--significant FLOAT]
                          [--plot-p-lower FLOAT] [--snp-col COL]
                          [--chr-col COL] [--pos-col COL] [--p-col COL]
                          [--a1-col ALLELE] [--a2-col ALLELE] --genetic-map
                          FILE [--genetic-chr-col COL] [--genetic-pos-col COL]
                          [--genetic-rate-col COL] [--plot-format {png,pdf}]
                          [--build {GRCh37,GRCh38}] [--region-padding FLOAT]
                          [--whole-dataset] [--output-directory DIR]

Plots significant regions of GWAS (0.1.3).

optional arguments:
  -h, --help            show this help message and exit
  -v, --version         show program's version number and exit
  --log-level {INFO,DEBUG}
                        The logging level. [INFO]
  --log-file LOGFILE    The log file. [region-plot.log]

Input Files:
  --assoc FILE          The association file containing the hits.
  --genotypes FILE      The file containing the genotypes (available format
                        are VCF, IMPUTE2, BGEN or Plink binary files.
  --imputed-sites FILE  The file containing the imputed sites (if absent, all
                        points will have the same darkness).

Genotypes Options:
  --genotypes-format FORMAT
                        The genotype file format. If not specified, the tool
                        will try to guess the format and parse the file
                        accordingly.
  --keep FILE           The list of samples to keep for the LD calculation.

Association Options:
  --significant FLOAT   The significant association threshold. [<5.000000e-08]
  --plot-p-lower FLOAT  Plot markers with p lower than value. [<5.000000e-08]
  --snp-col COL         The name of the SNP column. [snp]
  --chr-col COL         The name of the chromosome column. [chr]
  --pos-col COL         The name of the pos column. [pos]
  --p-col COL           The name of the p-value column. [p]
  --a1-col ALLELE       The name of the column containing the first allele.
                        [minor]
  --a2-col ALLELE       The name of the column containing the second allele.
                        [major]

Genetic Map Options:
  --genetic-map FILE    The file containing the genetic map.
  --genetic-chr-col COL
                        The name of chromosome column for the genetic map.
                        [chromosome]
  --genetic-pos-col COL
                        The name of the position column for the genetic map.
                        [position]
  --genetic-rate-col COL
                        The name of the recombination rate column for the
                        genetic map. [rate]

Plot Options:
  --plot-format {png,pdf}
                        The format of the output file containing the plot
                        (might be 'png' or 'pdf'). [png]
  --build {GRCh37,GRCh38}
                        The build to search the overlapping genes. [GRCh37]
  --region-padding FLOAT
                        The amount of base pairs to pad the region (on each
                        side of the best hit. [500000.0]
  --whole-dataset       Plot all markers (no padding) (WARNING this might take
                        a lot of memory).

Output Options:
  --output-directory DIR
                        The output directory. [.]
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].