All Projects → hexylena → argparse2tool

hexylena / argparse2tool

Licence: Apache-2.0 License
transparently build CWL and Galaxy XML tool definitions for any script that uses argparse

Programming Languages

python
139335 projects - #7 most used programming language
HTML
75241 projects

Projects that are alternatives of or similar to argparse2tool

epiviz
EpiViz is a scientific information visualization tool for genetic and epigenetic data, used to aid in the exploration and understanding of correlations between various genome features.
Stars: ✭ 65 (+170.83%)
Mutual labels:  bioinformatics
motifmatchr
Fast motif matching in R
Stars: ✭ 25 (+4.17%)
Mutual labels:  bioinformatics
AMIDD
Introduction to Applied Mathematics and Informatics in Drug Discovery (AMIDD)
Stars: ✭ 13 (-45.83%)
Mutual labels:  bioinformatics
netSmooth
netSmooth: A Network smoothing based method for Single Cell RNA-seq imputation
Stars: ✭ 23 (-4.17%)
Mutual labels:  bioinformatics
react-msa-viewer
React rerelease of MSAViewer
Stars: ✭ 15 (-37.5%)
Mutual labels:  bioinformatics
bistro
A library to build and execute typed scientific workflows
Stars: ✭ 43 (+79.17%)
Mutual labels:  bioinformatics
perbase
Per-base per-nucleotide depth analysis
Stars: ✭ 46 (+91.67%)
Mutual labels:  bioinformatics
echolocatoR
Automated statistical and functional fine-mapping pipeline with extensive API access to datasets.
Stars: ✭ 13 (-45.83%)
Mutual labels:  bioinformatics
pathway-mapper
PathwayMapper: An interactive and collaborative graphical curation tool for cancer pathways
Stars: ✭ 47 (+95.83%)
Mutual labels:  bioinformatics
matam
Mapping-Assisted Targeted-Assembly for Metagenomics
Stars: ✭ 18 (-25%)
Mutual labels:  bioinformatics
block-aligner
SIMD-accelerated library for computing global and X-drop affine gap penalty sequence-to-sequence or sequence-to-profile alignments using an adaptive block-based algorithm.
Stars: ✭ 58 (+141.67%)
Mutual labels:  bioinformatics
awesome-genetics
A curated list of awesome bioinformatics software.
Stars: ✭ 60 (+150%)
Mutual labels:  bioinformatics
MSFragger
Ultrafast, comprehensive peptide identification for mass spectrometry–based proteomics
Stars: ✭ 43 (+79.17%)
Mutual labels:  bioinformatics
crazydoc
Read DNA sequences from colourful Microsoft Word documents
Stars: ✭ 18 (-25%)
Mutual labels:  bioinformatics
Introduction to the Unix Shell for biologists
Introduction to the Unix Shell for biologists
Stars: ✭ 16 (-33.33%)
Mutual labels:  bioinformatics
chromap
Fast alignment and preprocessing of chromatin profiles
Stars: ✭ 93 (+287.5%)
Mutual labels:  bioinformatics
calour
exploratory and interactive microbiome analyses based on heatmaps
Stars: ✭ 22 (-8.33%)
Mutual labels:  bioinformatics
picardmetrics
🚦 Run Picard on BAM files and collate 90 metrics into one file.
Stars: ✭ 38 (+58.33%)
Mutual labels:  bioinformatics
SigProfilerExtractor
SigProfilerExtractor allows de novo extraction of mutational signatures from data generated in a matrix format. The tool identifies the number of operative mutational signatures, their activities in each sample, and the probability for each signature to cause a specific mutation type in a cancer sample. The tool makes use of SigProfilerMatrixGen…
Stars: ✭ 86 (+258.33%)
Mutual labels:  bioinformatics
polyRAD
Genotype Calling with Uncertainty from Sequencing Data in Polyploids 🍌🍓🥔🍠🥝
Stars: ✭ 16 (-33.33%)
Mutual labels:  bioinformatics

argparse2tool

PyPI Build Status

This project aims to provide a drop-in replacement for argparse which allows generating Galaxy XML and CWL Tools.

It is quite literally a drop-in replacement. You (or the upstream tool author) can use argparse completely as normal

import argparse

When this package is installed and PYTHONPATH=$(argparse2tool) is set the dropin replacement of argparse is used ahead of the system argparse, argparse2tool will capture all argparse function calls, and process them specially.

This information captured in this process is used to produce Galaxy Tool XML when it's requested with the --generate_galaxy_xml flag, or CWL Tools when requested with the --generate_cwl_tool flag.

For our example python script you can see the generated Galaxy XML and CWL Tools.

Running

To generate XML or CWL, run your tool with the appropriate command line flag

$ PYTHONPATH=$(argparse2tool) <tool command> --generate_galaxy_xml <other options> > tool.xml
$ PYTHONPATH=$(argparse2tool) <tool command> --generate_cwl_tool <other options> > tool.cwl

The project includes a sample example.py file which uses as many argparse features as possible. CWL and Galaxy XML support different portions feature sets which will be visible in the generated outputs.

$ PYTHONPATH=$(argparse2tool) python example.py --generate_galaxy_xml
$ PYTHONPATH=$(argparse2tool) python example.py --generate_cwl_tool

CWL Specific Functionality

Example for CNVkit toolkit

$ cnvkit.py batch --generate_cwl_tool -d ~/cnvkit-tools/ --generate_outputs

If there are subcommands in the provided command, all possible tools will be generated, for instance, for CNVkit

$ cnvkit.py --generate_cwl_tool

will produce CWL tool descriptions for cnvkit.py batch, cnvkit.py access, cnvkit.py export bed, cnvkit.py export cdt and all other subcommands.

Other options (which work only with --generate_cwl_tool provided, except for help message) are:

  • -o FILENAME, --output_section FILENAME: File with manually filled output section which is put to a formed CWL tool. argparse2tool is not very good at generating outputs, it recognizes output files only if they have type argparse.FileType('w'), so output section is often blank and should be filled manually.

  • -go, --generate_outputs: flag for generating outputs not only from arguments that are instances of argparse.FileType('w'), but also from every argument which contains output keyword in its name. For instance, argument --output-file with no type will also be placed to output section. However, '--output-directory' argument will also be treated like File, so generated tools must be checked carefully if when this option is selected.

  • -b, basecommand: command which appears in basecommand field in a resulting tool. It is handy to use this option when you run tool with shebang, but want python to be in basecommand field and the file amidst arguments. Example:

    $ .search.py --generate_cwl_tool -b python.

Basecommand of the formed tool will be ['python'], and search will be a positional argument on position 0.

  • -d, --directory: directory for storing tool descriptions.
  • --help_arg2cwl: prints this help message.

How it works

Internally, argparse2tool, masquerading as argparse attempts to find and import the real argparse. It then stores a reference to the code module for the system argparse, and presents the user with all of the functions that stdlib's argparse provides. Every function call is passed through the system argparse. However, argparse2tool captures the details of those calls and when Tool XML or CWL is requested, it builds up the tool definition and prints it out to standard output.

Examples

You can see the example.py file for an example with numerous types of arguments and options that you might see in real tools. Accordingly there is an example.xml file with the output.

It doesn't work!!

If you are not able to use the --generate_galaxy_xml/--generate_cwl_tool flags after installing, it is probably because of module load order. argparse2tool must precede argparse in the path.

To easily correct this, run the tool argparse2tool which is installed as part of this package. Correctly functioning paths will produce the following:

$ argparse2tool
PATH_TO_THE_DROPINS

while incorrectly ordered paths will produce a helpful error message:

$ argparse2tool
no dropins dir...

This is intended to be used inline:

user@host:$ PYTHONPATH=$(argparse2tool) python my_script.py --generate_galaxy_xml

Limitations

This code doesn't cover the entirety of the argparse API yet, and there are some bugs to work out on the XML generation side:

  • argparse
    • groups not supported (in galaxy, everything should still work in argparse)
    • some features like templating of the version string (please submit bugs)
  • galaxyxml
    • bugs in conditionals/whens (probably)
  • argparse2tool Galaxy XML Output
    • support declaring output files in an argparse-esque manner
  • argparse2tool CWL Output
    • Some of argparse features can not be ported to CWL.
      1. nargs=N. Number of arguments can not be specified in CWL (yet).
      2. const argument of add_argument(). All constants must be specified in job files.
      3. Custom types and custom actions are not supported.
      4. Argument groups don't work in CWL as arguments are sorted with a special algorithm
      5. Mutual exclusion is not supported.

License

Apache License, v2

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].