All Projects → hallamlab → TreeSAPP

hallamlab / TreeSAPP

Licence: GPL-3.0 license
A Python package for gene-centric taxonomic and functional classification using phylogenetic placement

Programming Languages

python
139335 projects - #7 most used programming language
r
7636 projects
C++
36643 projects - #6 most used programming language
UnrealScript
20 projects
shell
77523 projects
Dockerfile
14818 projects

Projects that are alternatives of or similar to TreeSAPP

get phylomarkers
A pipeline to select optimal markers for microbial phylogenomics and species tree estimation using coalescent and concatenation approaches
Stars: ✭ 34 (+70%)
Mutual labels:  phylogenetics
CAFE5
Version 5 of the CAFE phylogenetics software
Stars: ✭ 53 (+165%)
Mutual labels:  phylogenetics
FAVITES
FAVITES (FrAmework for VIral Transmission and Evolution Simulation)
Stars: ✭ 33 (+65%)
Mutual labels:  phylogenetics
pastml
Ancestor character reconstruction and visualisation for rooted phylogenetic trees
Stars: ✭ 15 (-25%)
Mutual labels:  phylogenetics
PyPHLAWD
Python version of PHLAWD
Stars: ✭ 16 (-20%)
Mutual labels:  phylogenetics
PhyloTrees.jl
Phylogenetic trees in Julia
Stars: ✭ 15 (-25%)
Mutual labels:  phylogenetics
RNeXML
Implementing semantically rich NeXML I/O in R
Stars: ✭ 12 (-40%)
Mutual labels:  phylogenetics
CAFE
Analyze changes in gene family size and provide a statistical foundation for evolutionary inferences.
Stars: ✭ 88 (+340%)
Mutual labels:  phylogenetics
genesis
A library for working with phylogenetic and population genetic data.
Stars: ✭ 43 (+115%)
Mutual labels:  phylogenetics
Biopython
Official git repository for Biopython (originally converted from CVS)
Stars: ✭ 2,936 (+14580%)
Mutual labels:  phylogenetics
seqotron
Alignment editor for Mac OS X
Stars: ✭ 26 (+30%)
Mutual labels:  phylogenetics
phylogenetics-class
A course in the theory and practice of phylogenetic inference from DNA sequence data
Stars: ✭ 16 (-20%)
Mutual labels:  phylogenetics
archaeopteryx-js
Archaeopteryx.js is a software tool for the visualization and analysis of highly annotated phylogenetic trees.
Stars: ✭ 27 (+35%)
Mutual labels:  phylogenetics
BuddySuite
Bioinformatics toolkits for manipulating sequence, alignment, and phylogenetic tree files
Stars: ✭ 106 (+430%)
Mutual labels:  phylogenetics
taxonium
A tool for exploring very large trees in the browser
Stars: ✭ 65 (+225%)
Mutual labels:  phylogenetics
baltic
baltic - backronymed adaptable lightweight tree import code for molecular phylogeny manipulation, analysis and visualisation. Development is back on the evogytis/baltic branch (i.e. here).
Stars: ✭ 60 (+200%)
Mutual labels:  phylogenetics
PCG
𝙋𝙝𝙮𝙡𝙤𝙜𝙚𝙣𝙚𝙩𝙞𝙘 𝘾𝙤𝙢𝙥𝙤𝙣𝙚𝙣𝙩 𝙂𝙧𝙖𝙥𝙝 ⸺ Haskell program and libraries for general phylogenetic graph search
Stars: ✭ 20 (+0%)
Mutual labels:  phylogenetics
bac-genomics-scripts
Collection of scripts for bacterial genomics
Stars: ✭ 39 (+95%)
Mutual labels:  microbial-genomics
traitar
From genomes to phenotypes: Traitar, the microbial trait analyzer
Stars: ✭ 41 (+105%)
Mutual labels:  microbial-genomics
libpll
Phylogenetic Likelihood Library
Stars: ✭ 21 (+5%)
Mutual labels:  phylogenetics

TreeSAPP: Tree-based Sensitive and Accurate Phylogenetic Profiler

tests PyPI version Anaconda-Server Badge Anaconda-Server Badge Docker Repository on Quay

Codacy Badge install with bioconda Python version codecov Anaconda-Server Badge

Overview

TreeSAPP is a python package for functional and taxonomic annotation of proteins from genomes and metagenomes using phylogenetic placement.

Quick start

We recommend installing TreeSAPP into its own conda environment with the following command:

conda create -n treesapp_cenv -c bioconda -c conda-forge treesapp
conda activate treesapp_cenv

To list all the sub-commands run treesapp.

To test the assign workflow, run:

treesapp assign -i TreeSAPP/tests/test_data/marker_test_suite.faa -m prot --trim_align -o assign_test -t McrA,DsrAB

To classify sequences in your genome of interest:

treesapp assign -i my.fasta -o ~/path/to/output/directory/

TreeSAPP comes installed with 33 reference packages involved in a variety of biogeochemical and cellular processes. We also have many more reference packages available on our RefPkgs repository and you can view the complete list here.

Tutorials

All of our tutorials are available on the GitHub wiki page. Here are some specific tutorial examples:

If we do not yet have a reference package for a gene you are interested in, please try building a new reference package. Of course, if you run into any problems or would like to collaborate on building many reference packages don't hesitate to email us or create a new issue with an 'enhancement' label.

To determine whether the sequences used to build your new reference package are what you think they are, and whether it might unexpectedly annotate homologous sequences, see the purity tutorial.

If you are working with a particularly complex reference package, from an orthologous group for example, or have extra phylogenetic information you'd like to include in your classifications, try annotating extra features with treesapp layer.

Citation

If you found TreeSAPP useful in your work, please cite the following paper:

Morgan-Lang, C., McLaughlin, R., Armstrong, Z., Zhang, G., Chan, K., & Hallam, S. J. (2020). TreeSAPP: The Tree-based Sensitive and Accurate Phylogenetic Profiler. Bioinformatics, 1–8.

This was brought to you by the team:

References

If you're feeling extra citation-happy, please consider citing the following works as well:

  • Eddy, S. R. (1998). Profile hidden Markov models. Bioinformatics (Oxford, England), 14(9), 755–763.
  • Criscuolo, A., & Gribaldo, S. (2010). BMGE (Block Mapping and Gathering with Entropy): A new software for selection of phylogenetic informative regions from multiple sequence alignments. BMC Evolutionary Biology, 10(1).
  • Kozlov, A. M., Darriba, D., Flouri, T., Morel, B., & Stamatakis, A. (2019). RAxML-NG: a fast, scalable and user-friendly tool for maximum likelihood phylogenetic inference. Bioinformatics, 35(21), 4453–4455.
  • Barbera, P., Kozlov, A. M., Czech, L., Morel, B., & Stamatakis, A. (2018). EPA-ng: Massively Parallel Evolutionary Placement of Genetic Sequences. Systematic Biology, 0(0), 291658.
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].