dnaioRead and write FASTQ and FASTA efficiently from Python
saffrontreeSaffronTree: Reference free rapid phylogenetic tree construction from raw read data
ntHashFast hash function for DNA sequences
go4bioGolang for Bioinformatics
simplesamSimple pure Python SAM parser and objects for working with SAM records
RNArtistCoreA Kotlin DSL and library to create and plot RNA 2D structures
wdlRunRElastic, reproducible, and reusable genomic data science tools from R backed by cloud resources
pyrpipeReproducible bioinformatics pipelines in python. Import any Unix tool/command in python.
tibannaTibanna helps you run your genomic pipelines on Amazon cloud (AWS). It is used by the 4DN DCIC (4D Nucleome Data Coordination and Integration Center) to process data. Tibanna supports CWL/WDL (w/ docker), Snakemake (w/ conda) and custom Docker/shell command.
NeuroSEEDImplementation of Neural Distance Embeddings for Biological Sequences (NeuroSEED) in PyTorch (NeurIPS 2021)
hotsubCommand line tool to run batch jobs concurrently with ETL framework on AWS or other cloud computing resources
slamdunkStreamlining SLAM-seq analysis with ultra-high sensitivity
Binning refinerImproving genome bins through the combination of different binning programs
admixrAn R package for reproducible and automated ADMIXTOOLS analyses
paccmann datasetspytoda - PaccMann PyTorch Dataset Classes. Read the docs: https://paccmann.github.io/paccmann_datasets/
rkmhClassify sequencing reads using MinHash.
siriusSIRIUS is a software for discovering a landscape of de-novo identification of metabolites using tandem mass spectrometry. This repository contains the code of the SIRIUS Software (GUI and CLI)
GRAFIMOGRAph-based Finding of Individual Motif Occurrences
awesome-phagesA curated list of phage related software and computational resources for phage scientists, bioinformaticians and enthusiasts.
dysgudysgu-SV is a collection of tools for calling structural variants using short or long reads
protwisProtwis is the backbone of the GPCRdb. The GPCRdb contains reference data, interactive visualisation and experiment design tools for G protein-coupled receptors (GPCRs).
BridgeDbThe BridgeDb Library source code
sample-sheetA permissively licensed library designed to replace Illumina's Experiment Manager
gnparserGNparser normalises scientific names and extracts their semantic elements.
TeamTeriGenomics using open source tools, running on GCP or AWS
unimapA EXPERIMENTAL fork of minimap2 optimized for assembly-to-reference alignment
PrimerMinerR mased batch sequence downloader, with primer development and in silico evaluation capabilities
wgs2ncbiToolkit for preparing genomes for submission to NCBI
PHATPathogen-Host Analysis Tool - A modern Next-Generation Sequencing (NGS) analysis platform
CANDOComputational Analysis of Novel Drug Opportunities
Scaff10XPipeline for scaffolding and breaking a genome assembly using 10x genomics linked-reads
jgi-queryA simple command-line tool to download data from Joint Genome Institute databases
paladinProtein Alignment and Detection Interface
klib.nimExperimental getopt, gzip reader, FASTA/Q parser and interval queries in nim-lang
crimsonBioinformatics tool outputs converter to JSON or YAML
open-cravatA modular annotation tool for genomic variants
ribotricerA tool for accurately detecting actively translating ORFs from Ribo-seq data
propy3A Python 3 version of the protein descriptor package propy
CellBenchR package for benchmarking single cell analysis methods
jannovarAnnotation of VCF variants with functional impact and from databases (executable+library)
NGI-RNAseqNextflow RNA-Seq Best Practice analysis pipeline, used at the SciLifeLab National Genomics Infrastructure.
cath-toolsProtein structure comparison tools such as SSAP and SNAP
IsoQuantReference-based transcript discovery from long RNA read
polyA Go package for engineering organisms.
dentistClose assembly gaps using long-reads at high accuracy.
SeqVecModelling the Language of Life - Deep Learning Protein Sequences
fucFrequently used commands in bioinformatics
xpclrCode to compute the XP-CLR statistic to infer natural selection
READemptionA pipeline for the computational evaluation of RNA-Seq data