HgvsPython library to parse, format, validate, normalize, and map sequence variants. `pip install hgvs`
ArtemisArtemis is a free genome viewer and annotation tool that allows visualization of sequence features and the results of analyses within the context of the sequence, and its six-frame translation
HifiasmHifiasm: a haplotype-resolved assembler for accurate Hifi reads
Awesome Single CellCommunity-curated list of software packages and data resources for single-cell, including RNA-seq, ATAC-seq, etc.
OctopusBayesian haplotype-based mutation calling
MrbayesMrBayes is a program for Bayesian inference and model choice across a wide range of phylogenetic and evolutionary models. For documentation and downloading the program, please see the home page:
BiotiteA comprehensive library for computational molecular biology
Hts Nimnim wrapper for htslib for parsing genomics data files
ReadfqFast multi-line FASTA/Q reader in several programming languages
SplatterSimple simulation of single-cell RNA sequencing data
Somalierfast sample-swap and relatedness checks on BAMs/CRAMs/VCFs/GVCFs... "like damn that is one smart wine guy"
PlipProtein-Ligand Interaction Profiler - Analyze and visualize non-covalent protein-ligand interactions in PDB files according to 📝 Salentin et al. (2015), https://www.doi.org/10.1093/nar/gkv315
SarekDetect germline or somatic variants from normal or tumour/normal whole-genome or targeted sequencing
DeepecgECG classification programs based on ML/DL methods
Krakenuniq🐙 KrakenUniq: Metagenomics classifier with unique k-mer counting for more specific results
Kmer CntCode examples of fast and simple k-mer counters for tutorial purposes
ScgenSingle cell perturbation prediction
CirclatorA tool to circularize genome assemblies
BlacklistApplication for making ENCODE Blacklists
HicexplorerHiCExplorer is a powerful and easy to use set of tools to process, normalize and visualize Hi-C data.
Dna2vecdna2vec: Consistent vector representations of variable-length k-mers
NglessNGLess: NGS with less work
Apbs Pdb2pqrAPBS - software for biomolecular electrostatics and solvation
CoolerA cool place to store your Hi-C
FqtoolsAn efficient FASTQ manipulation suite
Bio4jBio4j abstract model and general entry point to the project
UgeneUGENE is free open-source cross-platform bioinformatics software
BioconvertBioconvert is a collaborative project to facilitate the interconversion of life science data from one format to another.
BiofastBenchmarking programming languages/implementations for common tasks in Bioinformatics
PyaniPython module for average nucleotide identity analyses
CgrangesA C/C++ library for fast interval overlap queries (with a "bedtools coverage" example)
PegasusPegasus Workflow Management System - Automate, recover, and debug scientific computations.
TaxonkitA Practical and Efficient NCBI Taxonomy Toolkit
SortmernaSortMeRNA: next-generation sequence filtering and alignment tool
IndraINDRA (Integrated Network and Dynamical Reasoning Assembler) is an automated model assembly system interfacing with NLP systems and databases to collect knowledge, and through a process of assembly, produce causal graphs and dynamical models.
BedtkA simple toolset for BED files (warning: CLI may change before bedtk becomes stable)
GenomicsA collection of scripts and notes related to genomics and bioinformatics
PymzmlpymzML - an interface between Python and mzML Mass spectrometry Files
SmudgeplotInference of ploidy and heterozygosity structure using whole genome sequencing data
BionitioDemonstrating best practices for bioinformatics command line tools
AribaAntimicrobial Resistance Identification By Assembly
Dnachisel✏️ A versatile DNA sequence optimizer
NextflowA DSL for data-driven computational pipelines
FastqtFastQC port to Qt5: A quality control tool for high throughput sequence data.
RiddleRace and ethnicity Imputation from Disease history with Deep LEarning
BioBioinformatics library for .NET
SwarmA robust and fast clustering method for amplicon-based studies
MolgenisMOLGENIS - for scientific data: management, exploration, integration and analysis.
DecontamSimple statistical identification and removal of contaminants in marker-gene and metagenomics sequencing data
ClusterflowA pipelining tool to automate and standardise bioinformatics analyses on cluster environments.
VdjtoolsPost-analysis of immune repertoire sequencing data
TruvariStructural variant toolkit for VCFs