All Categories → Data Processing → bioinformatics

Top 704 bioinformatics open source projects

Parallel Fastq Dump
parallel fastq-dump wrapper
Hgvs
Python library to parse, format, validate, normalize, and map sequence variants. `pip install hgvs`
Artemis
Artemis is a free genome viewer and annotation tool that allows visualization of sequence features and the results of analyses within the context of the sequence, and its six-frame translation
Hifiasm
Hifiasm: a haplotype-resolved assembler for accurate Hifi reads
Octopus
Bayesian haplotype-based mutation calling
Mrbayes
MrBayes is a program for Bayesian inference and model choice across a wide range of phylogenetic and evolutionary models. For documentation and downloading the program, please see the home page:
Biotite
A comprehensive library for computational molecular biology
Hts Nim
nim wrapper for htslib for parsing genomics data files
Readfq
Fast multi-line FASTA/Q reader in several programming languages
Splatter
Simple simulation of single-cell RNA sequencing data
Somalier
fast sample-swap and relatedness checks on BAMs/CRAMs/VCFs/GVCFs... "like damn that is one smart wine guy"
Plip
Protein-Ligand Interaction Profiler - Analyze and visualize non-covalent protein-ligand interactions in PDB files according to 📝 Salentin et al. (2015), https://www.doi.org/10.1093/nar/gkv315
Sarek
Detect germline or somatic variants from normal or tumour/normal whole-genome or targeted sequencing
Krakenuniq
🐙 KrakenUniq: Metagenomics classifier with unique k-mer counting for more specific results
Kmer Cnt
Code examples of fast and simple k-mer counters for tutorial purposes
Scgen
Single cell perturbation prediction
Circlator
A tool to circularize genome assemblies
Blacklist
Application for making ENCODE Blacklists
Hicexplorer
HiCExplorer is a powerful and easy to use set of tools to process, normalize and visualize Hi-C data.
Dna2vec
dna2vec: Consistent vector representations of variable-length k-mers
Ngless
NGLess: NGS with less work
Apbs Pdb2pqr
APBS - software for biomolecular electrostatics and solvation
Cooler
A cool place to store your Hi-C
Fqtools
An efficient FASTQ manipulation suite
Bio4j
Bio4j abstract model and general entry point to the project
Ugene
UGENE is free open-source cross-platform bioinformatics software
Bioconvert
Bioconvert is a collaborative project to facilitate the interconversion of life science data from one format to another.
Biofast
Benchmarking programming languages/implementations for common tasks in Bioinformatics
Pyani
Python module for average nucleotide identity analyses
Cgranges
A C/C++ library for fast interval overlap queries (with a "bedtools coverage" example)
Pegasus
Pegasus Workflow Management System - Automate, recover, and debug scientific computations.
Taxonkit
A Practical and Efficient NCBI Taxonomy Toolkit
Sortmerna
SortMeRNA: next-generation sequence filtering and alignment tool
Indra
INDRA (Integrated Network and Dynamical Reasoning Assembler) is an automated model assembly system interfacing with NLP systems and databases to collect knowledge, and through a process of assembly, produce causal graphs and dynamical models.
Bedtk
A simple toolset for BED files (warning: CLI may change before bedtk becomes stable)
Genomics
A collection of scripts and notes related to genomics and bioinformatics
Pymzml
pymzML - an interface between Python and mzML Mass spectrometry Files
Smudgeplot
Inference of ploidy and heterozygosity structure using whole genome sequencing data
Bionitio
Demonstrating best practices for bioinformatics command line tools
Ariba
Antimicrobial Resistance Identification By Assembly
Dnachisel
✏️ A versatile DNA sequence optimizer
Gcp For Bioinformatics
GCP Essentials for Bioinformatics Researchers
Fastqt
FastQC port to Qt5: A quality control tool for high throughput sequence data.
Riddle
Race and ethnicity Imputation from Disease history with Deep LEarning
Bio
Bioinformatics library for .NET
Swarm
A robust and fast clustering method for amplicon-based studies
Molgenis
MOLGENIS - for scientific data: management, exploration, integration and analysis.
Decontam
Simple statistical identification and removal of contaminants in marker-gene and metagenomics sequencing data
Awesome Bioinformatics
A curated list of awesome Bioinformatics libraries and software.
Clusterflow
A pipelining tool to automate and standardise bioinformatics analyses on cluster environments.
Vdjtools
Post-analysis of immune repertoire sequencing data
Obofoundry.github.io
Metadata and website for the Open Bio Ontologies Foundry Ontology Registry
Bioinformatics Workbook
Bioinformatics Workbook repository
Truvari
Structural variant toolkit for VCFs
Bioconda Recipes
Conda recipes for the bioconda channel.
61-120 of 704 bioinformatics projects