All Categories → Data Processing → bioinformatics

Top 704 bioinformatics open source projects

Awesome 10x Genomics
List of tools and resources related to the 10x Genomics GEMCode/Chromium system
Squigglekit
SquiggleKit: A toolkit for manipulating nanopore signal data
Edamontology
EDAM is an ontology of bioinformatics types of data including identifiers, data formats, operations and topics.
Goenrich
GO enrichment with python -- pandas meets networkx
Svtyper
Bayesian genotyper for structural variants
Mygene.info
MyGene.info: A BioThings API for gene annotations
Biosequences.jl
Biological sequences for the julia language
Sibeliaz
A fast whole-genome aligner based on de Bruijn graphs
Fastq.bio
An interactive web tool for quality control of DNA sequencing data
Oswitch
Provides access to complex Bioinformatics software (even BioLinux!) in just one command.
Plass
Protein-Level ASSembler (PLASS): sensitive and precise protein assembler
Flowr
Robust and efficient workflows using a simple language agnostic approach
Startapp
The START App: R Shiny Transcriptome Analysis Resource Tool
Bgt
Flexible genotype query among 30,000+ samples whole-genome
Coursera Specializations
Solutions to assignments of Coursera Specializations - Deep learning, Machine learning, Algorithms & Data Structures, Image Processing and Python For Everybody
Awesome Expression Browser
😎 A curated list of software and resources for exploring and visualizing (browsing) expression data 😎
Globalbioticinteractions
Global Biotic Interactions provides access to existing species interaction datasets
Bcalm
compacted de Bruijn graph construction in low memory
Charger
Characterization of Germline variants
Arcs
🌈Scaffold genome sequence assemblies using linked read sequencing data
Gubbins
Rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using Gubbins
Gramtools
Genome inference from a population reference graph
Terpene Profile Parser For Cannabis Strains
Parser and database to index the terpene profile of different strains of Cannabis from online databases
Lambda
LAMBDA – the Local Aligner for Massive Biological DatA
Qiime16stutorial
A tutorial on methods of 16S analysis with QIIME 1
Dna Nn
Model and predict short DNA sequence features with neural networks
Pairix
1D/2D indexing and querying on bgzipped text file with a pair of genomic coordinates
Cwl Svg
A library for generating an interactive SVG visualization of CWL workflows
Sv2
Support Vector Structural Variation Genotyper
Emperor
Emperor a tool for the analysis and visualization of large microbial ecology datasets
Yacrd
Yet Another Chimeric Read Detector
Dram
Distilled and Refined Annotation of Metabolism: A tool for the annotation and curation of function for microbial and viral genomes
Sns
Analysis pipelines for sequencing data
Liger
Lightweight Iterative Gene set Enrichment in R
Verifybamid
VerifyBamID2: A robust tool for DNA contamination estimation from sequence reads using ancestry-agnostic method.
Awesome Vdj
Tools and databases for analyzing HLA and VDJ genes.
Singlecellhaystack
Finding surprising needles (=genes) in haystacks (=single cell transcriptome data).
Gatk
Official code repository for GATK versions 4 and up
Migmap
HTS-compatible wrapper for IgBlast V-(D)-J mapping tool
Uta
Universal Transcript Archive: comprehensive genome-transcript alignments; multiple transcript sources, versions, and alignment methods; available as a docker image
Locuszoom Standalone
Create regional association plots from GWAS or meta-analysis
Etrf
Exact Tandem Repeat Finder (not a TRF replacement)
Genevalidator
GeneValidator: Identify problems with predicted genes
Bwa
Burrow-Wheeler Aligner for short-read alignment (see minimap2 for long-read alignment)
Metasra Pipeline
MetaSRA: normalized sample-specific metadata for the Sequence Read Archive
Fastp
An ultra-fast all-in-one FASTQ preprocessor (QC/adapters/trimming/filtering/splitting/merging...)
Protr
Comprehensive toolkit for generating various numerical features of protein sequences
Cytometry Clustering Comparison
R scripts to reproduce analyses in our paper comparing clustering methods for high-dimensional cytometry data
Sv Callers
Snakemake-based workflow for detecting structural variants in WGS data
Rasusa
Randomly subsample sequencing reads to a specified coverage
Sevenbridges R
Seven Bridges API Client, CWL Schema, Meta Schema, and SDK Helper in R
Vdjviz
A lightweight immune repertoire browser
Minimap2
A versatile pairwise aligner for genomic and spliced nucleotide sequences
Awesome Sequencing Tech Papers
A collection of publications on comparison of high-throughput sequencing technologies.
Uncurl python
UNCURL is a tool for single cell RNA-seq data analysis.
Scanpy
Single-Cell Analysis in Python. Scales to >1M cells.
Scispacy
A full spaCy pipeline and models for scientific/biomedical documents.
Nonpareil
Estimate metagenomic coverage and sequence diversity
121-180 of 704 bioinformatics projects