All Categories → Data Processing → bioinformatics

Top 704 bioinformatics open source projects

dnaio
Read and write FASTQ and FASTA efficiently from Python
go4bio
Golang for Bioinformatics
simplesam
Simple pure Python SAM parser and objects for working with SAM records
wdlRunR
Elastic, reproducible, and reusable genomic data science tools from R backed by cloud resources
compomics-utilities
Open source Java library for computational proteomics
pyrpipe
Reproducible bioinformatics pipelines in python. Import any Unix tool/command in python.
tibanna
Tibanna helps you run your genomic pipelines on Amazon cloud (AWS). It is used by the 4DN DCIC (4D Nucleome Data Coordination and Integration Center) to process data. Tibanna supports CWL/WDL (w/ docker), Snakemake (w/ conda) and custom Docker/shell command.
NeuroSEED
Implementation of Neural Distance Embeddings for Biological Sequences (NeuroSEED) in PyTorch (NeurIPS 2021)
hotsub
Command line tool to run batch jobs concurrently with ETL framework on AWS or other cloud computing resources
Binning refiner
Improving genome bins through the combination of different binning programs
admixr
An R package for reproducible and automated ADMIXTOOLS analyses
codon-usage-tables
📊 Codon usage tables in code-friendly format + Python bindings
paccmann datasets
pytoda - PaccMann PyTorch Dataset Classes. Read the docs: https://paccmann.github.io/paccmann_datasets/
rkmh
Classify sequencing reads using MinHash.
sirius
SIRIUS is a software for discovering a landscape of de-novo identification of metabolites using tandem mass spectrometry. This repository contains the code of the SIRIUS Software (GUI and CLI)
awesome-phages
A curated list of phage related software and computational resources for phage scientists, bioinformaticians and enthusiasts.
dysgu
dysgu-SV is a collection of tools for calling structural variants using short or long reads
protwis
Protwis is the backbone of the GPCRdb. The GPCRdb contains reference data, interactive visualisation and experiment design tools for G protein-coupled receptors (GPCRs).
sample-sheet
A permissively licensed library designed to replace Illumina's Experiment Manager
gnparser
GNparser normalises scientific names and extracts their semantic elements.
unimap
A EXPERIMENTAL fork of minimap2 optimized for assembly-to-reference alignment
PrimerMiner
R mased batch sequence downloader, with primer development and in silico evaluation capabilities
wgs2ncbi
Toolkit for preparing genomes for submission to NCBI
Scaff10X
Pipeline for scaffolding and breaking a genome assembly using 10x genomics linked-reads
jgi-query
A simple command-line tool to download data from Joint Genome Institute databases
gcMapExplorer
Genome Contact Map Explorer - gcMapExplorer. Visit:
klib.nim
Experimental getopt, gzip reader, FASTA/Q parser and interval queries in nim-lang
crimson
Bioinformatics tool outputs converter to JSON or YAML
ribotricer
A tool for accurately detecting actively translating ORFs from Ribo-seq data
CellBench
R package for benchmarking single cell analysis methods
jannovar
Annotation of VCF variants with functional impact and from databases (executable+library)
NGI-RNAseq
Nextflow RNA-Seq Best Practice analysis pipeline, used at the SciLifeLab National Genomics Infrastructure.
IsoQuant
Reference-based transcript discovery from long RNA read
SeqVec
Modelling the Language of Life - Deep Learning Protein Sequences
xpclr
Code to compute the XP-CLR statistic to infer natural selection
READemption
A pipeline for the computational evaluation of RNA-Seq data
361-420 of 704 bioinformatics projects