SquigglekitSquiggleKit: A toolkit for manipulating nanopore signal data
EdamontologyEDAM is an ontology of bioinformatics types of data including identifiers, data formats, operations and topics.
GoenrichGO enrichment with python -- pandas meets networkx
SvtyperBayesian genotyper for structural variants
Mygene.infoMyGene.info: A BioThings API for gene annotations
SibeliazA fast whole-genome aligner based on de Bruijn graphs
Fastq.bioAn interactive web tool for quality control of DNA sequencing data
OswitchProvides access to complex Bioinformatics software (even BioLinux!) in just one command.
PlassProtein-Level ASSembler (PLASS): sensitive and precise protein assembler
FlowrRobust and efficient workflows using a simple language agnostic approach
StartappThe START App: R Shiny Transcriptome Analysis Resource Tool
BgtFlexible genotype query among 30,000+ samples whole-genome
Coursera SpecializationsSolutions to assignments of Coursera Specializations - Deep learning, Machine learning, Algorithms & Data Structures, Image Processing and Python For Everybody
Bcalmcompacted de Bruijn graph construction in low memory
ChargerCharacterization of Germline variants
Arcs🌈Scaffold genome sequence assemblies using linked read sequencing data
GubbinsRapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using Gubbins
GramtoolsGenome inference from a population reference graph
LambdaLAMBDA – the Local Aligner for Massive Biological DatA
Dna NnModel and predict short DNA sequence features with neural networks
Pairix1D/2D indexing and querying on bgzipped text file with a pair of genomic coordinates
Cwl SvgA library for generating an interactive SVG visualization of CWL workflows
Sv2Support Vector Structural Variation Genotyper
EmperorEmperor a tool for the analysis and visualization of large microbial ecology datasets
YacrdYet Another Chimeric Read Detector
DramDistilled and Refined Annotation of Metabolism: A tool for the annotation and curation of function for microbial and viral genomes
SnsAnalysis pipelines for sequencing data
LigerLightweight Iterative Gene set Enrichment in R
VerifybamidVerifyBamID2: A robust tool for DNA contamination estimation from sequence reads using ancestry-agnostic method.
Awesome VdjTools and databases for analyzing HLA and VDJ genes.
SinglecellhaystackFinding surprising needles (=genes) in haystacks (=single cell transcriptome data).
GatkOfficial code repository for GATK versions 4 and up
MigmapHTS-compatible wrapper for IgBlast V-(D)-J mapping tool
UtaUniversal Transcript Archive: comprehensive genome-transcript alignments; multiple transcript sources, versions, and alignment methods; available as a docker image
EtrfExact Tandem Repeat Finder (not a TRF replacement)
GenevalidatorGeneValidator: Identify problems with predicted genes
BwaBurrow-Wheeler Aligner for short-read alignment (see minimap2 for long-read alignment)
Metasra PipelineMetaSRA: normalized sample-specific metadata for the Sequence Read Archive
FastpAn ultra-fast all-in-one FASTQ preprocessor (QC/adapters/trimming/filtering/splitting/merging...)
ProtrComprehensive toolkit for generating various numerical features of protein sequences
Sv CallersSnakemake-based workflow for detecting structural variants in WGS data
RasusaRandomly subsample sequencing reads to a specified coverage
Sevenbridges RSeven Bridges API Client, CWL Schema, Meta Schema, and SDK Helper in R
VdjvizA lightweight immune repertoire browser
Minimap2A versatile pairwise aligner for genomic and spliced nucleotide sequences
Uncurl pythonUNCURL is a tool for single cell RNA-seq data analysis.
ScanpySingle-Cell Analysis in Python. Scales to >1M cells.
ScispacyA full spaCy pipeline and models for scientific/biomedical documents.
NonpareilEstimate metagenomic coverage and sequence diversity