SeqtkToolkit for processing sequences in FASTA/Q formats
Stars: ✭ 799 (+4105.26%)
Rnaseq WorkflowA repository for setting up a RNAseq workflow
Stars: ✭ 170 (+794.74%)
HailScalable genomic data analysis.
Stars: ✭ 706 (+3615.79%)
CromwellScientific workflow engine designed for simplicity & scalability. Trivially transition between one off use cases to massive scale production environments
Stars: ✭ 655 (+3347.37%)
FgbioTools for working with genomic and high throughput sequencing data.
Stars: ✭ 166 (+773.68%)
KhmerIn-memory nucleotide sequence k-mer counting, filtering, graph traversal and more
Stars: ✭ 640 (+3268.42%)
open-cravatA modular annotation tool for genomic variants
Stars: ✭ 74 (+289.47%)
RagooFast Reference-Guided Scaffolding of Genome Assembly Contigs. RagTag, the successor to RaGOO, is now available here: https://github.com/malonge/RagTag
Stars: ✭ 158 (+731.58%)
Cs Video CoursesList of Computer Science courses with video lectures.
Stars: ✭ 27,209 (+143105.26%)
OpenGene.jl(No maintenance) OpenGene, core libraries for NGS data analysis and bioinformatics in Julia
Stars: ✭ 60 (+215.79%)
BioawkBWK awk modified for biological data
Stars: ✭ 462 (+2331.58%)
candockA time series signal analysis and classification framework
Stars: ✭ 56 (+194.74%)
DeeptoolsTools to process and analyze deep sequencing data.
Stars: ✭ 448 (+2257.89%)
BiograknBioGrakn Knowledge Graph
Stars: ✭ 152 (+700%)
Mmseqs2MMseqs2: ultra fast and sensitive search and clustering suite
Stars: ✭ 441 (+2221.05%)
simplesamSimple pure Python SAM parser and objects for working with SAM records
Stars: ✭ 50 (+163.16%)
Circosjsd3 library to build circular graphs
Stars: ✭ 436 (+2194.74%)
MixcrMiXCR is a universal software for fast and accurate extraction of T- and B- cell receptor repertoires from any type of sequencing data. Free for academic use only.
Stars: ✭ 148 (+678.95%)
ContainersBioinformatics containers
Stars: ✭ 435 (+2189.47%)
FluentDNAFluentDNA allows you to browse sequence data of any size using a zooming visualization similar to Google Maps. You can use FluentDNA as a standalone program or as a python module for your own bioinformatics projects.
Stars: ✭ 52 (+173.68%)
SambambaTools for working with SAM/BAM data
Stars: ✭ 409 (+2052.63%)
KaijuFast taxonomic classification of metagenomic sequencing reads using a protein reference database
Stars: ✭ 146 (+668.42%)
JcviPython library to facilitate genome assembly, annotation, and comparative genomics
Stars: ✭ 404 (+2026.32%)
Bowtie2A fast and sensitive gapped read aligner
Stars: ✭ 365 (+1821.05%)
PlantcvPlant image analysis using OpenCV
Stars: ✭ 352 (+1752.63%)
gchromVARCell type specific enrichments using finemapped variants and quantitative epigenetic data
Stars: ✭ 31 (+63.16%)
MegahitUltra-fast and memory-efficient (meta-)genome assembler
Stars: ✭ 343 (+1705.26%)
HgvsPython library to parse, format, validate, normalize, and map sequence variants. `pip install hgvs`
Stars: ✭ 138 (+626.32%)
GrakelA scikit-learn compatible library for graph kernels
Stars: ✭ 330 (+1636.84%)
Market-Mix-ModelingMarket Mix Modelling for an eCommerce firm to estimate the impact of various marketing levers on sales
Stars: ✭ 31 (+63.16%)
Dash BioOpen-source bioinformatics components for Dash
Stars: ✭ 329 (+1631.58%)
HifiasmHifiasm: a haplotype-resolved assembler for accurate Hifi reads
Stars: ✭ 134 (+605.26%)
Dash CytoscapeInteractive network visualization in Python and Dash, powered by Cytoscape.js
Stars: ✭ 309 (+1526.32%)
PyfaidxEfficient pythonic random access to fasta subsequences
Stars: ✭ 307 (+1515.79%)
OctopusBayesian haplotype-based mutation calling
Stars: ✭ 131 (+589.47%)
EdlibLightweight, super fast C/C++ (& Python) library for sequence alignment using edit (Levenshtein) distance.
Stars: ✭ 298 (+1468.42%)
klar-EDAA python library for automated exploratory data analysis
Stars: ✭ 15 (-21.05%)
TdcTherapeutics Data Commons: Machine Learning Datasets and Tasks for Therapeutics
Stars: ✭ 291 (+1431.58%)
BiotiteA comprehensive library for computational molecular biology
Stars: ✭ 132 (+594.74%)
Dada2Accurate sample inference from amplicon data with single nucleotide resolution
Stars: ✭ 276 (+1352.63%)
SMMTSocial Media Mining Toolkit (SMMT) main repository
Stars: ✭ 116 (+510.53%)
AnvioAn analysis and visualization platform for 'omics data
Stars: ✭ 273 (+1336.84%)
ReadfqFast multi-line FASTA/Q reader in several programming languages
Stars: ✭ 128 (+573.68%)
CobrapyCOBRApy is a package for constraint-based modeling of metabolic networks.
Stars: ✭ 267 (+1305.26%)
LambdaLAMBDA – the Local Aligner for Massive Biological DatA
Stars: ✭ 59 (+210.53%)
MiniasmUltrafast de novo assembly for long noisy reads (though having no consensus step)
Stars: ✭ 216 (+1036.84%)
Qiime16stutorialA tutorial on methods of 16S analysis with QIIME 1
Stars: ✭ 59 (+210.53%)
Dna NnModel and predict short DNA sequence features with neural networks
Stars: ✭ 59 (+210.53%)
ngstoolsMy own tools code for NGS data analysis (Next Generation Sequencing)
Stars: ✭ 28 (+47.37%)
SVCollectorMethod to optimally select samples for validation and resequencing
Stars: ✭ 20 (+5.26%)