foofahFoofah: programming-by-example data transformation program synthesizer
Stars: ✭ 24 (+26.32%)
mulledMulled - Automatized Containerized Software Repository
Stars: ✭ 49 (+157.89%)
unimapA EXPERIMENTAL fork of minimap2 optimized for assembly-to-reference alignment
Stars: ✭ 76 (+300%)
dnaioRead and write FASTQ and FASTA efficiently from Python
Stars: ✭ 27 (+42.11%)
skimpyskimpy is a light weight tool that provides summary statistics about variables in data frames within the console.
Stars: ✭ 236 (+1142.11%)
vcf2gwasPython API for comprehensive GWAS analysis using GEMMA
Stars: ✭ 27 (+42.11%)
ntHashFast hash function for DNA sequences
Stars: ✭ 66 (+247.37%)
Scaff10XPipeline for scaffolding and breaking a genome assembly using 10x genomics linked-reads
Stars: ✭ 21 (+10.53%)
companionThis repository has been archived, currently maintained version is at https://github.com/iii-companion/companion
Stars: ✭ 21 (+10.53%)
StrelkaStrelka2 germline and somatic small variant caller
Stars: ✭ 244 (+1184.21%)
open-cravatA modular annotation tool for genomic variants
Stars: ✭ 74 (+289.47%)
OpenGene.jl(No maintenance) OpenGene, core libraries for NGS data analysis and bioinformatics in Julia
Stars: ✭ 60 (+215.79%)
candockA time series signal analysis and classification framework
Stars: ✭ 56 (+194.74%)
simplesamSimple pure Python SAM parser and objects for working with SAM records
Stars: ✭ 50 (+163.16%)
FluentDNAFluentDNA allows you to browse sequence data of any size using a zooming visualization similar to Google Maps. You can use FluentDNA as a standalone program or as a python module for your own bioinformatics projects.
Stars: ✭ 52 (+173.68%)
tftargets🎯 Human transcription factor target genes.
Stars: ✭ 77 (+305.26%)
GRAFIMOGRAph-based Finding of Individual Motif Occurrences
Stars: ✭ 22 (+15.79%)
SourmashQuickly search, compare, and analyze genomic and metagenomic data sets.
Stars: ✭ 237 (+1147.37%)
gchromVARCell type specific enrichments using finemapped variants and quantitative epigenetic data
Stars: ✭ 31 (+63.16%)
Market-Mix-ModelingMarket Mix Modelling for an eCommerce firm to estimate the impact of various marketing levers on sales
Stars: ✭ 31 (+63.16%)
klar-EDAA python library for automated exploratory data analysis
Stars: ✭ 15 (-21.05%)
SMMTSocial Media Mining Toolkit (SMMT) main repository
Stars: ✭ 116 (+510.53%)
timit-preprocessorExtract mfcc vectors and phones from TIMIT dataset
Stars: ✭ 14 (-26.32%)
awesome-small-molecule-mlA curated list of resources for machine learning for small-molecule drug discovery
Stars: ✭ 54 (+184.21%)
reskitA library for creating and curating reproducible pipelines for scientific and industrial machine learning
Stars: ✭ 27 (+42.11%)
tibannaTibanna helps you run your genomic pipelines on Amazon cloud (AWS). It is used by the 4DN DCIC (4D Nucleome Data Coordination and Integration Center) to process data. Tibanna supports CWL/WDL (w/ docker), Snakemake (w/ conda) and custom Docker/shell command.
Stars: ✭ 61 (+221.05%)
rssRegression with Summary Statistics.
Stars: ✭ 42 (+121.05%)
referenceseekerRapid determination of appropriate reference genomes.
Stars: ✭ 65 (+242.11%)
lme4qtlMixed models @lme4 + custom covariances + parameter constraints
Stars: ✭ 39 (+105.26%)
hotsubCommand line tool to run batch jobs concurrently with ETL framework on AWS or other cloud computing resources
Stars: ✭ 29 (+52.63%)
GenAMapVisual Machine Learning of Genome-Phenome Associations
Stars: ✭ 22 (+15.79%)
S-PCGCHeritability, genetic correlation and functional enrichment estimation for case-control studies
Stars: ✭ 13 (-31.58%)
adjclustAdjacency-constrained hierarchical clustering of a similarity matrix
Stars: ✭ 15 (-21.05%)
qtcatQuantitative Trait Cluster Association Test in R
Stars: ✭ 25 (+31.58%)
bystroBystro genetic analysis (annotation, filtering, statistics)
Stars: ✭ 31 (+63.16%)
hessEstimate local SNP heritability and genetic covariance from GWAS summary association statistics.
Stars: ✭ 27 (+42.11%)
admixrAn R package for reproducible and automated ADMIXTOOLS analyses
Stars: ✭ 20 (+5.26%)
rvtestsRare variant test software for next generation sequencing data
Stars: ✭ 114 (+500%)
CoNekTCoNekT (short for Co-expression Network Toolkit) is a platform to browse co-expression data and enable cross-species comparisons.
Stars: ✭ 17 (-10.53%)
optimus🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Stars: ✭ 1,351 (+7010.53%)
paccmann datasetspytoda - PaccMann PyTorch Dataset Classes. Read the docs: https://paccmann.github.io/paccmann_datasets/
Stars: ✭ 15 (-21.05%)
BioDiscMLLarge-scale automatic feature selection for biomarker discovery in high-dimensional OMICs data
Stars: ✭ 17 (-10.53%)
sparklanesA lightweight data processing framework for Apache Spark
Stars: ✭ 17 (-10.53%)
BiopythonOfficial git repository for Biopython (originally converted from CVS)
Stars: ✭ 2,936 (+15352.63%)
catchA package for designing compact and comprehensive capture probe sets.
Stars: ✭ 55 (+189.47%)
Cyvcf2cython + htslib == fast VCF and BCF processing
Stars: ✭ 243 (+1178.95%)
CeleScopeSingle Cell Analysis Pipelines
Stars: ✭ 36 (+89.47%)
Single Cell PseudotimeAn overview of algorithms for estimating pseudotime in single-cell RNA-seq data
Stars: ✭ 239 (+1157.89%)
biskitA Python platform for Structural Bioinformatics
Stars: ✭ 47 (+147.37%)
Homebrew Bio🍺🔬 Bioinformatics formulae for the Homebrew package manager (macOS and Linux)
Stars: ✭ 237 (+1147.37%)
dysgudysgu-SV is a collection of tools for calling structural variants using short or long reads
Stars: ✭ 47 (+147.37%)
protwisProtwis is the backbone of the GPCRdb. The GPCRdb contains reference data, interactive visualisation and experiment design tools for G protein-coupled receptors (GPCRs).
Stars: ✭ 20 (+5.26%)
awesome-phagesA curated list of phage related software and computational resources for phage scientists, bioinformaticians and enthusiasts.
Stars: ✭ 14 (-26.32%)
ngstoolsMy own tools code for NGS data analysis (Next Generation Sequencing)
Stars: ✭ 28 (+47.37%)