Hail: Scalable genomic data analysis.
Stars: ✭ 706 (+2615.38%)
Svtyper: Bayesian genotyper for structural variants
Stars: ✭ 79 (+203.85%)
Hap.py: Haplotype VCF comparison tools
Stars: ✭ 249 (+857.69%)
Cyvcf2: cython + htslib == fast VCF and BCF processing
Stars: ✭ 243 (+834.62%)
Vcfanno: annotate a VCF with other VCFs/BEDs/tabixed files
Stars: ✭ 259 (+896.15%)
Genomics: A collection of scripts and notes related to genomics and bioinformatics
Stars: ✭ 101 (+288.46%)
Pygeno: Personalized Genomics and Proteomics. Main diet: Ensembl, side dishes: SNPs
Stars: ✭ 261 (+903.85%)
Gatk: Official code repository for GATK versions 4 and up
Stars: ✭ 1,002 (+3753.85%)
Deep Rules: Ten Quick Tips for Deep Learning in Biology
Stars: ✭ 179 (+588.46%)
staramr: Scans genome contigs against the ResFinder, PlasmidFinder, and PointFinder databases.
Stars: ✭ 52 (+100%)
plasmidtron: Assembling the cause of phenotypes and genotypes from NGS data
Stars: ✭ 27 (+3.85%)
chromap: Fast alignment and preprocessing of chromatin profiles
Stars: ✭ 93 (+257.69%)
catch: A package for designing compact and comprehensive capture probe sets.
Stars: ✭ 55 (+111.54%)
calN50: Compute N50/NG50 and auN/auNG
Stars: ✭ 20 (-23.08%)
GenomicDataCommons: Provide R access to the NCI Genomic Data Commons portal.
Stars: ✭ 64 (+146.15%)
SVCollector: Method to optimally select samples for validation and resequencing
Stars: ✭ 20 (-23.08%)
tiptoft: Predict plasmids from uncorrected long read data
Stars: ✭ 27 (+3.85%)
dna-traits: A fast 23andMe genome text file parser, now superseded by arv
Stars: ✭ 64 (+146.15%)
bacnet: BACNET is a Java-based platform for developing websites for multi-omics analysis
Stars: ✭ 12 (-53.85%)
netSmooth: A Network smoothing based method for Single Cell RNA-seq imputation
Stars: ✭ 23 (-11.54%)
fermikit: De novo assembly based variant calling pipeline for Illumina short reads
Stars: ✭ 98 (+276.92%)
Bio.jl: [DEPRECATED] Bioinformatics and Computational Biology Infrastructure for Julia
Stars: ✭ 257 (+888.46%)
Sk Dist: Distributed scikit-learn meta-estimators in PySpark
Stars: ✭ 260 (+900%)
Gwa tutorial: A comprehensive tutorial about GWAS and PRS
Stars: ✭ 303 (+1065.38%)
Pyfaidx: Efficient pythonic random access to fasta subsequences
Stars: ✭ 307 (+1080.77%)
Helmsman: highly-efficient & lightweight mutation signature matrix aggregation
Stars: ✭ 19 (-26.92%)
saffrontree: SaffronTree: Reference-free rapid phylogenetic tree construction from raw read data
Stars: ✭ 17 (-34.62%)
companion: This repository has been archived; the currently maintained version is at https://github.com/iii-companion/companion
Stars: ✭ 21 (-19.23%)
ntHash: Fast hash function for DNA sequences
Stars: ✭ 66 (+153.85%)
bystro: Bystro genetic analysis (annotation, filtering, statistics)
Stars: ✭ 31 (+19.23%)
reg-gen: Regulatory Genomics Toolbox: Python library and set of tools for the integrative analysis of high throughput regulatory genomics data.
Stars: ✭ 64 (+146.15%)
simplesam: Simple pure Python SAM parser and objects for working with SAM records
Stars: ✭ 50 (+92.31%)
Ontologies: Home of the Genomic Feature and Variation Ontology (GFVO)
Stars: ✭ 16 (-38.46%)
awesome-genetics: A curated list of awesome bioinformatics software.
Stars: ✭ 60 (+130.77%)
EarlGrey: Earl Grey: A fully automated TE curation and annotation pipeline
Stars: ✭ 25 (-3.85%)
TypeTE: Genotyping of segregating mobile element insertions
Stars: ✭ 15 (-42.31%)
gff3toembl: Converts Prokka GFF3 files to EMBL files for uploading annotated assemblies to EBI
Stars: ✭ 27 (+3.85%)
GenomeAnalysisModule: Welcome to the website and GitHub repository for the Genome Analysis Module. This website will guide the learning experience for trainees in the UBC MSc Genetic Counselling Training Program as they embark on a journey to learn about analyzing genomes.
Stars: ✭ 19 (-26.92%)
wdlRunR: Elastic, reproducible, and reusable genomic data science tools from R backed by cloud resources
Stars: ✭ 34 (+30.77%)
Spark Notebook: Interactive and Reactive Data Science using Scala and Spark.
Stars: ✭ 3,081 (+11750%)
Arvados: An open source platform for managing and analyzing biomedical big data
Stars: ✭ 274 (+953.85%)
Dash Cytoscape: Interactive network visualization in Python and Dash, powered by Cytoscape.js
Stars: ✭ 309 (+1088.46%)
Seq: A high-performance, Pythonic language for bioinformatics
Stars: ✭ 263 (+911.54%)
Megahit: Ultra-fast and memory-efficient (meta-)genome assembler
Stars: ✭ 343 (+1219.23%)
Bowtie2: A fast and sensitive gapped read aligner
Stars: ✭ 365 (+1303.85%)
Jbrowse: A modern genome browser built with JavaScript and HTML5.
Stars: ✭ 393 (+1411.54%)
Bwa Mem2: The next version of bwa-mem
Stars: ✭ 408 (+1469.23%)
Jvarkit: Java utilities for Bioinformatics
Stars: ✭ 313 (+1103.85%)
Jcvi: Python library to facilitate genome assembly, annotation, and comparative genomics
Stars: ✭ 404 (+1453.85%)
Agile data code 2: Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+1488.46%)
Data Science Ipython Notebooks: Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+84700%)
Htslib: C library for high-throughput sequencing data formats
Stars: ✭ 529 (+1934.62%)
H2o 3: H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+21653.85%)
Deeptools: Tools to process and analyze deep sequencing data.
Stars: ✭ 448 (+1623.08%)
Galaxy: Data intensive science for everyone.
Stars: ✭ 812 (+3023.08%)