DeeptoolsTools to process and analyze deep sequencing data.
Stars: ✭ 448 (+1444.83%)
catchA package for designing compact and comprehensive capture probe sets.
Stars: ✭ 55 (+89.66%)
GalaxyData intensive science for everyone.
Stars: ✭ 812 (+2700%)
NglessNGLess: NGS with less work
Stars: ✭ 115 (+296.55%)
MTBseq sourceMTBseq is an automated pipeline for mapping, variant calling and detection of resistance mediating and phylogenetic variants from illumina whole genome sequence data of Mycobacterium tuberculosis complex isolates.
Stars: ✭ 26 (-10.34%)
GatkOfficial code repository for GATK versions 4 and up
Stars: ✭ 1,002 (+3355.17%)
STingUltrafast sequence typing and gene detection from NGS raw reads
Stars: ✭ 15 (-48.28%)
JvarkitJava utilities for Bioinformatics
Stars: ✭ 313 (+979.31%)
reg-genRegulatory Genomics Toolbox: Python library and set of tools for the integrative analysis of high throughput regulatory genomics data.
Stars: ✭ 64 (+120.69%)
bio-dockers🐳 Bio-dockers: dockerized bioinformatic tools
Stars: ✭ 33 (+13.79%)
MGSEMapping-based Genome Size Estimation (MGSE) performs an estimation of a genome size based on a read mapping to an existing genome sequence assembly.
Stars: ✭ 22 (-24.14%)
DeepvariantDeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data.
Stars: ✭ 2,404 (+8189.66%)
HtsjdkA Java API for high-throughput sequencing data (HTS) formats.
Stars: ✭ 220 (+658.62%)
binMy bioinfo toolbox
Stars: ✭ 42 (+44.83%)
nthashntHash implementation in Rust
Stars: ✭ 26 (-10.34%)
iisInformation Inference Service of the OpenAIRE system
Stars: ✭ 16 (-44.83%)
Circle-MapA method for circular DNA detection based on probabilistic mapping of ultrashort reads
Stars: ✭ 45 (+55.17%)
big-data-exploration[Archive] Intern project - Big Data Exploration using MongoDB - This Repository is NOT a supported MongoDB product
Stars: ✭ 43 (+48.28%)
CuteVCFsimple viewer for variant call format using htslib
Stars: ✭ 30 (+3.45%)
cryfaA secure encryption tool for genomic data
Stars: ✭ 53 (+82.76%)
mandrakeMandrake 🌿/👨🔬🦆 – Fast visualisation of the population structure of pathogens using Stochastic Cluster Embedding
Stars: ✭ 29 (+0%)
instaGRAALLarge genome reassembly based on Hi-C data, continuation of GRAAL
Stars: ✭ 32 (+10.34%)
SplitThreaderExplore rearrangements and copy-number amplifications in a cancer genome
Stars: ✭ 65 (+124.14%)
bapBead-based single-cell atac processing
Stars: ✭ 20 (-31.03%)
atacrAnalysing Capture Seq Count Data
Stars: ✭ 14 (-51.72%)
DISCOVERDISCOVER co-occurrence and mutual exclusivity analysis for cancer genomics data
Stars: ✭ 21 (-27.59%)
haslrA fast tool for hybrid genome assembly of long and short reads
Stars: ✭ 68 (+134.48%)
phenolphenol: Phenotype ontology library
Stars: ✭ 15 (-48.28%)
OpenOmicsA bioinformatics API and web-app to integrate multi-omics datasets & interface with public databases.
Stars: ✭ 22 (-24.14%)
datalake-etl-pipelineSimplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (+34.48%)
human genomics pipelineA Snakemake workflow to process single samples or cohorts of paired-end sequencing data (WGS or WES) using trim galore/bwa/GATK4/parabricks.
Stars: ✭ 19 (-34.48%)
fwdpy11Forward-time simulation in Python using fwdpp
Stars: ✭ 25 (-13.79%)
vrs-pythonGA4GH Variation Representation Python Implementation
Stars: ✭ 35 (+20.69%)
hive to es同步Hive数据仓库数据到Elasticsearch的小工具
Stars: ✭ 21 (-27.59%)
ngs pipelineExome/Capture/RNASeq Pipeline Implementation using snakemake
Stars: ✭ 40 (+37.93%)
dockerfilesMulti docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, Hue, Mesos, ... )
Stars: ✭ 29 (+0%)
genoiseruse the noise
Stars: ✭ 15 (-48.28%)
corcAn ORC File Scheme for the Cascading data processing platform.
Stars: ✭ 14 (-51.72%)
smart-data-lakeSmart Automation Tool for building modern Data Lakes and Data Pipelines
Stars: ✭ 79 (+172.41%)
CUT-RUNTools-2.0CUT&RUN and CUT&Tag data processing and analysis
Stars: ✭ 36 (+24.14%)
gubbinsRapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using Gubbins
Stars: ✭ 103 (+255.17%)
LogAnalyzeHelper论坛日志分析系统清洗程序(包含IP规则库,UDF开发,MapReduce程序,日志数据)
Stars: ✭ 33 (+13.79%)
psmcImplementation of the Pairwise Sequentially Markovian Coalescent (PSMC) model
Stars: ✭ 121 (+317.24%)
DRAMDistilled and Refined Annotation of Metabolism: A tool for the annotation and curation of function for microbial and viral genomes
Stars: ✭ 159 (+448.28%)
bactmapA mapping-based pipeline for creating a phylogeny from bacterial whole genome sequences
Stars: ✭ 36 (+24.14%)
LRSDAYLRSDAY: Long-read Sequencing Data Analysis for Yeasts
Stars: ✭ 26 (-10.34%)
learning-hadoop-and-sparkCompanion to Learning Hadoop and Learning Spark courses on Linked In Learning
Stars: ✭ 146 (+403.45%)
pyspark-ML-in-ColabPyspark in Google Colab: A simple machine learning (Linear Regression) model
Stars: ✭ 32 (+10.34%)
gnomad-browserExplore gnomAD datasets on the web
Stars: ✭ 61 (+110.34%)
CliqueSNVNo description or website provided.
Stars: ✭ 13 (-55.17%)
mitymity: A highly sensitive mitochondrial variant analysis pipeline for whole genome sequencing data
Stars: ✭ 27 (-6.9%)