All Projects → spark-vcf → Similar Projects or Alternatives

440 Open source projects that are alternatives of or similar to spark-vcf

CuteVCF
simple viewer for variant call format using htslib
Stars: ✭ 30 (+100%)
Mutual labels:  genomics, vcf, variants
rvtests
Rare variant test software for next generation sequencing data
Stars: ✭ 114 (+660%)
Mutual labels:  variants, genotype, vcf-files
vcf stuff
📊Evaluating, filtering, comparing, and visualising VCF
Stars: ✭ 19 (+26.67%)
Mutual labels:  vcf, variants
rare-disease-wf
(WIP) best-practices workflow for rare disease
Stars: ✭ 47 (+213.33%)
Mutual labels:  genomics, variants
Pygeno
Personalized Genomics and Proteomics. Main diet: Ensembl, side dishes: SNPs
Stars: ✭ 261 (+1640%)
Mutual labels:  genomics, vcf
Svtyper
Bayesian genotyper for structural variants
Stars: ✭ 79 (+426.67%)
Mutual labels:  genomics, vcf
learning vcf file
Learning the Variant Call Format
Stars: ✭ 104 (+593.33%)
Mutual labels:  vcf, vcf-files
Tiledb Vcf
Efficient variant-call data storage and retrieval library using the TileDB storage library.
Stars: ✭ 26 (+73.33%)
Mutual labels:  genomics, vcf
Genozip
Compressor for genomic files (FASTQ, SAM/BAM, VCF, FASTA, GVF, 23andMe...), up to 5x better than gzip and faster too
Stars: ✭ 53 (+253.33%)
Mutual labels:  genomics, vcf
mgatk
mgatk: mitochondrial genome analysis toolkit
Stars: ✭ 65 (+333.33%)
Mutual labels:  genomics, genotype
Vcfanno
annotate a VCF with other VCFs/BEDs/tabixed files
Stars: ✭ 259 (+1626.67%)
Mutual labels:  genomics, vcf
SNPGenie
Program for estimating πN/πS, dN/dS, and other diversity measures from next-generation sequencing data
Stars: ✭ 81 (+440%)
Mutual labels:  vcf, vcf-files
Htsjdk
A Java API for high-throughput sequencing data (HTS) formats.
Stars: ✭ 220 (+1366.67%)
Mutual labels:  genomics, vcf
phenomenet-vp
A phenotype-based tool for variant prioritization in WES and WGS data
Stars: ✭ 31 (+106.67%)
Mutual labels:  variants, vcf-files
Genomics
A collection of scripts and notes related to genomics and bioinformatics
Stars: ✭ 101 (+573.33%)
Mutual labels:  genomics, vcf
indelope
find large indels (in the blind spot between GATK/freebayes and SV callers)
Stars: ✭ 38 (+153.33%)
Mutual labels:  genomics, vcf
Cyvcf2
cython + htslib == fast VCF and BCF processing
Stars: ✭ 243 (+1520%)
Mutual labels:  genomics, vcf
MTBseq source
MTBseq is an automated pipeline for mapping, variant calling and detection of resistance mediating and phylogenetic variants from illumina whole genome sequence data of Mycobacterium tuberculosis complex isolates.
Stars: ✭ 26 (+73.33%)
Mutual labels:  genomics, variants
Hap.py
Haplotype VCF comparison tools
Stars: ✭ 249 (+1560%)
Mutual labels:  genomics, vcf
vcfstats
Powerful statistics for VCF files
Stars: ✭ 32 (+113.33%)
Mutual labels:  vcf, vcf-files
variantkey
Numerical Encoding for Human Genetic Variants
Stars: ✭ 32 (+113.33%)
Mutual labels:  genomics, variants
Ontologies
Home of the Genomic Feature and Variation Ontology (GFVO)
Stars: ✭ 16 (+6.67%)
Mutual labels:  genomics, vcf
Hail
Scalable genomic data analysis.
Stars: ✭ 706 (+4606.67%)
Mutual labels:  genomics, vcf
cljam
A DNA Sequence Alignment/Map (SAM) library for Clojure
Stars: ✭ 85 (+466.67%)
Mutual labels:  genomics, vcf
HLA
xHLA: Fast and accurate HLA typing from short read sequence data
Stars: ✭ 84 (+460%)
Mutual labels:  genomics, variants
manhattan generator
Manhattan plot Generator
Stars: ✭ 20 (+33.33%)
Mutual labels:  genomics
atacr
Analysing Capture Seq Count Data
Stars: ✭ 14 (-6.67%)
Mutual labels:  genomics
CUT-RUNTools-2.0
CUT&RUN and CUT&Tag data processing and analysis
Stars: ✭ 36 (+140%)
Mutual labels:  genomics
disq
A library for manipulating bioinformatics sequencing formats in Apache Spark
Stars: ✭ 29 (+93.33%)
Mutual labels:  genomics
2vcf
convert 23andme or Ancestry.com raw genotype calls into VCF format, with dbSNP annotations
Stars: ✭ 42 (+180%)
Mutual labels:  vcf
assembly improvement
Improve the quality of a denovo assembly by scaffolding and gap filling
Stars: ✭ 46 (+206.67%)
Mutual labels:  genomics
bactmap
A mapping-based pipeline for creating a phylogeny from bacterial whole genome sequences
Stars: ✭ 36 (+140%)
Mutual labels:  genomics
shell-genomics
Introduction to the Command Line for Genomics
Stars: ✭ 54 (+260%)
Mutual labels:  genomics
mapping-iterative-assembler
Consensus calling (or "reference assisted assembly"), chiefly of ancient mitochondria
Stars: ✭ 15 (+0%)
Mutual labels:  genomics
workflows
Bioinformatics workflows developed for and used on the St. Jude Cloud project.
Stars: ✭ 16 (+6.67%)
Mutual labels:  genomics
MultiAssayExperiment
Bioconductor package for management of multi-assay data
Stars: ✭ 57 (+280%)
Mutual labels:  genomics
fwdpy11
Forward-time simulation in Python using fwdpp
Stars: ✭ 25 (+66.67%)
Mutual labels:  genomics
scipp
Multi-dimensional data arrays with labeled dimensions
Stars: ✭ 55 (+266.67%)
Mutual labels:  dataframe
SplitThreader
Explore rearrangements and copy-number amplifications in a cancer genome
Stars: ✭ 65 (+333.33%)
Mutual labels:  genomics
opaque-sql
An encrypted data analytics platform
Stars: ✭ 169 (+1026.67%)
Mutual labels:  spark-sql
DISCOVER
DISCOVER co-occurrence and mutual exclusivity analysis for cancer genomics data
Stars: ✭ 21 (+40%)
Mutual labels:  genomics
Clair3
Clair3 - Symphonizing pileup and full-alignment for high-performance long-read variant calling
Stars: ✭ 119 (+693.33%)
Mutual labels:  genomics
VariantRetriever
VariantRetriever is a minimalist package for feature flagging
Stars: ✭ 23 (+53.33%)
Mutual labels:  variants
Julia-data-science
Data science and numerical computing with Julia
Stars: ✭ 54 (+260%)
Mutual labels:  dataframe
civic-server
Backend Server for CIViC Project
Stars: ✭ 39 (+160%)
Mutual labels:  variants
dataframe
Structured data processing in Kotlin
Stars: ✭ 319 (+2026.67%)
Mutual labels:  dataframe
nthash
ntHash implementation in Rust
Stars: ✭ 26 (+73.33%)
Mutual labels:  genomics
albis
Albis: High-Performance File Format for Big Data Systems
Stars: ✭ 20 (+33.33%)
Mutual labels:  spark-sql
biowasm
WebAssembly modules for genomics
Stars: ✭ 115 (+666.67%)
Mutual labels:  genomics
go enrichment
Transcripts annotation and GO enrichment Fisher tests
Stars: ✭ 24 (+60%)
Mutual labels:  genomics
nf-hack17-tutorial
Nextflow basic tutorial for newbie users
Stars: ✭ 32 (+113.33%)
Mutual labels:  genomics
phenol
phenol: Phenotype ontology library
Stars: ✭ 15 (+0%)
Mutual labels:  genomics
OpenOmics
A bioinformatics API and web-app to integrate multi-omics datasets & interface with public databases.
Stars: ✭ 22 (+46.67%)
Mutual labels:  genomics
dt-sql-parser
SQL Parsers for BigData, built with antlr4.
Stars: ✭ 135 (+800%)
Mutual labels:  spark-sql
bap
Bead-based single-cell atac processing
Stars: ✭ 20 (+33.33%)
Mutual labels:  genomics
human genomics pipeline
A Snakemake workflow to process single samples or cohorts of paired-end sequencing data (WGS or WES) using trim galore/bwa/GATK4/parabricks.
Stars: ✭ 19 (+26.67%)
Mutual labels:  genomics
ladybug-pandas
🐞 <3 🐼 A ladybug extension powered by pandas
Stars: ✭ 15 (+0%)
Mutual labels:  dataframe
soda
Python-based UCSC genome browser snapshot-taker and gallery-maker
Stars: ✭ 12 (-20%)
Mutual labels:  genomics
bow
Go data analysis / manipulation library built on top of Apache Arrow
Stars: ✭ 20 (+33.33%)
Mutual labels:  dataframe
h3ron
Rust crates for the H3 geospatial indexing system
Stars: ✭ 52 (+246.67%)
Mutual labels:  dataframe
1-60 of 440 similar projects