All Projects → hail-is → Hail

hail-is / Hail

Licence: mit
Scalable genomic data analysis.

Programming Languages

python
139335 projects - #7 most used programming language
scala
5932 projects

Projects that are alternatives of or similar to Hail

Tiledb Vcf
Efficient variant-call data storage and retrieval library using the TileDB storage library.
Stars: ✭ 26 (-96.32%)
Mutual labels:  spark, bioinformatics, genomics, vcf
Vcfanno
annotate a VCF with other VCFs/BEDs/tabixed files
Stars: ✭ 259 (-63.31%)
Mutual labels:  bioinformatics, genomics, vcf
Svtyper
Bayesian genotyper for structural variants
Stars: ✭ 79 (-88.81%)
Mutual labels:  bioinformatics, genomics, vcf
Pygeno
Personalized Genomics and Proteomics. Main diet: Ensembl, side dishes: SNPs
Stars: ✭ 261 (-63.03%)
Mutual labels:  bioinformatics, genomics, vcf
Genomics
A collection of scripts and notes related to genomics and bioinformatics
Stars: ✭ 101 (-85.69%)
Mutual labels:  bioinformatics, genomics, vcf
Cyvcf2
cython + htslib == fast VCF and BCF processing
Stars: ✭ 243 (-65.58%)
Mutual labels:  bioinformatics, genomics, vcf
Hap.py
Haplotype VCF comparison tools
Stars: ✭ 249 (-64.73%)
Mutual labels:  bioinformatics, genomics, vcf
Gatk
Official code repository for GATK versions 4 and up
Stars: ✭ 1,002 (+41.93%)
Mutual labels:  spark, bioinformatics, genomics
Seq
A high-performance, Pythonic language for bioinformatics
Stars: ✭ 263 (-62.75%)
Mutual labels:  bioinformatics, genomics
Arvados
An open source platform for managing and analyzing biomedical big data
Stars: ✭ 274 (-61.19%)
Mutual labels:  bioinformatics, genomics
Pyfaidx
Efficient pythonic random access to fasta subsequences
Stars: ✭ 307 (-56.52%)
Mutual labels:  bioinformatics, genomics
Bowtie2
A fast and sensitive gapped read aligner
Stars: ✭ 365 (-48.3%)
Mutual labels:  bioinformatics, genomics
Postgui
A React web application to query and share any PostgreSQL database.
Stars: ✭ 260 (-63.17%)
Mutual labels:  bioinformatics, genomics
Gwa tutorial
A comprehensive tutorial about GWAS and PRS
Stars: ✭ 303 (-57.08%)
Mutual labels:  bioinformatics, genomics
Bio.jl
[DEPRECATED] Bioinformatics and Computational Biology Infrastructure for Julia
Stars: ✭ 257 (-63.6%)
Mutual labels:  bioinformatics, genomics
Jvarkit
Java utilities for Bioinformatics
Stars: ✭ 313 (-55.67%)
Mutual labels:  bioinformatics, genomics
Nucleus
Python and C++ code for reading and writing genomics data.
Stars: ✭ 657 (-6.94%)
Mutual labels:  bioinformatics, genomics
Megahit
Ultra-fast and memory-efficient (meta-)genome assembler
Stars: ✭ 343 (-51.42%)
Mutual labels:  bioinformatics, genomics
Bwa Mem2
The next version of bwa-mem
Stars: ✭ 408 (-42.21%)
Mutual labels:  bioinformatics, genomics
Jcvi
Python library to facilitate genome assembly, annotation, and comparative genomics
Stars: ✭ 404 (-42.78%)
Mutual labels:  bioinformatics, genomics

Hail

Zulip

Hail is an open-source, general-purpose, Python-based data analysis tool with additional data types and methods for working with genomic data.

Hail is built to scale and has first-class support for multi-dimensional structured data, like the genomic data in a genome-wide association study (GWAS).

Hail is exposed as a Python library, using primitives for distributed queries and linear algebra implemented in Scala, Spark, and increasingly C++.

See the documentation for more info on using Hail.

Community

Hail has been widely adopted in academia and industry, including as the analysis platform for the genome aggregation database and UK Biobank rapid GWAS. Learn more about Hail-powered science.

Contribute

If you'd like to discuss or contribute to the development of methods or infrastructure, please:

Hail uses a continuous deployment approach to software development, which means we frequently add new features. We update users about changes to Hail via the Discussion Forum. We recommend creating an account on the Discussion Forum so that you can subscribe to these updates as well.

Maintainer

Hail is maintained by a team in the Neale lab at the Stanley Center for Psychiatric Research of the Broad Institute of MIT and Harvard and the Analytic and Translational Genetics Unit of Massachusetts General Hospital.

Contact the Hail team at [email protected].

Citing Hail

If you use Hail for published work, please cite the software. You can get a citation for the version of Hail you installed by executing:

import hail as hl
print(hl.citation())

Which will look like:

Hail Team. Hail 0.2.13-81ab564db2b4. https://github.com/hail-is/hail/releases/tag/0.2.13.
Acknowledgements

The Hail team has several sources of funding at the Broad Institute:

  • The Stanley Center for Psychiatric Research, which together with Neale Lab has provided an incredibly supportive and stimulating home.
  • Principal Investigators Benjamin Neale and Daniel MacArthur, whose scientific leadership has been essential for solving the right problems.
  • Jeremy Wertheimer, whose strategic advice and generous philanthropy have been essential for growing the impact of Hail.

We are grateful for generous support from:

  • The National Institute of Diabetes and Digestive and Kidney Diseases
  • The National Institute of Mental Health
  • The National Human Genome Research Institute
  • The Chan Zuckerberg Initiative

We would like to thank Zulip for supporting open-source by providing free hosting, and YourKit, LLC for generously providing free licenses for YourKit Java Profiler for open-source development.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].