All Projects → divonlan → Genozip

divonlan / Genozip

Licence: other
Compressor for genomic files (FASTQ, SAM/BAM, VCF, FASTA, GVF, 23andMe...), up to 5x better than gzip and faster too

Programming Languages

c
50402 projects - #5 most used programming language

Projects that are alternatives of or similar to Genozip

cljam
A DNA Sequence Alignment/Map (SAM) library for Clojure
Stars: ✭ 85 (+60.38%)
Mutual labels:  genomics, sam, vcf
Htsjdk
A Java API for high-throughput sequencing data (HTS) formats.
Stars: ✭ 220 (+315.09%)
Mutual labels:  sam, genomics, vcf
ratarmount
Random Access Read-Only Tar Mount
Stars: ✭ 217 (+309.43%)
Mutual labels:  compression, gzip
pyrus-cramjam
Thin Python wrapper to de/compression algorithms in Rust - lightweight & no dependencies
Stars: ✭ 40 (-24.53%)
Mutual labels:  compression, gzip
lambda-smush-py
Gain additional code space via cheeky compression for Python AWS Lambda functions defined in-line to CloudFormation templates.
Stars: ✭ 17 (-67.92%)
Mutual labels:  compression, gzip
deflate-rs
An implementation of a DEFLATE encoder in rust
Stars: ✭ 47 (-11.32%)
Mutual labels:  compression, gzip
sanic compress
An extension which allows you to easily compress your Sanic responses with gzip.
Stars: ✭ 26 (-50.94%)
Mutual labels:  compression, gzip
EasyCompressor
⚡ A compression library that implements many compression algorithms such as LZ4, Zstd, LZMA, Snappy, Brotli, GZip, and Deflate. It helps you to improve performance by reducing Memory Usage and Network Traffic for caching.
Stars: ✭ 167 (+215.09%)
Mutual labels:  compression, gzip
zlib
Compression and decompression in the gzip and zlib formats
Stars: ✭ 32 (-39.62%)
Mutual labels:  compression, gzip
Zoonavigator
Web-based ZooKeeper UI / editor / browser
Stars: ✭ 326 (+515.09%)
Mutual labels:  compression, gzip
Pygeno
Personalized Genomics and Proteomics. Main diet: Ensembl, side dishes: SNPs
Stars: ✭ 261 (+392.45%)
Mutual labels:  genomics, vcf
Htslib
C library for high-throughput sequencing data formats
Stars: ✭ 529 (+898.11%)
Mutual labels:  sam, vcf
hts-python
pythonic wrapper for htslib
Stars: ✭ 18 (-66.04%)
Mutual labels:  genomics, sam
fuc
Frequently used commands in bioinformatics
Stars: ✭ 23 (-56.6%)
Mutual labels:  sam, vcf
simplesam
Simple pure Python SAM parser and objects for working with SAM records
Stars: ✭ 50 (-5.66%)
Mutual labels:  genomics, sam
http compression
🗜️ Deno HTTP compression middleware
Stars: ✭ 34 (-35.85%)
Mutual labels:  compression, gzip
Ontologies
Home of the Genomic Feature and Variation Ontology (GFVO)
Stars: ✭ 16 (-69.81%)
Mutual labels:  genomics, vcf
Hail
Scalable genomic data analysis.
Stars: ✭ 706 (+1232.08%)
Mutual labels:  genomics, vcf
indelope
find large indels (in the blind spot between GATK/freebayes and SV callers)
Stars: ✭ 38 (-28.3%)
Mutual labels:  genomics, vcf
hasmin
Hasmin - A Haskell CSS Minifier
Stars: ✭ 55 (+3.77%)
Mutual labels:  compression, gzip

Genozip



(available on Conda, Docker Hub and https://github.com/divonlan/genozip ; Documentation: http://genozip.com)

Genozip is a compressor for genomic files - while it can compress any file (i.e. not only genomic files), it is optimized to compress FASTQ, SAM/BAM/CRAM, VCF/BCF, FASTA, GVF, PHYLIP, Chain and 23andMe files.

Citing Do you find Genozip useful? Please cite:
Lan, D., et al. (2021) Genozip: a universal extensible genomic data compressor. Bioinformatics, https://doi.org/10.1093/bioinformatics/btab102
Lan, D., et al. (2020) genozip: a fast and efficient compression tool for VCF files Bioinformatics, 36, 4091–4092, https://doi.org/10.1093/bioinformatics/btaa290

Typically, a 2X-5X improvement over the existing compression is achieved when compressing already-compressed files like .fastq.gz .bam vcf.gz and much higher ratios in some other cases.

Yes, Genozip can compress already-compressed files (.gz .bz2 .xz .bam .cram).

The compression is lossless - the decompressed file is 100% identical to the original file (see documentation for exceptions).
Sign up to receive low-frequency updates related to Genozip: https://tinyurl.com/genozip

The command line options are similar to gzip and samtools/bcftools, so if you are familiar with these, it works pretty much the same. To get started, see: http://genozip.com

Genozip is free for non-commercial use. For a commercial license, please contact [email protected]

Usage is subject to terms and conditions. The non-commercial license can be viewed on http://genozip.com/license.html

THIS SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE, TITLE AND NON-INFRINGEMENT. IN NO EVENT SHALL THE COPYRIGHT HOLDERS OR ANYONE DISTRIBUTING THE SOFTWARE BE LIABLE FOR ANY DAMAGES OR OTHER LIABILITY, WHETHER IN CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].