1. MiniasmUltrafast de novo assembly for long noisy reads (though having no consensus step)
2. MinigraphProof-of-concept seq-to-graph mapper and graph generator
4. ReadfqFast multi-line FASTA/Q reader in several programming languages
5. Kmer CntCode examples of fast and simple k-mer counters for tutorial purposes
6. BiofastBenchmarking programming languages/implementations for common tasks in Bioinformatics
7. CgrangesA C/C++ library for fast interval overlap queries (with a "bedtools coverage" example)
8. BedtkA simple toolset for BED files (warning: CLI may change before bedtk becomes stable)
9. MinimapThis repo is DEPRECATED. Please use minimap2, the successor of minimap.
10. BgtFlexible genotype query among 30,000+ samples whole-genome
11. Dna NnModel and predict short DNA sequence features with neural networks
12. TabtkToolkit for processing TAB-delimited format
13. EtrfExact Tandem Repeat Finder (not a TRF replacement)
14. BwaBurrow-Wheeler Aligner for short-read alignment (see minimap2 for long-read alignment)
15. Minimap2A versatile pairwise aligner for genomic and spliced nucleotide sequences
17. SeqtkToolkit for processing sequences in FASTA/Q formats
19. bioseq-jsFor live demo, see http://lh3lh3.users.sourceforge.net/bioseq.shtml
20. fermikitDe novo assembly based variant calling pipeline for Illumina short reads
22. samtoolsThis is *NOT* the official repository of samtools.
25. asubA unified array job submitter for LSF, SGE/UGE and Slurm
26. dipcallReference-based variant calling pipeline for a pair of phased haplotype assemblies
27. unimapA EXPERIMENTAL fork of minimap2 optimized for assembly-to-reference alignment
28. klib.nimExperimental getopt, gzip reader, FASTA/Q parser and interval queries in nim-lang
30. ropebwt2Incremental construction of FM-index for DNA sequences
33. hickitTAD calling, phase imputation, 3D modeling and more for diploid single-cell Hi-C (Dip-C) and general Hi-C
34. sdustSymmetric DUST for finding low-complexity regions in DNA sequences
35. psmcImplementation of the Pairwise Sequentially Markovian Coalescent (PSMC) model
36. bfcHigh-performance error correction for Illumina resequencing data
37. fermiA WGS de novo assembler based on the FMD-index for large genomes