All Projects → nanoporetech → rerio

nanoporetech / rerio

Licence: other
Research release basecalling models and configurations

Programming Languages

python
139335 projects - #7 most used programming language
Roff
2310 projects

Projects that are alternatives of or similar to rerio

poreCov
SARS-CoV-2 workflow for nanopore sequence data
Stars: ✭ 34 (-43.33%)
Mutual labels:  nanopore, basecalling
rkmh
Classify sequencing reads using MinHash.
Stars: ✭ 42 (-30%)
Mutual labels:  nanopore
wengan
An accurate and ultra-fast hybrid genome assembler
Stars: ✭ 81 (+35%)
Mutual labels:  nanopore
vbz compression
VBZ compression plugin for nanopore signal data
Stars: ✭ 31 (-48.33%)
Mutual labels:  nanopore
Clair3
Clair3 - Symphonizing pileup and full-alignment for high-performance long-read variant calling
Stars: ✭ 119 (+98.33%)
Mutual labels:  nanopore
RATTLE
Reference-free reconstruction and error correction of transcriptomes from Nanopore long-read sequencing
Stars: ✭ 35 (-41.67%)
Mutual labels:  nanopore
pipeline-structural-variation
Pipeline for calling structural variations in whole genomes sequencing Oxford Nanopore data
Stars: ✭ 104 (+73.33%)
Mutual labels:  nanopore
streamformatics
Real-time species-typing visualisation for nanopore data.
Stars: ✭ 13 (-78.33%)
Mutual labels:  nanopore
interARTIC
InterARTIC - An interactive local web application for viral whole genome sequencing utilising the artic network pipelines..
Stars: ✭ 22 (-63.33%)
Mutual labels:  nanopore
nanoseq
Nanopore demultiplexing, QC and alignment pipeline
Stars: ✭ 82 (+36.67%)
Mutual labels:  nanopore
pychopper
A tool to identify, orient, trim and rescue full length cDNA reads
Stars: ✭ 74 (+23.33%)
Mutual labels:  nanopore
taeper
A small python program to simulate a real-time Nanopore sequencing run based on a previous experiment.
Stars: ✭ 18 (-70%)
Mutual labels:  nanopore
Winnowmap
Long read / genome alignment software
Stars: ✭ 151 (+151.67%)
Mutual labels:  nanopore
wub
Tools and software library developed by the ONT Applications group
Stars: ✭ 57 (-5%)
Mutual labels:  nanopore
fast-sg
Fast-SG: An alignment-free algorithm for ultrafast scaffolding graph construction from short or long reads.
Stars: ✭ 22 (-63.33%)
Mutual labels:  nanopore
haslr
A fast tool for hybrid genome assembly of long and short reads
Stars: ✭ 68 (+13.33%)
Mutual labels:  nanopore
nanoflow
🔬 De novo assembly of nanopore reads using nextflow
Stars: ✭ 20 (-66.67%)
Mutual labels:  nanopore
recentrifuge
Recentrifuge: robust comparative analysis and contamination removal for metagenomics
Stars: ✭ 79 (+31.67%)
Mutual labels:  nanopore
tiptoft
Predict plasmids from uncorrected long read data
Stars: ✭ 27 (-55%)
Mutual labels:  nanopore
IsoQuant
Reference-based transcript discovery from long RNA read
Stars: ✭ 26 (-56.67%)
Mutual labels:  nanopore

/ONT_logo.png


Rerio

Rerio is comprised of "research release" basecalling models and configuration files. All basecalling models are compatible with Guppy (see Nanopore Community page for download/install instructions). Since research models often utilise new features, the latest version of Guppy may be required.

The research models provide cutting-edge functions, speeds and accuracies that have not been productionised or validated by Oxford Nanopore Technologies in the Guppy executable basecaller. Nevertheless, models and config files can be run with the basecalling infrastructure in Guppy executable by using the instructions available in this repository.

Models are provided for DNA and RNA, various pore types and to basecall different modified bases in a variety of contexts.

Additionally, Rerio hosts research models for the short variant caller Clair3.

Features

  • New and advanced research models that are at the forefront of nanopore sequencing analyses (e.g. highest accuracies, quickest speeds, more modified basecalls in more contexts)
  • Functions with Megalodon (2.0 and above) to call modified DNA bases

Note: The results generated by research basecalling models have not been scrutinized nor validated and Oxford Nanopore cannot support each use case (see Research Release disclaimer)


Getting started

Follow the instructions below to use research models with Guppy executable. See further details for running Guppy here.

Dependencies

You will require:

  • Python-compatible environment (e.g. Terminal)
  • The most recent Guppy version, available on the Nanopore Community Software downloads page.

Note: Rerio is versioned by the minimum version of Guppy required to run _all_ models in the repository. A new release of Rerio will be tagged when a model is added that requires a more recent version of Guppy.

Installation

Rerio can be downloaded by cloning from GitHub git clone https://github.com/nanoporetech/rerio. Once Rerio has been downloaded, models can be downloaded via the download_model.py script.

# Download all models
rerio/download_model.py
# Download specific model(s)
rerio/download_model.py rerio/basecall_models/res_dna_r9.4.1_e8.1_sup_v033.cfg

Once desired models have been downloaded, they can be run by specifying the Guppy data path option (-d or --data_path) and selecting the desired config file (-c or --config).

./ont-guppy/bin/guppy_basecaller -i fast5s/ -s basecalled_fast5s \
   -d ./rerio/basecall_models/ \
   -c res_dna_r9.4.1_e8.1_sup_v033.cfg

Use and description of basecaller models

Config DNA/RNA Pore Device Tested Guppy Notes
res_dna_r9.4.1_e8.1_{fast,hac,sup}_v033.cfg DNA R9.4.1 All v5.0.11 Kit 12 E8.1 CRF Models
res_dna_r941_min_crf_v032.cfg DNA R9.4.1 MinION/GridION v4.4.0 Bonito CRF
res_dna_r103_min_crf_v032.cfg DNA R10.3 MinION/GridION v4.4.0 Bonito CRF
res_dna_r103_q20ea_crf_v033.cfg DNA R10.3 PromethION v5.0.11 Q20 early access CRF
res_dna_r103_q20ea_crf_v034.cfg DNA R10.3 PromethION v5.0.11 Q20 early access CRF
res_dna_r941_min_flipflop_v001.cfg DNA R9.4.1 MinION/GridION v3.5.1  
res_dna_r941_min_dUfast_v001.cfg DNA R9.4.1 MinION/GridION v3.5.1 Calls dU as dT (fast)
res_dna_r941_min_dUhac_v001.cfg DNA R9.4.1 MinION/GridION v3.5.1 Calls dU as dT (high acc.)
res_dna_r941_min_rle_v001.cfg DNA R9.4.1 MinION/GridION v3.5.1  
res_dna_r103_min_flipflop_v001.cfg DNA R10.3 MinION/GridION v3.5.1  
res_dna_r103_prom_rle_v001.cfg DNA R10.3 PromethION v3.5.1  
res_rna2_r941_min_flipflop_v001.cfg RNA2 R9.4.1 MinION/GridION v3.5.1  

Megalodon Support

Rerio research models can be run within megalodon by specifying the data directory in the --guppy-params argument.

megalodon fast5s/ --guppy-params "-d ./rerio/basecall_models/" \
    --guppy-config res_dna_r9.4.1_e8.1_sup_v033.cfg

Barcoding Support

The Rerio GitHub code repository includes a minimal barcoding stub to allow Guppy to run successfully. In order to enable full Guppy barcoding capabilities, all barcoding files must be transferred from the guppy data directory to the rerio data directory.

cp ont-guppy/data/barcoding/* rerio/basecall_models/barcoding/

Taiyaki Models

Taiyaki checkpoint files corresponding to Rerio research models are provided. Not all of these are compatible with the public release of Taiyaki.

#  Download all models
python3 download_models.py --checkpoints
#  Download particular model
python3 download_models.py --checkpoints taiyaki_checkpoint/model

Clair3 Models

Clair3 models for the following configurations are available:

Latest:

Config Chemistry Guppy basecaller Dorado basecaller model
r1041_e82_400bps_sup_v400 R10.4.1 E8.2
v4.0.0 SUP
r1041_e82_400bps_hac_v400 R10.4.1 E8.2
v4.0.0 HAC
r1041_e82_400bps_fast_g632 R10.4.1 E8.2 v6.3.2 FAST v3.5.2 FAST
r1041_e82_260bps_sup_v400 R10.4.1 E8.2
v4.0.0 SUP
r1041_e82_260bps_hac_v400 R10.4.1 E8.2
v4.0.0 HAC
r1041_e82_260bps_fast_g632 R10.4.1 E8.2 v6.3.2 FAST v3.5.2 FAST
r104_e81_sup_g5015 R10.4 E8.1 v5.0.15 SUP
r104_e81_hac_g5015 R10.4 E8.1 v5.0.15 HAC

Deprecated:

Config Chemistry Guppy basecaller Dorado basecaller model
r1041_e82_400bps_sup_g615 R10.4.1 E8.2 v6.1.5 SUP v3.5.2 SUP
r1041_e82_400bps_hac_g632 R10.4.1 E8.2 v6.3.2 HAC v3.5.2 HAC
r1041_e82_400bps_hac_g615 R10.4.1 E8.2 v6.1.5 HAC
r1041_e82_400bps_fast_g615 R10.4.1 E8.2 v6.1.5 FAST
r1041_e82_260bps_sup_g632 R10.4.1 E8.2 v6.3.2 SUP v3.5.2 SUP
r1041_e82_260bps_hac_g632 R10.4.1 E8.2 v6.3.2 HAC v3.5.2 HAC
#  Download all models
python3 download_model.py --clair3
#  Download particular model
python3 download_model.py --clair3 clair3_models/{config}_model

Each model will be downloaded to the folder clair3_models/{config}.


Remora Models

Most Remora models are supplied along with the Remora repository, but models with less validation intended for research purposes will be released in Rerio.

Config DNA/RNA Pore Device Tested Guppy Notes
5mC_all_context_sup_r1041_e82 DNA R10.4.1 Any v6.1.2 5mC in all context (with SUP basecaller)
#  Download all models
python3 download_model.py --remora
#  Download particular model
python3 download_model.py --remora remora_models/5mC_all_context_sup_r1041_e82

Each model will be downloaded to remora_models/{config}.onnx.

These models can be supplied directly to Bonito via the --modified-base-model argument and to Megalodon via the --remora-model argument.


Help

Licence and Copyright

© 2020, 2021 Oxford Nanopore Technologies Ltd.

Rerio is distributed under the terms of the Oxford Nanopore Technologies, Ltd. Public License, v. 1.0. If a copy of the License was not distributed with this file, You can obtain one at http://nanoporetech.com

Research Release

Research releases are provided as technology demonstrators to provide early access to features or stimulate Community development of tools. Support for this software will be minimal and is only provided directly by the developers. Feature requests, improvements, and discussions are welcome and can be implemented by forking and pull requests. However much as we would like to rectify every issue and piece of feedback users may have, the developers may have limited resource for support of this software. Research releases may be unstable and subject to rapid iteration by Oxford Nanopore Technologies.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].