All Projects → yjx1217 → LRSDAY

yjx1217 / LRSDAY

Licence: MIT license
LRSDAY: Long-read Sequencing Data Analysis for Yeasts

Programming Languages

perl
6916 projects
shell
77523 projects
Makefile
30231 projects
c
50402 projects - #5 most used programming language
Visual Basic 6.0
7 projects
python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to LRSDAY

GCModeller
GCModeller: genomics CAD(Computer Assistant Design) Modeller system in .NET language
Stars: ✭ 25 (-3.85%)
Mutual labels:  genomics, genome-annotation
CAMSA
CAMSA: a tool for Comparative Analysis and Merging of Scaffold Assemblies
Stars: ✭ 18 (-30.77%)
Mutual labels:  genomics, genome-assembly
indelope
find large indels (in the blind spot between GATK/freebayes and SV callers)
Stars: ✭ 38 (+46.15%)
Mutual labels:  genomics, genome-assembly
instaGRAAL
Large genome reassembly based on Hi-C data, continuation of GRAAL
Stars: ✭ 32 (+23.08%)
Mutual labels:  genomics, genome-assembly
varsome-api-client-python
Example client programs for Saphetor's VarSome annotation API
Stars: ✭ 21 (-19.23%)
Mutual labels:  genomics, genome-annotation
redundans
Redundans is a pipeline that assists an assembly of heterozygous/polymorphic genomes.
Stars: ✭ 90 (+246.15%)
Mutual labels:  genomics, genome-assembly
wgs2ncbi
Toolkit for preparing genomes for submission to NCBI
Stars: ✭ 25 (-3.85%)
Mutual labels:  genomics, genome-annotation
haslr
A fast tool for hybrid genome assembly of long and short reads
Stars: ✭ 68 (+161.54%)
Mutual labels:  genomics, genome-assembly
mccortex
De novo genome assembly and multisample variant calling
Stars: ✭ 105 (+303.85%)
Mutual labels:  genomics, genome-assembly
EarlGrey
Earl Grey: A fully automated TE curation and annotation pipeline
Stars: ✭ 25 (-3.85%)
Mutual labels:  genomics, genome-annotation
LTR retriever
LTR_retriever is a highly accurate and sensitive program for identification of LTR retrotransposons; The LTR Assembly Index (LAI) is also included in this package.
Stars: ✭ 131 (+403.85%)
Mutual labels:  genome-annotation, genome-assembly
MGSE
Mapping-based Genome Size Estimation (MGSE) performs an estimation of a genome size based on a read mapping to an existing genome sequence assembly.
Stars: ✭ 22 (-15.38%)
Mutual labels:  genomics, genome-assembly
TOGA
TOGA (Tool to infer Orthologs from Genome Alignments): implements a novel paradigm to infer orthologous genes. TOGA integrates gene annotation, inferring orthologs and classifying genes as intact or lost.
Stars: ✭ 35 (+34.62%)
Mutual labels:  genomics, genome-annotation
tiptoft
Predict plasmids from uncorrected long read data
Stars: ✭ 27 (+3.85%)
Mutual labels:  genomics, oxford-nanopore
berokka
🍊 💫 Trim, circularise and orient long read bacterial genome assemblies
Stars: ✭ 23 (-11.54%)
Mutual labels:  genomics, genome-assembly
gawn
Genome Annotation Without Nightmares
Stars: ✭ 35 (+34.62%)
Mutual labels:  genomics, genome-annotation
assigner
Population assignment analysis using R
Stars: ✭ 17 (-34.62%)
Mutual labels:  genomics
biopython-coronavirus
Biopython Jupyter Notebook tutorial to characterize a small genome
Stars: ✭ 80 (+207.69%)
Mutual labels:  genomics
sequencework
programs and scripts, mainly python, for analyses related to nucleic or protein sequences
Stars: ✭ 22 (-15.38%)
Mutual labels:  genomics
aws-genomics-workflows
Genomics Workflows on AWS
Stars: ✭ 131 (+403.85%)
Mutual labels:  genomics

LRSDAY

LRSDAY logo

LRSDAY: Long-read Sequencing Data Analysis for Yeasts

A highly transparent, automated and powerful computational framework for high-quality genome assembly and annotation.

License: MIT

Description

Long-read sequencing technologies have become increasingly popular in genome projects due to their strengths in resolving complex genomic regions. As a leading model organism with small genome size and great biotechnological importance, the budding yeast, Saccharomyces cerevisiae, has many isolates currently being sequenced with long reads. However, analyzing long-read sequencing data to produce high-quality genome assembly and annotation remains challenging. Here we present LRSDAY, the first one-stop solution to streamline this process. LRSDAY can produce chromosome-level end-to-end genome assembly and comprehensive annotations for various genomic features (including centromeres, protein-coding genes, tRNAs, transposable elements and telomere-associated elements) that are ready for downstream analysis. Although tailored for S. cerevisiae, we designed LRSDAY to be highly modular and customizable, making it adaptable for virtually any eukaryotic organisms.

LRSDAY_flowchart

Citations

Jia-Xing Yue & Gianni Liti. (2018) Long-read sequencing data analysis for yeasts. Nature Protocols, 13:1213–1231.

Jia-Xing Yue, Jing Li, Louise Aigrain, Johan Hallin, Karl Persson, Karen Oliver, Anders Bergström, Paul Coupland, Jonas Warringer, Marco Cosentino Lagomarsino, Gilles Fischer, Richard Durbin, Gianni Liti. (2017) Contrasting evolutionary genome dynamics between domesticated and wild yeasts. Nature Genetics, 49:913-924.

Release history

  • v1.7.0 Released on 2022/12/31
  • v1.6.0 Released on 2019/10/03
  • v1.5.0 Released on 2019/05/13
  • v1.4.0 Released on 2019/03/21
  • v1.3.1 Released on 2019/01/22
  • v1.3.0 Released on 2018/11/13
  • v1.2.0 Released on 2018/10/15
  • v1.1.0 Released on 2018/07/11
  • v1.0.0 Released on 2018/02/04

License

LRSDAY itself is distributed under the MIT license. A number of LRSDAY's dependencies (e.g. CAP3, MAKER, GATK, blat, RepBase, etc) are under more restricted licenses, for which commerical use of the software needs to be discussed with the corresponding developers.

Requirements

Hardware, operating system and network

This protocol is designed for a desktop or computing server running an x86-64-bit Linux operating system. Multithreaded processors are preferred to speed up the process since many steps can be configured to use multiple threads in parallel. For assembling and analyzing the budding yeast genomes (genome size = ~12.5 Mb), at least 16 Gb of RAM and 100 Gb of free disk space are recomended. More disk space is needed if you want to use LRSDAY to do Nanopore basecalling. When adapted for other eukaryotic organisms with larger genome sizes, the RAM and disk space consumption will scale up, majorly during de novo genome assembly (performed by Canu by default. Plese refer to Canu’s manual for suggested RAM and disk space consumption for assembling large genomes. Stable Internet connection is required for the installation and configuration of LRSDAY as well as for retrieving the test data.

Software or library requirements

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].