All Projects → Phillip-a-richmond → GenomeAnalysisModule

Phillip-a-richmond / GenomeAnalysisModule

Licence: other
Welcome to the website and github repository for the Genome Analysis Module. This website will guide the learning experience for trainees in the UBC MSc Genetic Counselling Training Program, as they embark on a journey to learn about analyzing genomes.

Programming Languages

HTML
75241 projects
shell
77523 projects

Projects that are alternatives of or similar to GenomeAnalysisModule

Pygeno
Personalized Genomics and Proteomics. Main diet: Ensembl, side dishes: SNPs
Stars: ✭ 261 (+1273.68%)
Mutual labels:  bioinformatics, genomics, genome
catch
A package for designing compact and comprehensive capture probe sets.
Stars: ✭ 55 (+189.47%)
Mutual labels:  bioinformatics, genomics, genome
companion
This repository has been archived, currently maintained version is at https://github.com/iii-companion/companion
Stars: ✭ 21 (+10.53%)
Mutual labels:  bioinformatics, genomics, genome
Gatk
Official code repository for GATK versions 4 and up
Stars: ✭ 1,002 (+5173.68%)
Mutual labels:  bioinformatics, genomics, genome
Ribbon
A genome browser that shows long reads and complex variants better
Stars: ✭ 184 (+868.42%)
Mutual labels:  bioinformatics, genomics, genome
Deepvariant
DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data.
Stars: ✭ 2,404 (+12552.63%)
Mutual labels:  bioinformatics, genomics, genome
Genometools
GenomeTools genome analysis system.
Stars: ✭ 186 (+878.95%)
Mutual labels:  bioinformatics, genomics, genome
Scaff10X
Pipeline for scaffolding and breaking a genome assembly using 10x genomics linked-reads
Stars: ✭ 21 (+10.53%)
Mutual labels:  bioinformatics, genomics, genome
dna-traits
A fast 23andMe genome text file parser, now superseded by arv
Stars: ✭ 64 (+236.84%)
Mutual labels:  bioinformatics, genomics
staramr
Scans genome contigs against the ResFinder, PlasmidFinder, and PointFinder databases.
Stars: ✭ 52 (+173.68%)
Mutual labels:  bioinformatics, genomics
bacnet
BACNET is a Java based platform to develop website for multi-omics analysis
Stars: ✭ 12 (-36.84%)
Mutual labels:  bioinformatics, genomics
calN50
Compute N50/NG50 and auN/auNG
Stars: ✭ 20 (+5.26%)
Mutual labels:  bioinformatics, genomics
GenomicDataCommons
Provide R access to the NCI Genomic Data Commons portal.
Stars: ✭ 64 (+236.84%)
Mutual labels:  bioinformatics, genomics
saffrontree
SaffronTree: Reference free rapid phylogenetic tree construction from raw read data
Stars: ✭ 17 (-10.53%)
Mutual labels:  bioinformatics, genomics
ntHash
Fast hash function for DNA sequences
Stars: ✭ 66 (+247.37%)
Mutual labels:  bioinformatics, genomics
bystro
Bystro genetic analysis (annotation, filtering, statistics)
Stars: ✭ 31 (+63.16%)
Mutual labels:  bioinformatics, genomics
plasmidtron
Assembling the cause of phenotypes and genotypes from NGS data
Stars: ✭ 27 (+42.11%)
Mutual labels:  bioinformatics, genomics
full spectrum bioinformatics
An open-access bioinformatics text
Stars: ✭ 26 (+36.84%)
Mutual labels:  bioinformatics, genomics
netSmooth
netSmooth: A Network smoothing based method for Single Cell RNA-seq imputation
Stars: ✭ 23 (+21.05%)
Mutual labels:  bioinformatics, genomics
chromap
Fast alignment and preprocessing of chromatin profiles
Stars: ✭ 93 (+389.47%)
Mutual labels:  bioinformatics, genomics

Genome Analysis Module

This course is intended for trainees in the Genetic Counselling MSc Program at the University of British Columbia, and summer students within the Summer Student Research Program at BC Children's Hospital Research Institute. The computing resources are hosted at BC Children's Hospital Research Institute.
For more information contact Dr. Phillip Richmond ([email protected]). If you are looking for content related to the manuscript, please see Release 1. This version: November 2021


General Course Information & Prerequisites

Course Format

The course will be designed in a flexible format, adaptable for teaching online, or in person, or a combination of both.

Prerequisites

Setting up the working environment.

  • Content
    • Installing necessary software (see below)
    • Connecting to the cluster
    • Making a workspace
  • Slides: Link

Software Installs

  • For mac/linux users, only need native terminal which comes with the operating system. You can access it in the Applications section for Mac users. You also need to have X windows installed Link
  • For PC users, download and install MobaXterm
  • IGV installed on local machine
  • Filezilla (or other file transfer client) installed on local machine
  • BCCHR VPN

Resources

To-Do before the first class

Complete the Prerequisites (above). If you have any challenges, please contact Dr. Phillip Richmond ([email protected]).


Course Syllabus

Prerequisite – to be done before Session 1 (15 minutes):

Setting up the working environment

  • Installing secure shell and sshfs
  • Connecting to the cluster
  • Making a workspace
  • Link

Monday November 1st, 2021 (1:00-3:00) - Intro to course, Intro to Genome Analysis for Rare Disease, Case Distribution

  • Soft-start (15 min: 1:00-1:10)
  • Course Introduction (Lecture, 75 min: 1:10-2:25)
    • Link
    • Goals of the course
    • Format of the course
    • Genome Analysis for Rare Disease Diagnosis
    • OMIM, ClinVar, HPO
    • Limitations of whole exome sequencing, short reads, gene discovery, where the field is going
  • Distribute cases & link to resources (Work-along, 15 min - 2:25-2:40)
    • Introduce concepts presented in the cases
  • Establish working environment (Work-along, 20 minutes - 2:40-3:00)
    • Install IGV
    • Login to cluster
    • Install SSHFS

Wednesday November 3rd, 2021 (1:00-3:00) - Intro to linux command line & file handling

  • Introduction to command line and HPC (Work-along, 45 min - 1:00-1:45)
    • Link
    • Key commands for navigating
      • ls, mkdir, cd, pwd
    • Key commands for file handling (& exposure to genomics file formats)
      • more, less, grep, head, cut, tail, wc, chmod
    • Fasta, Fastq, VCF
  • Introduction to file editing, bash scripts (Work-along, 30 min - 1:45-2:15)
    • Nano
    • Bash scripts
    • Variables
  • Problem Set 1 (45 min - 2:15-3:00) - basic linux navigation + file handling

Friday November 5th, 2021 (8:30-10:30) - Short read mapping & visualization (Command line/bash)

  • Problem Set 1 Review (20 min: 8:30-8:50)
  • Short read mapping & Visualization (Work-along, 60 min - 8:50-9:50)
    • Link
    • Pipeline Overview
    • Exploring command line tools and how to use them
    • Map short reads to reference genome with BWA mem
    • Convert, sort, index, with Samtools
  • Problem Set 2 (40 min: 9:50-10:30) - short read mapping with bash scripts

Monday November 8th, 2021 (1:00-3:00) - Process cases through pipeline using cluster

  • Problem Set 2 Review (30 min: 1:00-1:30)
  • Interacting with the cluster scheduler (Work-along, 45 min - 1:30-2:15)
    • Link
    • Concept of job scheduler on cluster
    • Basic job scripts
    • Mapping and visualization through cluster
  • Problem Set 3 (45 min - 2:15-3:00) - Case processing fastq-->BAM + Visualize

Wednesday November 10th, 2021 (1:00-3:00) - Variant calling and visualization

  • Problem Set 3 Review (30 min - 1:00-1:30)
  • Call variants with DeepTrio (Work-along, 45 min - 1:30-2:15)
    • Link
    • Call variants using pre-built DeepTrio scripts on pre-processed small BAM files
    • Visualize variants in IGV
    • Load GRCh38 genome
    • Load read files
    • Zoom to region with read coverage
    • Take snapshot of variant
  • Problem Set 4 (45 min - 2:15-3:00) - Call variants on samples

Friday November 12th, 2021 (8:30-10:30) - Variant annotation and interpretation

  • Extra time for Problem Set 4 (60 min - 8:30-9:30)
  • Open discussion/review of sessions 1-5
  • Ensure that everyone has mapped, called variants, and visualized their case data

Monday November 15th, 2021 (1:00-3:00) - Variant prioritization with Exomiser

  • Running Exomiser on a VCF (Work-along, 45 min - 1:00-1:45)
    • Link
    • Run exomiser using pre-built configs and scripts on Case 6
  • Interpreting Exomiser output in excel/html (Work-along, 45 min - 1:45-2:30)
    • Download exomiser output in TSV format, open and explore tabs in excel
    • Visualize output HTML file
  • Problem Set 5 (30 min - 2:30-3:00) - Run Exomiser on your case data

Wednesday November 17th, 2021 (1:00-3:00) - Catch-up & open help session

  • Review Problem Set 5, address any issues, make sure everyone has their case exomiser output. (60 min - 1:00-2:00)
  • Open help session, working on Practical (60 min - 2:00-3:00)

Friday November 19th, 2021 (8:30-10:30) - Practical

  • Practical (120 min - 8:30-10:30)
    • Each student presents their case to the group, 8 students ~15 minutes / presentation (10 min present, 5 min questions).
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].