All Projects → stemangiola → bioc_2020_tidytranscriptomics

stemangiola / bioc_2020_tidytranscriptomics

Licence: other
Workshop on tidytranscriptomics: Performing tidy transcriptomics analyses with tidybulk, tidyverse and tidyheatmap

Programming Languages

r
7636 projects
Dockerfile
14818 projects

Projects that are alternatives of or similar to bioc 2020 tidytranscriptomics

R-data-wrangling
Materials for my my R data workshop. https://cengel.github.io/R-data-wrangling/
Stars: ✭ 17 (-32%)
Mutual labels:  workshop, tidyverse
DEGreport
Create a cromphensive report of DEG list coming from any analysis of RNAseq data
Stars: ✭ 18 (-28%)
Mutual labels:  bioconductor, differential-expression
diffxpy
Differential expression analysis for single-cell RNA-seq data.
Stars: ✭ 137 (+448%)
Mutual labels:  transcriptomics, differential-expression
ideal
Interactive Differential Expression AnaLysis - DE made accessible and reproducible
Stars: ✭ 24 (-4%)
Mutual labels:  bioconductor, differential-expression
Nanny
A tidyverse suite for (pre-) machine-learning: cluster, PCA, permute, impute, rotate, redundancy, triangular, smart-subset, abundant and variable features.
Stars: ✭ 17 (-32%)
Mutual labels:  tidyverse, pca
tidysq
tidy processing of biological sequences in R
Stars: ✭ 29 (+16%)
Mutual labels:  tidyverse, bioconductor
GeneTonic
Enjoy your transcriptomic data and analysis responsibly - like sipping a cocktail
Stars: ✭ 66 (+164%)
Mutual labels:  bioconductor, transcriptomics
Tidyheatmap
Draw heatmap simply using a tidy data frame
Stars: ✭ 151 (+504%)
Mutual labels:  heatmap, tidyverse
2017-summer-workshop
Exercises, data, and more for our 2017 summer workshop (funded by the Estes Fund and in partnership with Project Jupyter and Berkeley's D-Lab)
Stars: ✭ 33 (+32%)
Mutual labels:  workshop, tidyverse
node-express-reddit-clone
Build a Node, Express and MySQL-based clone of Reddit for DecodeMTL web development bootcamp
Stars: ✭ 28 (+12%)
Mutual labels:  workshop
gis-for-geoscientists
Repository for "GIS for Geoscientists" workshop series. This repo contains data, protocols, outputs, lectures, and resources used the workshop. Course taught by Nicholas Barber. Available for future booking upon request! Contact me ([email protected]) for a quote.
Stars: ✭ 19 (-24%)
Mutual labels:  workshop
pythonworkshops
Materials for Python workshops
Stars: ✭ 28 (+12%)
Mutual labels:  workshop
touchdesigner-summit-2019-external-python
No description or website provided.
Stars: ✭ 35 (+40%)
Mutual labels:  workshop
eeguana
A package for manipulating EEG data in R.
Stars: ✭ 16 (-36%)
Mutual labels:  tidyverse
lambdajam
Workshop on Program Transformations
Stars: ✭ 22 (-12%)
Mutual labels:  workshop
recount
R package for the recount2 project. Documentation website: http://leekgroup.github.io/recount/
Stars: ✭ 40 (+60%)
Mutual labels:  bioconductor
mds-provider-services
Services for working with MDS Provider data, built as runnable Docker containers.
Stars: ✭ 15 (-40%)
Mutual labels:  mds
influxdbr
R Interface for InfluxDB
Stars: ✭ 95 (+280%)
Mutual labels:  tidyverse
BiocPkgTools
Computable build reports, package metadata, and download stats from the Bioconductor project
Stars: ✭ 20 (-20%)
Mutual labels:  bioconductor
biomisc R
command line bioinformatic scripts written in R
Stars: ✭ 20 (-20%)
Mutual labels:  differential-expression

DOI .github/workflows/basic_checks.yaml Docker

A Tidy Transcriptomics introduction to RNA sequencing analyses

bioc2020 tidybulk

Instructor names and contact information

  • Maria Doyle <Maria.Doyle at petermac.org>
  • Stefano Mangiola <mangiola.s at wehi.edu.au>

Syllabus

Material webpage.

Video recording of the workshop.

This material was created for a Bioc2020 conference workshop but it can also be used for self-learning.

More details on the workshop are below.

Workshop package installation

This is necessary in order to reproduce the code shown in the workshop. The workshop is designed for R 4.0 and packages from the 3.12 devel branch of Bioconductor. It can be installed using one of the two ways below.

Via Docker image

If you're familiar with Docker you could use the Docker image which has all the software pre-configured to the correct versions.

docker run -e PASSWORD=abc -p 8787:8787 stemangiola/bioc_2020_tidytranscriptomics:bioc2020

Once running, navigate to http://localhost:8787/ and then login with Username:rstudio and Password:abc.

You should see the Rmarkdown file with all the workshop code which you can run.

Via GitHub

Alternatively, you could install the workshop using the commands below in R 4.0.

devtools::install_github("stemangiola/[email protected]")
devtools::install_github("stemangiola/bioc_2020_tidytranscriptomics", build_vignettes = TRUE)
library(bioc2020tidytranscriptomics)
browseVignettes("bioc2020tidytranscriptomics")

To run the code, you could then copy and paste the code from the workshop R markdown file into a new R Markdown file on your computer.

Workshop Description

This workshop will present how to perform analysis of RNA sequencing data following the tidy data paradigm. The tidy data paradigm provides a standard way to organise data values within a dataset, where each variable is a column, each observation is a row, and data is manipulated using an easy-to-understand vocabulary. Most importantly, the data structure remains consistent across manipulation and analysis functions.

This can be achieved for RNA sequencing data with the tidybulk, tidyHeatmap and tidyverse packages. The tidybulk package provides a tidy data structure and a modular framework for bulk transcriptional analyses. tidyHeatmap provides a tidy implementation of ComplexHeatmap. These packages are part of the tidytranscriptomics suite that introduces a tidy approach to RNA sequencing data.

The topics presented in this workshop will be

  • Data exploration
  • Data dimensionality reduction and clustering
  • Differential gene expression analysis
  • Data visualisation

Pre-requisites

  • Basic knowledge of RStudio
  • Familiarity with tidyverse syntax

Recommended Background Reading Introduction to R for Biologists

Workshop Participation

The workshop format is a 55 min session consisting of a 30 min demo followed by 25 min opportunity for attendees to try out the code, exercises and Q&A.

R / Bioconductor packages used

  • tidyverse
  • tidybulk
  • tidyHeatmap
  • edgeR
  • ggrepel
  • airway

Time outline

Activity Time
Demo 30m
Introduction and Data preprocessing
Data dimensionality reduction and clustering
Differential gene expression
Data visualisation
Try out code, Exercises, Q&A 25m

Workshop goals and objectives

In exploring and analysing RNA sequencing data, there are a number of key concepts, such as filtering, scaling, dimensionality reduction, hypothesis testing, clustering and visualisation, that need to be understood. These concepts can be intuitively explained to new users, however, (i) the use of a heterogeneous vocabulary and jargon by methodologies/algorithms/packages, (ii) the complexity of data wrangling, and (iii) the coding burden, impede effective learning of the statistics and biology underlying an informed RNA sequencing analysis.

The tidytranscriptomics approach to RNA sequencing data analysis abstracts out the coding-related complexity and provides tools that use an intuitive and jargon-free vocabulary, enabling focus on the statistical and biological challenges.

Learning goals

  • To understand the key concepts and steps of bulk RNA sequencing data analysis
  • To approach data representation and analysis though a tidy data paradigm, integrating tidyverse with tidybulk and tidyHeatmap.

Learning objectives

  • Recall the key concepts of RNA sequencing data analysis
  • Apply the concepts to publicly available data
  • Create plots that summarise the information content of the data and analysis results
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].