All Projects → cwickham → r_intro_bc_stats

cwickham / r_intro_bc_stats

Licence: other
An Introduction to R and RStudio with the tidyverse delivered at BC Stats

Programming Languages

TeX
3793 projects

Projects that are alternatives of or similar to r intro bc stats

training-microservices
Node.js Microservices training
Stars: ✭ 50 (+61.29%)
Mutual labels:  training
nntrainer
NNtrainer is Software Framework for Training Neural Network Models on Devices.
Stars: ✭ 92 (+196.77%)
Mutual labels:  training
kaldi ag training
Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-54.84%)
Mutual labels:  training
responsivebootstrap
This is the repository for my course, Bootstrap Layouts: Responsive Single-Page Design on LinkedIn Learning and Lynda.com.
Stars: ✭ 49 (+58.06%)
Mutual labels:  training
kedro-training
Find documentation and a template project for delivering Kedro training.
Stars: ✭ 26 (-16.13%)
Mutual labels:  training
carte-uso
Cartea "Utilizarea sistemelor de operare"
Stars: ✭ 18 (-41.94%)
Mutual labels:  training
hello-kubecon
A Charmed Operator demonstration for Operator Day 2021, hosted by Canonical
Stars: ✭ 14 (-54.84%)
Mutual labels:  training
ansible-traininglab
Dockerized Ansible Training Lab
Stars: ✭ 16 (-48.39%)
Mutual labels:  training
formations
Supports de cours des formations OpenStack et conteneurs de la société alter way
Stars: ✭ 43 (+38.71%)
Mutual labels:  training
gruntworkflows
Repository for my tutorial course: Grunt.js Web Workflows on LinkedIn Learning and Lynda.com.
Stars: ✭ 28 (-9.68%)
Mutual labels:  training
chainer-fcis
[This project has moved to ChainerCV] Chainer Implementation of Fully Convolutional Instance-aware Semantic Segmentation
Stars: ✭ 45 (+45.16%)
Mutual labels:  training
Python-Studies
All studies about python
Stars: ✭ 56 (+80.65%)
Mutual labels:  training
admin-training
Galaxy Admin Training
Stars: ✭ 55 (+77.42%)
Mutual labels:  training
Korean-OCR-Model-Design-based-on-Keras-CNN
Korean OCR Model Design(한글 OCR 모델 설계)
Stars: ✭ 34 (+9.68%)
Mutual labels:  training
gitworkshop
Git Workshop covering git essentials & advanced topics
Stars: ✭ 15 (-51.61%)
Mutual labels:  training
HPC
A collection of various resources, examples, and executables for the general NREL HPC user community's benefit. Use the following website for accessing documentation.
Stars: ✭ 64 (+106.45%)
Mutual labels:  training
npmtooling
This is the repository for my course, Tooling with NPM Scripts on LinkedIn Learning and Lynda.com.
Stars: ✭ 13 (-58.06%)
Mutual labels:  training
angular
Repository for my tutorial course: Learning AngularJS on LinkedIn Learning and Lynda.com. http://raybo.org/angular
Stars: ✭ 79 (+154.84%)
Mutual labels:  training
neutronics-workshop
A workshop covering a range of fusion relevant analysis and simulations with OpenMC, DAGMC, Paramak and other open source fusion neutronics tools
Stars: ✭ 29 (-6.45%)
Mutual labels:  training
KataTODOApiClientKotlin
TODO API Client Kata for Kotlin Developers. The main goal is to practice integration testing using MockWebServer
Stars: ✭ 59 (+90.32%)
Mutual labels:  training

This is the repo for an introduction to R using the tidyverse developed for BC Stats.

The repo currently reflects a 1 day version to be delivered Oct 2019. If you are looking for an older version, try:

Solutions to “Your Turns” available at r_intro_bc_stats_solutions.

License

Creative Commons License
Introduction to data science with the tidyverse by Charlotte Wickham is licensed under a Creative Commons Attribution 4.0 International License. Based on a work at https://github.com/rstudio/master-the-tidyverse.

Overview

This course will give you a feel for the complete data analysis process in R - from importing and manipulating data through visualization. You'll see how using code to capture the analysis pipeline leads to deliverables that are documented, easily reproduced and easily automated.

We'll focus on tools in the tidyverse a core set of R packages that are designed to be easy to learn, easy to use, and solve the most frequent data analysis problems.

During the course, we'll alternate between me introducing a new concept with some examples, and you applying that concept on your own. You should expect to spend at least 50% of your time writing code in RStudio on your own laptop.

The first half day is specifically for those who are new to R. Take a look at the prerequisites to see if you might be able to skip it.

Schedule

Session

Date/Time

Topic

Day 1: afternoon

Tue Jul 24th 1pm-4:30pm

Getting Started with R and RStudio

Day 2: morning

Wed Jul 25th 8:30am-12pm

Data Visualization with ggplot2

Day 2: afternoon

Wed Jul 25th 1pm-4:30pm

Data Manipulation with dplyr and tidyr

Day 1 - Getting Started with R and RStudio

On your first afternoon you'll focus on getting comfortable writing code and executing it in RStudio. We'll take things slow as you learn to navigate RStudio, learn some syntax rules, and how to get help when you get stuck. Along the way you'll meet R's most ubiquitous objects for holding data and learn to import data whether it is a CSV, SPSS or Excel data file.

By the end of the day you will be able to:

  • Open a notebook in RStudio and execute the code chunks in it

  • Install and load an R package

  • Open the help page for a function or built-in dataset

  • Identify the components of an R function: the function name and arguments

  • Assign the results of a function to a new variable

  • Get an overview of a dataset that is in a data frame or tibble

  • Import CSV, SPSS and SAS data files

Day 2 - Visualization and Manipulation of Data

We'll start the day with visualization of data in R using the package ggplot2. You'll see how ggplot2 provides a framework for thinking about plots, which means you only need to learn one template to make almost any plot you can imagine. To practice, you'll make some of the most common kinds of data visualizations: histograms, scatterplots and time series plots, and continue building your skills as we continue through data manipulation.

In the afternoon we'll focus of the most common types of data manipulation: extracting subsets from data, adding new variables and creating grouped summaries. You'll find that doing this is quite intuitive using the dplyr package which boils down manipulation into a set of verbs like: filter(), mutate() and summarise(). Occasionally, data won't come in quite the right shape for manipulation or visualization you want to do, so we'll also talk about the key parts of the tidyr package that help to reshape not not-so-tidy data.

By the end of the day you will be able to:

  • Create plots in ggplot2 to explore data

  • Select variables and filter observations to subset data

  • Add new variables, and transform variables

  • Create grouped summaries of data

  • Reshape data for use with tidy tools

Prerequisites

The first half-day is specifically for people that are new to R. You can safely join us starting on day 2 if you already:

  • know how to define variables in R

  • have called a few basic functions (e.g. mean()), and

  • know how to open .R script files, and run code in the console

Although I'll assume on the first half-day you haven't used R, you might like to get a little experience before we meet. Some options are:

Software Requirements

You'll need to bring a laptop with R and RStudio installed. In addition, you'll want to install the following packages:

install.packages(c("tidyverse", "rmarkdown", "gapminder", "usethis"))

If you've installed the tidyverse before, re-installing it may not update all the component packages, in which case run,

tidyverse::tidyverse_update()

to identify any out-of-date packages, and follow the instructions to update them.

Don't forget to bring your power cable!

I'll also be providing some additional materials (slides, code and data) prior to our meeting, keep your eyes out for an email about soon.

Getting the materials

To download the materials, open RStudio and on the Console run:

usethis::use_course("bit.ly/rintro-bc")

After a pause, you'll be asked a few questions about the download process. The materials will be downloaded, unzipped to your Desktop, and a new RStudio session will open.

You only need to do this once. If you close RStudio, and want to pick it up again later (i.e. on the day of the training):

  1. Navigate to where the folder was downloaded (this should be a folder called r_intro_bc_stats-master on your Desktop)

  2. Double-click the file r_intro_bc_stats.Rproj to open up RStudio

rstudio.cloud

If you have a good internet connection, an alternative to installing the pacakges, and getting the materials, locally is to use https://rstudio.cloud/project/49127

You may need to sign up for an account (free) first.

Instructor Info

Charlotte Wickham

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].