underthecurve / R Data Cleaning Tricks
Data Cleaning Tricks in R for Boston University's "Storytelling with Data" workshop
Stars: ✭ 58
Programming Languages
r
7636 projects
Labels
Projects that are alternatives of or similar to R Data Cleaning Tricks
Taskscheduler
Schedule R scripts/processes with the Windows task scheduler.
Stars: ✭ 270 (+365.52%)
Mutual labels: rstudio
Moderndive book
Statistical Inference via Data Science: A ModernDive into R and the Tidyverse
Stars: ✭ 527 (+808.62%)
Mutual labels: rstudio
Docker Images
Out-of-box Data Science / AI platform | AI/数据科学的瑞士军刀
Stars: ✭ 25 (-56.9%)
Mutual labels: rstudio
Posterdown
Use RMarkdown to generate PDF Conference Posters via HTML
Stars: ✭ 602 (+937.93%)
Mutual labels: rstudio
rgdax
Wrapper for Coinbase pro (erstwhile GDAX) Cryptocurrency exchange
Stars: ✭ 34 (-41.38%)
Mutual labels: rstudio
Nanny
A tidyverse suite for (pre-) machine-learning: cluster, PCA, permute, impute, rotate, redundancy, triangular, smart-subset, abundant and variable features.
Stars: ✭ 17 (-70.69%)
Mutual labels: rstudio
Summarytools
R Package to Quickly and Neatly Summarize Data
Stars: ✭ 390 (+572.41%)
Mutual labels: rstudio
Jupytext
Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts
Stars: ✭ 4,969 (+8467.24%)
Mutual labels: rstudio
Wowchemy Hugo Modules
🔥 Hugo website builder, Hugo themes & Hugo CMS. No code, build with widgets! 创建在线课程,学术简历或初创网站。
Stars: ✭ 6,093 (+10405.17%)
Mutual labels: rstudio
Course Starter R
👩🏫🇷 Starter repo for building interactive R courses
Stars: ✭ 281 (+384.48%)
Mutual labels: rstudio
R Dataviz Ggplot2
"Basic data viz" & "Advancing with data viz in R using ggplot2" for Boston University's "Data+Narrative" workshop
Stars: ✭ 21 (-63.79%)
Mutual labels: rstudio
rfordatasciencewiki
Resources for the R4DS Online Learning Community, including answer keys to the text
Stars: ✭ 40 (-31.03%)
Mutual labels: rstudio
Rstudioconf tweets
🖥 A repository for tracking tweets about rstudio::conf
Stars: ✭ 32 (-44.83%)
Mutual labels: rstudio
Tricks for cleaning your data in R
Data + Code for "Tricks for cleaning your data in R" at the Storytelling with Data workshop at Boston University on Tuesday, June 6th 2017.
Equivalent materials for "Advancing with data visualization in R using ggplot2" available here.
Links to install R and RStudio
- R: website for the R software
- RStudio: website for RStudio, a powerful graphical user interface for R
Files included
Annotated code and step-by step instructions for the workshop
- R-datacleaning-tricks.md: Markdown file (for viewing on the web)
- R-datacleaning-tricks.pdf: PDF file (for printing out)
R code
- R-datacleaning-tricks.R: the R code, which can be run in RStudio
Underlying data needed to run the R code
- employee-earnings-report-2016.csv: data on earnings for Boston's municipal employees, from the city's open data portal
- unemployment.xlsx: data on global unemployment rates from 2012 to 2016, from the International Monetary Fund
- attendees.csv: data on some attendees of this workshop, with names and identifying information removed
How to follow this workshop
- You can clone or download this repository by clicking on the green button above, "Clone or download"
- Open the
.R
file in RStudio - Follow along by reading the
.md
file online or printing the.pdf
file out by clicking the Github links above
Questions / Feedback?
ychristinezhang at gmail dot com
or on Twitter
This work is licensed under a Creative Commons Attribution 4.0 International License.
Note that the project description data, including the texts, logos, images, and/or trademarks,
for each open source project belongs to its rightful owner.
If you wish to add or remove any projects, please contact us at [email protected].