Data wrangling with tidyverse
Some introductionary information about tidyverse
Real life datasources seldom provide data in exactly the format you need for the analysis. Hence most of the time you need to manipulate the data after reading it into R. There are several ways to do this, each with their pros and cons. We highly recommend the tidyverse
collection of packages. The command library(tidyverse)
will actually load the following packages: ggplot2
, dplyr
, tidyr
, readr
, purrr
, tibble
, stringr
and forecats
.
Where to find good information on these packages:
- official tidyverse website
- the R for data science book (R4DS) by Garrett Grolemund and Hadley Wickham. Note that this book is freely available online. A printed version is available at the INBO library.
- video tutorials:
- Data wrangling with R and RStudio: a good introduction on
dplyr
andtidyr
by Garrett Grolemund dplyr
tutorial at useR!2014 by Hadley Wickham (video part 1 and part 2)- tidyverse, visualization, and manipulation basics: a high-level overview of
tidyverse
by Garrett Grolemund
- Data wrangling with R and RStudio: a good introduction on
- Data Transformation Cheat Sheet: a two page document which covers the most important function for
dplyr