The third course of the EmbRaceR collection covers data preparation, often the most tedious task in a project. Attendees learn many basic and more advanced techniques, along with why and how to use them.
The first module teaches you how to manipulate data frames. You learn how to project, filter, join, bind, and sort data frames. The module then deals with missing values and shows how to create derived variables.
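As a rough illustration of the operations named above, here is a minimal base R sketch. The data frames `customers` and `orders`, their columns, and the mean-imputation rule are assumptions made up for this example and are not taken from the course material.

```r
# Hypothetical sample data; table and column names are illustrative only.
customers <- data.frame(
  id   = c(1, 2, 3, 4),
  name = c("Ann", "Bob", "Cid", "Dee"),
  age  = c(34, NA, 51, 28),
  stringsAsFactors = FALSE
)
orders <- data.frame(
  id     = c(1, 1, 3, 4),
  amount = c(120, 80, 200, 50)
)

# Projection: keep only selected columns.
projected <- customers[, c("id", "age")]

# Filter: keep rows that satisfy a condition (which() drops NA comparisons).
over_30 <- customers[which(customers$age > 30), ]

# Join: inner join on the shared key column.
joined <- merge(customers, orders, by = "id")

# Bind: append rows that have the same columns.
more_customers <- data.frame(id = 5, name = "Eve", age = 45,
                             stringsAsFactors = FALSE)
all_customers <- rbind(customers, more_customers)

# Sort: order rows by age, descending; NA values are placed last by default.
sorted <- all_customers[order(all_customers$age, decreasing = TRUE), ]

# Missing values: replace NA with the mean of the observed ages
# (one possible strategy among many, assumed here for illustration).
mean_age <- mean(all_customers$age, na.rm = TRUE)
all_customers$age_filled <- ifelse(is.na(all_customers$age),
                                   mean_age, all_customers$age)

# Derived variable: a flag computed from an existing column.
all_customers$is_senior <- all_customers$age_filled >= 50
```

Mean imputation is only one way to handle missing values; dropping incomplete rows with `na.omit()` is another common choice.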
The dplyr package brings a modern approach to data manipulation, and you learn how to use it effectively. The module then shows how to smooth and normalize data. The last lesson of the second module deals with the numerous ways of aggregating, pivoting, and unpivoting data in R.
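The following sketch shows what normalization and aggregation might look like with dplyr. The `sales` data frame and its columns are hypothetical, and min-max scaling is just one normalization method picked for the example; the course may use others.

```r
library(dplyr)

# Hypothetical sales data; names are illustrative only.
sales <- data.frame(
  region = c("North", "North", "South", "South", "South"),
  amount = c(100, 300, 200, 400, 600)
)

# Normalization: min-max scaling of amount into the range [0, 1].
normalized <- sales %>%
  mutate(amount_scaled = (amount - min(amount)) / (max(amount) - min(amount)))

# Aggregation: one row per region with a count and a mean,
# sorted by the mean in descending order.
by_region <- sales %>%
  group_by(region) %>%
  summarise(n_sales = n(), mean_amount = mean(amount)) %>%
  arrange(desc(mean_amount))
```

Pivoting and unpivoting are not shown here; in the tidyverse they are typically done with the separate tidyr package rather than dplyr itself.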
In this course, you also learn how to create numerical variables from strings and how to create discrete variables from numerical ones. Data preparation is interleaved with data overview. The last lesson of the course introduces ggplot, the most popular graphics function in R, from the ggplot2 package.
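The two conversions mentioned above can be sketched in base R as follows. The vector `income_text`, the bin boundaries, and the class labels are assumptions chosen for illustration only.

```r
# Hypothetical raw values read as text; "n/a" marks a missing entry.
income_text <- c("1200", "3500", "n/a", "8000", "500")

# Strings to numbers: entries that cannot be parsed become NA
# (the coercion warning is suppressed on purpose here).
income <- suppressWarnings(as.numeric(income_text))

# Numbers to discrete classes: bin with explicit breaks and labels.
# Intervals are right-closed by default: (0, 1000], (1000, 5000], (5000, Inf].
income_class <- cut(
  income,
  breaks = c(0, 1000, 5000, Inf),
  labels = c("low", "medium", "high")
)

# Quick overview of the result, including missing values.
class_counts <- table(income_class, useNA = "ifany")
```

Printing `class_counts` gives a simple frequency overview, which is the kind of check that naturally alternates with preparation steps.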