Description
EmbRaceR: Data Preparation with R
In the third course of the EmbRaceR collection, the most tedious task in a project, the data preparation task, in covered. Attendees learn about many of the basic and also more advanced techniques, and why and how to use them.
The first module teaches you how to manipulate the data frames. You learn how to make a projection, how to filter, join, bind, and sort data frames. Then the module deals with missing values and shows how to crate derived variables.
Package dplyr is bringing a modern way for data manipulation. You will learn how to use it successfully. Then the module shows how to smooth and normalize data. The last lesson of the second module is dealing with numerous ways of aggregating, pivoting, and unpivoting data in R.
In this course, you learn how to create numerical variables from strings, and how to create discrete variables from numerical ones. Data preparation interleaves with data overview. In the last lesson of this course, the most popular graphical function ggplot is introduced.
Link to the agenda and sample videos from the course
Length: 1 Hour and 41 Minutes
Instructor
Dejan Sarka
Dejan Sarka, MCT and Data Platform MVP, is an independent trainer and consultant that focuses on the development of database and business intelligence applications. Besides projects, he spends about half of the time on training and mentoring. He is the founder of the Slovenian SQL Server and .NET Users Group. Dejan Sarka is the main author or co-author of eighteen books about databases and SQL Server. Dejan Sarka has also developed many courses and seminars for Microsoft, SolidQ, and Pluralsight.
Reviews
There are no reviews yet.