Classbaze


R Data Pre-Processing & Data Management – Shape your Data!

Learn how to prepare your data for great analytics in R.
4.9/5 (614 reviews)
4,526 students
Created by R-Tutorials Training

Classbaze Grade®: 8.5
Freshness: 5.7
Popularity: 9.6
Material: 9.5

Platform: Udemy
Video: 6h 25m
Language: English
Next start: On Demand


Classbaze Rating

Classbaze Grade®

8.5 / 10

CourseMarks Score® helps students find the best classes. We aggregate 18 factors, including freshness, student feedback and content diversity.

Freshness

5.7 / 10
This course was last updated on 11/2018.

Course content can become outdated quite quickly. After analysing 71,530 courses, we found that the highest rated courses are updated every year. If a course has not been updated for more than 2 years, you should carefully evaluate the course before enrolling.

Popularity

9.6 / 10
We analyzed factors such as the rating (4.9/5) and the ratio between the number of reviews and the number of students, which is a great signal of student commitment.

New courses are hard to evaluate because they have few or no student ratings, but the Student Feedback Score helps you find great courses even with fewer reviews.

Material

9.5 / 10
Video Score: 8.5 / 10
The course includes 6h 25m of video content. Courses with more videos usually have a higher average rating. We have found that the sweet spot is 16 hours of video, which is long enough to teach a topic comprehensively, but not overwhelming. Courses with over 16 hours of video get the maximum score.
The average video length across 161 R courses on Udemy is 6 hours 00 minutes.
Detail Score: 10.0 / 10

Top online courses contain a detailed description of the course, of what you will learn, and of the instructor.

Extra Content Score: 9.9 / 10

Tests, exercises, articles and other resources help students deepen their understanding of the topic.

This course contains:

14 articles.
5 resources.
0 exercises.
0 tests.


About the course

Let’s get your data in shape!
Data Pre-Processing is the very first step in data analytics. You cannot escape it; it is too important. Unfortunately, this topic is widely overlooked and information is hard to find.
With this course I will change this!
Data Pre-Processing as taught in this course has the following steps:
1. Data import: this might sound trivial, but if you consider all the different data formats out there, you can imagine that this can be confusing. In the course we will take a look at a standard way of importing csv files, we will learn about the very fast fread function from data.table, and I will show you what you can do if you have more exotic file formats to handle.
2. Selecting the object class: a standard data.frame might be fine for easy standard tasks, but there are more advanced classes out there like the data.table. Especially with today's huge datasets, a data.frame might not be enough anymore. Alternatives will be demonstrated in this course.
3. Getting your data in a tidy form: a tidy dataset has 1 row for each observation and 1 column for each variable. This might sound trivial, but in your daily work you will find instances where this simple rule is not followed. Often you will not even notice that the dataset is not tidy in its layout. We will learn how tidyr can help you get your data into a clean and tidy format.
4. Querying and filtering: when you have a huge dataset, you need to filter for the desired parameters. We will learn how to combine parameters and implement advanced filtering methods. data.table in particular has proven effective for that sort of querying on huge datasets, so we will focus on this package in the querying section.
5. Data joins: when your data is spread over 2 different tables but you want to join them together based on given criteria, you will need joins. There are several methods of data joins in R, but here we will take a look at dplyr and its two-table verbs, which are a great tool for working with 2 tables at the same time.
6. Integrating and interacting with SQL: R is great at interacting with SQL, and SQL is of course the leading database language, which you will have to learn sooner or later as a data scientist. I will show you how to use SQL code within R, and there is even an R-to-SQL translator for standard R code. We will also set up a SQLite database from within R.
7. Outlier detection: datasets often contain values outside a plausible range. Faulty data generation or entry happens regularly. Statistical methods of outlier detection help to identify these values. We will take a look at the implementation of these.
8. Character strings, dates and times: these have their own rules when it comes to pre-processing. In this course we will also take a look at these types of data and how to handle them effectively in R.
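To give a feel for the outlier detection step above, here is a minimal sketch in base R. The course description does not say which statistical methods it covers, so this assumes the common 1.5 × IQR rule (Tukey's fences); the `flag_outliers` helper and the sample values are made up for illustration.

```r
# Hypothetical measurements; 250 stands in for a faulty data entry.
values <- c(12, 14, 15, 15, 16, 17, 18, 19, 21, 250)

# Assumed method: flag values more than k * IQR below the first
# quartile or above the third quartile (Tukey's fences).
flag_outliers <- function(x, k = 1.5) {
  q <- quantile(x, probs = c(0.25, 0.75), na.rm = TRUE, names = FALSE)
  spread <- q[2] - q[1]
  x < q[1] - k * spread | x > q[2] + k * spread
}

is_outlier <- flag_outliers(values)
values[is_outlier]
```

With these sample values, only 250 falls outside the fences. Whether a flagged value is really an error is still a judgment call that depends on the data.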
How do you best prepare yourself for this course?
You only need a basic knowledge of R to fully benefit from this course. Once you know the basics of RStudio and R, you are ready to follow along with the course material. Of course you will also get the R scripts, which make it even easier.
The screencasts are made in RStudio, so you should get this program on top of R. Required add-on packages are listed in the course.
Again, if you want to make sure that you have proper data with a tidy format, take a look at this course. It will make your analytics with R much easier!

What can you learn from this course?

✓ import data into R in several ways while also being able to identify a suitable import tool
✓ select and implement a proper object class (data.frame, data.table, data_frame)
✓ convert your data into (and understand) a tidy data format
✓ filter and query your data based on a wide range of parameters
✓ join 2 data tables together with dplyr two-table verb syntax
✓ use SQL code within R
✓ translate basic R into SQL
✓ work with dates and time
✓ work with strings using regular expressions
✓ detect outliers in datasets
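Two of the goals above, converting data to a tidy format and joining 2 tables with dplyr, might look roughly like this in practice. This is only a sketch under assumptions: the tables and column names are invented, and it uses `pivot_longer()` from tidyr 1.0.0 or later, whereas the course itself (last updated in 2018) may teach older functions such as `gather()`.

```r
library(tidyr)
library(dplyr)

# Hypothetical wide table: one column per year, so it is not tidy.
sales_wide <- data.frame(
  store = c("A", "B"),
  `2017` = c(100, 80),
  `2018` = c(120, 95),
  check.names = FALSE
)

# Reshape to one row per store-year observation.
sales_tidy <- pivot_longer(
  sales_wide,
  cols = -store,
  names_to = "year",
  values_to = "revenue"
)

# Hypothetical lookup table sharing the "store" key.
stores <- data.frame(store = c("A", "B"), region = c("North", "South"))

# Two-table verb: keep every row of sales_tidy and add matching columns.
sales_joined <- left_join(sales_tidy, stores, by = "store")
sales_joined
```

`left_join()` is one of several dplyr two-table verbs; `inner_join()` or `anti_join()` would be chosen instead when unmatched rows should be dropped or isolated.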

What do you need to start the course?

• Computer with R and RStudio ready to use
• You should have basic R / RStudio knowledge
• Required add-on packages will be listed in the course orientation video

Who is this course made for?

• Data pre-processing is a crucial step of data-related work – therefore this course is intended for all R users

Are there coupons or discounts for R Data Pre-Processing & Data Management – Shape your Data!? What is the current price?

The course costs $17.99. Currently there is an 82% discount on the original price of $99.99, so you save $82 if you enroll in the course now.
The average price of 161 R courses on Udemy is $18.7, so this course is 4% cheaper than the average R course on Udemy.

Will I be refunded if I'm not satisfied with the R Data Pre-Processing & Data Management - Shape your Data! course?

Yes, R Data Pre-Processing & Data Management – Shape your Data! has a 30-day money-back guarantee. The 30-day refund policy is designed to allow students to study without risk.

Is there any financial aid for this course?

Currently we could not find a scholarship for the R Data Pre-Processing & Data Management – Shape your Data! course, but there is an $82 discount from the original price ($99.99). So the current price is just $17.99.

Who will teach this course? Can I trust R-Tutorials Training?

R-Tutorials Training has created 24 courses and taught 253,639 students. The courses have received 31,105 reviews, which are generally positive, with an average rating of 4.4. Based on the information available, we think that R-Tutorials Training is an instructor that you can trust.
Data Science Education
R-Tutorials is your provider of choice when it comes to analytics training courses! Try it out – our 100,000+ students love it.
We focus on Data Science tutorials. Offering several R courses for every skill level, we are among Udemy’s top R training providers. On top of that, courses on Tableau, Excel and a Data Science career guide are available.
All of our courses contain exercises to give you the opportunity to try out the material on your own. You will also get downloadable script pdfs to recap the lessons.
The courses are taught by our main instructor Martin – trained biostatistician and enthusiastic data scientist / R user.
Should you have any questions, you are invited to check out our website, open a discussion in the course, or simply drop us a pm.
We are here to help you boost your career with analytics training – Just learn and enjoy.
