Classbaze

Disclosure: when you buy through links on our site, we may earn an affiliate commission.

Learning Path: Data Science With Apache Spark 2

Get started with Spark for large-scale distributed data processing and data science
3.6
3.6/5
(10 reviews)
164 students
Created by

6.7

Classbaze Grade®

3.6

Freshness

6.6

Popularity

9.4

Material

Get started with Spark for large-scale distributed data processing and data science
Platform: Udemy
Video: 8h 58m
Language: English
Next start: On Demand

Best Apache Spark classes:

Classbaze Rating

Classbaze Grade®

6.7 / 10

CourseMarks Score® helps students to find the best classes. We aggregate 18 factors, including freshness, student feedback and content diversity.

Freshness

3.6 / 10
This course was last updated on 2/2017.

Course content can become outdated quite quickly. After analysing 71,530 courses, we found that the highest rated courses are updated every year. If a course has not been updated for more than 2 years, you should carefully evaluate the course before enrolling.

Popularity

6.6 / 10
We analyzed factors such as the rating (3.6/5) and the ratio between the number of reviews and the number of students, which is a great signal of student commitment.

New courses are hard to evaluate because there are no or just a few student ratings, but Student Feedback Score helps you find great courses even with fewer reviews.

Material

9.4 / 10
Video Score: 8.9 / 10
The course includes 8h 58m video content. Courses with more videos usually have a higher average rating. We have found that the sweet spot is 16 hours of video, which is long enough to teach a topic comprehensively, but not overwhelming. Courses over 16 hours of video gets the maximum score.
The average video length is 6 hours 47 minutes of 113 Apache Spark courses on Udemy.
Detail Score: 9.9 / 10

The top online course contains a detailed description of the course, what you will learn and also a detailed description about the instructor.

Extra Content Score: 9.5 / 10

Tests, exercises, articles and other resources help students to better understand and deepen their understanding of the topic.

This course contains:

0 article.
1 resources.
0 exercise.
0 test.

In this page

About the course

The real power and value proposition of Apache Spark is its speed and platform to execute data processing and data science tasks. Sounds interesting? Let’s see how easy it is!
Packt’s Video Learning Paths are a series of individual video products put together in a logical and stepwise manner such that each video builds on the skills learned in the video before it.
Spark is one of the most widely-used large-scale data processing engines and runs extremely fast. It is a framework that has tools that are equally useful for application developers as well as data scientists. Spark’s unique use case is that it combines ETL, batch analytics, real-time stream analysis, machine learning, graph processing, and visualizations to allow data scientists to tackle the complexities that come with raw unstructured datasets.
This Learning Path starts with an introduction tour of Apache Spark 2. We will look at the basics of Spark, introduce SparkR, then look at the charting and plotting features of Python in conjunction with Spark data processing, and finally take a thorough look at Spark’s data processing libraries. We then develop a real-world Spark application. Next, we will help you become comfortable and confident working with Spark for data science by exploring Spark’s data science libraries on a dataset of tweets.
The goal of this course to introduce you to Apache Spark 2 and teach you its data processing and data science libraries so that you are equipped with the skills required from modern data scientists.
This Learning Path is authored by some of the best in their fields.
Rajanarayanan Thottuvaikkatumana
Rajanarayanan Thottuvaikkatumana, or Raj, is a seasoned technologist with more than 23 years of software development experience at various multinational companies. His experience includes architecting, designing, and developing software applications. He has worked on various technologies including major databases, application development platforms, web technologies, and big data technologies. Currently he is building a next generation Hadoop YARN-based data processing platform and an application suite built with Spark using Scala.
Eric Charles
Eric Charles has 10 years’ experience in the field of Data Science and is the founder of Datalayer, a social network for Data Scientists. His typical day includes building efficient processing with advanced machine learning algorithms, easy SQL, streaming and graph analytics. He also focuses a lot on visualization and result sharing. He is passionate about open source and is an active Apache Member. He regularly gives talks to corporate clients and at open source events. 

What can you learn from this course?

✓ Get to know the fundamentals of Spark 2.0 and the Spark programming model using Scala and Python
✓ Know how to use Spark SQL and DataFrames using Scala and Python
✓ Get an introduction to Spark programming using R
✓ Develop a complete Spark application
✓ Obtain and clean data before processing it
✓ Understand the Spark machine learning algorithm to build a simple pipeline
✓ Work with interactive visualization packages in Spark
✓ Apply data mining techniques on the available datasets
✓ Build a recommendation engine

What you need to start the course?

• Requires basic knowledge of either Python or R

Who is this course is made for?

• Application developers, data scientists, or big data architects interested in combining the data processing power of Apache Spark will find this course to be very useful. As implementations of Apache Spark will be shown with Scala and Python, some programming knowledge on these languages will be needed. This course is for anyone who wants to work with Spark on large and complex datasets. A basic knowledge about statistics and computational mathematics is expected.
• With the help of real-world use cases on the main features of Spark, this course offers an easy introduction to the framework. This practical hands-on course covers the fundamentals of Spark needed to get to grips with data science through a single dataset. It expands on the next learning curve for those comfortable with Spark programming who are looking to apply Spark in the field of data science.

Are there coupons or discounts for Learning Path: Data Science With Apache Spark 2 ? What is the current price?

The course costs $14.99. And currently there is a 82% discount on the original price of the course, which was $84.99. So you save $70 if you enroll the course now.
The average price is $17.1 of 113 Apache Spark courses. So this course is 12% cheaper than the average Apache Spark course on Udemy.

Will I be refunded if I'm not satisfied with the Learning Path: Data Science With Apache Spark 2 course?

YES, Learning Path: Data Science With Apache Spark 2 has a 30-day money back guarantee. The 30-day refund policy is designed to allow students to study without risk.

Are there any financial aid for this course?

Currently we could not find a scholarship for the Learning Path: Data Science With Apache Spark 2 course, but there is a $70 discount from the original price ($84.99). So the current price is just $14.99.

Who will teach this course? Can I trust Packt Publishing?

Packt Publishing has created 1,262 courses that got 66,758 reviews which are generally positive. Packt Publishing has taught 394,723 students and received a 3.9 average review out of 66,758 reviews. Depending on the information available, we think that Packt Publishing is an instructor that you can trust.
Tech Knowledge in Motion
Browse all courses by on Classbaze.

6.7

Classbaze Grade®

3.6

Freshness

6.6

Popularity

9.4

Material

Platform: Udemy
Video: 8h 58m
Language: English
Next start: On Demand

Classbaze recommendations for you