Classbaze

Disclosure: when you buy through links on our site, we may earn an affiliate commission.

Spark and Python for Big Data with PySpark

Learn how to use Spark with Python, including Spark Streaming, Machine Learning, Spark 2.0 DataFrames and more!
4.5/5 (18,617 reviews)
98,818 students
Created by Jose Portilla

Classbaze Grade®: 8.9
Freshness: 7.6
Popularity: 9.1
Material: 9.5

Platform: Udemy
Video: 10h 35m
Language: English
Next start: On Demand

Classbaze Rating

Classbaze Grade®

8.9 / 10

CourseMarks Score® helps students to find the best classes. We aggregate 18 factors, including freshness, student feedback and content diversity.

Freshness

7.6 / 10
This course was last updated on 5/2020.

Course content can become outdated quite quickly. After analysing 71,530 courses, we found that the highest rated courses are updated every year. If a course has not been updated for more than 2 years, you should carefully evaluate the course before enrolling.

Popularity

9.1 / 10
We analyzed factors such as the rating (4.5/5) and the ratio between the number of reviews and the number of students, which is a great signal of student commitment.

New courses are hard to evaluate because there are no or just a few student ratings, but Student Feedback Score helps you find great courses even with fewer reviews.

Material

9.5 / 10
Video Score: 9.2 / 10
The course includes 10h 35m of video content. Courses with more video usually have a higher average rating. We have found that the sweet spot is 16 hours of video, which is long enough to teach a topic comprehensively but not overwhelming. Courses with over 16 hours of video get the maximum score.
The average video length across 1,582 Python courses on Udemy is 7 hours 31 minutes.
Detail Score: 9.5 / 10

Top online courses contain a detailed description of the course, what you will learn, and a detailed description of the instructor.

Extra Content Score: 9.9 / 10

Tests, exercises, articles and other resources help students deepen their understanding of the topic.

This course contains:

4 articles.
4 resources.
0 exercises.
0 tests.

About the course

Learn the latest Big Data Technology – Spark! And learn to use it with one of the most popular programming languages, Python!
One of the most valuable technology skills is the ability to analyze huge data sets, and this course is specifically designed to bring you up to speed on one of the best technologies for this task, Apache Spark! The top technology companies like Google, Facebook, Netflix, Airbnb, Amazon, NASA, and more are all using Spark to solve their big data problems!
Spark can perform up to 100x faster than Hadoop MapReduce, which has caused an explosion in demand for this skill! Because the Spark 2.0 DataFrame framework is so new, you now have the ability to quickly become one of the most knowledgeable people in the job market!
This course will teach the basics with a crash course in Python, continuing on to learning how to use Spark DataFrames with the latest Spark 2.0 syntax! Once we’ve done that we’ll go through how to use the MLlib Machine Learning Library with the DataFrame syntax and Spark. All along the way you’ll have exercises and Mock Consulting Projects that put you right into a real world situation where you need to use your new skills to solve a real problem!
We also cover the latest Spark Technologies, like Spark SQL, Spark Streaming, and advanced models like Gradient Boosted Trees! After you complete this course you will feel comfortable putting Spark and PySpark on your resume! This course also has a full 30 day money back guarantee and comes with a LinkedIn Certificate of Completion!
If you’re ready to jump into the world of Python, Spark, and Big Data, this is the course for you!

What can you learn from this course?

✓ Use Python and Spark together to analyze Big Data
✓ Learn how to use the new Spark 2.0 DataFrame Syntax
✓ Work on Consulting Projects that mimic real world situations!
✓ Classify Customer Churn with Logistic Regression
✓ Use Spark with Random Forests for Classification
✓ Learn how to use Spark’s Gradient Boosted Trees
✓ Use Spark’s MLlib to create Powerful Machine Learning Models
✓ Learn about the DataBricks Platform!
✓ Get set up on Amazon Web Services EC2 for Big Data Analysis
✓ Learn how to use AWS Elastic MapReduce Service!
✓ Learn how to leverage the power of Linux with a Spark Environment!
✓ Create a Spam filter using Spark and Natural Language Processing!
✓ Use Spark Streaming to Analyze Tweets in Real Time!

What do you need to start the course?

• General Programming Skills in any Language (Preferably Python)
• 20 GB of free space on your local computer (or alternatively a strong internet connection for AWS)

Who is this course made for?

• Someone who knows Python and would like to learn how to use it for Big Data
• Someone who is very familiar with another programming language and needs to learn Spark

Are there coupons or discounts for Spark and Python for Big Data with PySpark? What is the current price?

The course costs $24.99. There is currently an 81% discount on the original price of $129.99, so you save $105 if you enroll in the course now.
The average price of 1,582 Python courses is $20.10, so this course is 24% more expensive than the average Python course on Udemy.
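The percentages above can be checked with a few lines of Python. This is only a sketch of the arithmetic using the prices quoted on this page; the function names are our own, and we assume the results are rounded to the nearest whole percent.

```python
def percent_discount(original, current):
    """Discount relative to the original price, rounded to a whole percent."""
    return round((original - current) / original * 100)


def percent_above_average(price, average):
    """How much more expensive a price is than the average, in whole percent."""
    return round((price - average) / average * 100)


# Prices quoted on this page (USD).
original_price = 129.99
current_price = 24.99
average_price = 20.10

savings = round(original_price - current_price, 2)
discount = percent_discount(original_price, current_price)
premium = percent_above_average(current_price, average_price)

print(f"You save ${savings:.2f} ({discount}% off).")
print(f"The course costs {premium}% more than the average Python course.")
```

With the quoted prices this gives savings of $105.00, an 81% discount, and a 24% premium over the average price.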

Will I be refunded if I'm not satisfied with the Spark and Python for Big Data with PySpark course?

YES, Spark and Python for Big Data with PySpark has a 30-day money back guarantee. The 30-day refund policy is designed to allow students to study without risk.

Is there any financial aid for this course?

Currently we could not find a scholarship for the Spark and Python for Big Data with PySpark course, but there is a $105 discount from the original price ($129.99). So the current price is just $24.99.

Who will teach this course? Can I trust Jose Portilla?

Jose Portilla has created 50 courses and taught 2,879,309 students, receiving an average rating of 4.6 across 905,198 generally positive reviews. Based on the information available, we think that Jose Portilla is an instructor you can trust.
Head of Data Science at Pierian Training
Jose Marcial Portilla has a BS and MS in Mechanical Engineering from Santa Clara University and years of experience as a professional instructor and trainer for Data Science, Machine Learning and Python Programming. He has publications and patents in various fields such as microfluidics, materials science, and data science. Over the course of his career he has developed a skill set in analyzing data and he hopes to use his experience in teaching and data science to help other people learn the power of programming, the ability to analyze data, and the skills needed to present the data in clear and beautiful visualizations. Currently he works as the Head of Data Science for Pierian Training and provides in-person data science and Python programming training courses to employees working at top companies, including General Electric, Cigna, The New York Times, Credit Suisse, McKinsey and many more. Feel free to check out the website link to find out more information about training offerings.
Browse all courses by Jose Portilla on Classbaze.
