Classbaze

Disclosure: when you buy through links on our site, we may earn an affiliate commission.

Scalable Data Analysis in Python with Dask

Build high-performance, distributed, and parallel applications in Dask
4.1/5
(153 reviews)
755 students
Created by Packt Publishing

8.9

Classbaze Grade®

8.5

Freshness

8.4

Popularity

9.1

Material

Platform: Udemy
Video: 3h 41m
Language: English
Next start: On Demand

Classbaze Rating

Classbaze Grade®

8.9 / 10

CourseMarks Score® helps students to find the best classes. We aggregate 18 factors, including freshness, student feedback and content diversity.

Freshness

8.5 / 10
This course was last updated on 2/2021.

Course content can become outdated quite quickly. After analysing 71,530 courses, we found that the highest rated courses are updated every year. If a course has not been updated for more than 2 years, you should carefully evaluate the course before enrolling.

Popularity

8.4 / 10
We analyzed factors such as the rating (4.1/5) and the ratio between the number of reviews and the number of students, which is a great signal of student commitment.

New courses are hard to evaluate because there are no or just a few student ratings, but Student Feedback Score helps you find great courses even with fewer reviews.

Material

9.1 / 10
Video Score: 8.1 / 10
The course includes 3h 41m of video content. Courses with more video usually have a higher average rating. We have found that the sweet spot is 16 hours of video, which is long enough to teach a topic comprehensively but not overwhelming. Courses with more than 16 hours of video get the maximum score.
The average video length across 1,582 Python courses on Udemy is 7 hours 31 minutes.
Detail Score: 9.7 / 10

Top online courses contain a detailed description of the course, a list of what you will learn, and a detailed description of the instructor.

Extra Content Score: 9.5 / 10

Tests, exercises, articles and other resources help students deepen their understanding of the topic.

This course contains:

0 articles.
1 resource.
0 exercises.
0 tests.

About the course

Data analysts, Machine Learning professionals, and data scientists often use tools such as Pandas, Scikit-Learn, and NumPy for data analysis on their personal computer. However, when they want to apply their analyses to larger datasets, these tools fail to scale beyond a single machine, and so the analyst is forced to rewrite their computation.
If you work on big data and you’re using Pandas, you know you can end up waiting up to a whole minute for a simple average of a series. And that’s just for a couple of million rows!
In this course, you’ll learn to scale your data analysis. First, you will execute distributed data science projects with Dask, from data ingestion through data manipulation to visualization. Then, you will explore the Dask framework itself. After that, you will see how Dask can be used with other common Python tools such as NumPy, Pandas, matplotlib, Scikit-learn, and more.
You’ll work on large datasets, performing exploratory data analysis to investigate each dataset and then summarizing your findings. You’ll learn by applying data analysis principles and different statistical techniques to the same massive datasets across different systems.
Throughout the course, we’ll go over the various techniques, modules, and features that Dask has to offer. Finally, you’ll learn to use its unique offering for machine learning, using the Dask-ML package. You’ll also start using parallel processing in your data tasks on your own system without moving to the distributed environment.
About the Author
Mohammed Kashif works as a data scientist at Nineleaps, India, dealing mostly with graph data analysis. Prior to this, he worked as a Python developer at Qualcomm. He completed his Master’s degree in computer science at IIIT Delhi, with a specialization in data engineering. His areas of interest include recommender systems, NLP, and graph analytics.
In his spare time, he likes to answer questions on StackOverflow and help other people debug their way out of misery. He is also an experienced teaching assistant with a demonstrated history of working in the higher-education industry.

What can you learn from this course?

✓ Understand the concept of block algorithms and how Dask leverages them to load large data
✓ Implement various examples using Dask Arrays, Bags, and Dask DataFrames for efficient parallel computing
✓ Combine Dask with existing Python packages such as NumPy and pandas
✓ See how Dask works under the hood and the various in-built algorithms it has to offer
✓ Leverage the power of Dask in a distributed setting and explore its various schedulers
✓ Implement an end-to-end Machine Learning pipeline in a distributed setting using Dask and scikit-learn
✓ Use Dask Arrays, Bags, and Dask DataFrames for parallel and out-of-memory computations
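
The block-algorithm idea in the first item can be sketched without Dask at all. The example below is a minimal standard-library illustration, not Dask's actual implementation: it splits a list into blocks, computes a partial (sum, count) per block, and combines the partial results into a mean. The function names and the chunk size are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor


def chunked(values, chunk_size):
    """Yield consecutive blocks of at most chunk_size items."""
    for start in range(0, len(values), chunk_size):
        yield values[start:start + chunk_size]


def partial_stats(block):
    """Per-block work: return the (sum, count) of one block."""
    return sum(block), len(block)


def blocked_mean(values, chunk_size=1000, workers=4):
    """Compute a mean by combining per-block partial results.

    Assumes values is non-empty. Threads are used only to show the
    structure; in CPython they will not speed up this CPU-bound work.
    """
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partials = list(pool.map(partial_stats, chunked(values, chunk_size)))
    total = sum(s for s, _ in partials)
    count = sum(n for _, n in partials)
    return total / count


data = list(range(10_000))
result = blocked_mean(data)
print(result)
```

Because each block only contributes a small partial result, the full dataset never has to be processed in one piece, which is the property that lets this style of algorithm scale to data larger than memory.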

What do you need to start the course?

• Working knowledge of Python coding and familiarity with Python libraries would be beneficial.

Who is this course made for?

• This course is for data scientists, Machine Learning engineers, and data engineers who want to perform predictive analytics and data science tasks at scale.

Are there coupons or discounts for Scalable Data Analysis in Python with Dask? What is the current price?

The course costs $17.99. There is currently an 82% discount on the original price of the course, which was $99.99, so you save $82 if you enroll in the course now.
The average price of 1,582 Python courses on Udemy is $20.10, so this course is 10% cheaper than the average Python course.

Will I be refunded if I'm not satisfied with the Scalable Data Analysis in Python with Dask course?

YES, Scalable Data Analysis in Python with Dask has a 30-day money back guarantee. The 30-day refund policy is designed to allow students to study without risk.

Is there any financial aid for this course?

Currently we could not find a scholarship for the Scalable Data Analysis in Python with Dask course, but there is an $82 discount from the original price ($99.99). So the current price is just $17.99.

Who will teach this course? Can I trust Packt Publishing?

Packt Publishing has created 1,262 courses, taught 394,771 students, and received an average rating of 3.9 from 66,776 reviews, which are generally positive. Based on the information available, we think that Packt Publishing is an instructor that you can trust.
Tech Knowledge in Motion
