Data Engineering on Google Cloud platform

End-to-end batch processing, data orchestration and real-time streaming analytics on GCP
4.4/5
(373 reviews)
3,128 students
Created by Siddharth Raghunath

Classbaze Grade®: 9.5
Freshness: 10.0
Popularity: 8.5
Material: 9.4

Platform: Udemy
Video: 10h 1m
Language: English
Next start: On Demand

Classbaze Rating

Classbaze Grade®

9.5 / 10

CourseMarks Score® helps students to find the best classes. We aggregate 18 factors, including freshness, student feedback and content diversity.

Freshness

10.0 / 10
This course was last updated on 6/2022.

Course content can become outdated quite quickly. After analysing 71,530 courses, we found that the highest rated courses are updated every year. If a course has not been updated for more than 2 years, you should carefully evaluate the course before enrolling.

Popularity

8.5 / 10
We analyzed factors such as the rating (4.4/5) and the ratio between the number of reviews and the number of students, which is a great signal of student commitment.

New courses are hard to evaluate because there are no or just a few student ratings, but Student Feedback Score helps you find great courses even with fewer reviews.

Material

9.4 / 10
Video Score: 9.1 / 10
The course includes 10h 1m of video content. Courses with more video usually have a higher average rating. We have found that the sweet spot is 16 hours of video, which is long enough to teach a topic comprehensively but not overwhelming. Courses with over 16 hours of video get the maximum score.
The average video length across 62 Google Cloud courses on Udemy is 5 hours 46 minutes.
Detail Score: 9.5 / 10

Top online courses contain a detailed description of the course, a list of what you will learn, and a detailed description of the instructor.

Extra Content Score: 9.5 / 10

Tests, exercises, articles and other resources help students deepen their understanding of the topic.

This course contains:

0 articles.
42 resources.
0 exercises.
0 tests.

About the course

Google Cloud Platform is catching up, and many companies have already started moving their infrastructure to GCP. This course provides practical solutions to real-world data engineering use cases on the cloud. It is designed around the end-to-end lifecycle of a typical big data ETL project, covering both batch processing and real-time streaming and analytics.
Considering the most important components of any batch processing or streaming job, this course covers:
• Writing ETL jobs using PySpark from scratch
• Storage components on GCP (GCS and Dataproc HDFS)
• Loading data into the data warehousing tool on GCP (BigQuery)
• Handling and writing data orchestration and dependencies using Apache Airflow (Google Composer) in Python from scratch
• Batch data ingestion using Sqoop, Cloud SQL and Apache Airflow
• Real-time data streaming and analytics using the latest API, Spark Structured Streaming, with Python
• Micro-batching using PySpark streaming and Hive on Dataproc
The coding tutorials and problem statements in this course are comprehensive and aim to give you enough confidence to take up new challenges in the Big Data / Hadoop ecosystem on the cloud and to approach problem statements and job interviews without inhibition.
Most importantly, this course uses Ubuntu Linux 18.04 as the local operating system. Although most of the code is run and triggered on the cloud, the course expects you to be experienced enough to set up the Google SDKs, Python and a GCP account on your local machine by yourself. The local operating system itself does not matter for succeeding in this course.
P.S.: Use 88BA1461141F3A2A6E2D for half price.

What can you learn from this course?

✓ PySpark for ETL/batch processing on GCP using BigQuery as the data warehousing component
✓ Automate and orchestrate Spark SQL batch jobs using Apache Airflow and Google Workflows
✓ Sqoop for data ingestion from Cloud SQL, and using Airflow to automate the batch ETL
✓ Difference between event-time data transformations and processing-time transformations
✓ PySpark Structured Streaming: real-time data streaming and transformations
✓ Save real-time streaming raw data as external Hive tables on Dataproc and perform ad hoc queries using Hive SQL
✓ Run Hive and Spark SQL jobs on Dataproc and automate micro-batching and transformations using Airflow
✓ PySpark Structured Streaming: handling late data using watermarking and event-time data processing
✓ Using different file formats (Avro and Parquet) and the scenarios in which to use each
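
To make the event-time and watermarking items above more concrete, here is a minimal sketch in plain Python. It is not Spark's API and not code from the course. The function name `windowed_counts` and its parameters are hypothetical, and the rule it applies (drop an event once the watermark has passed the end of its window) is a simplification of how a streaming engine handles late data.

```python
from collections import defaultdict


def windowed_counts(events, window_s=10, allowed_lateness_s=5):
    """Count events per tumbling event-time window.

    events: event times in seconds, listed in ARRIVAL order, so an
    earlier event time appearing after a later one is "late" data.
    Returns (counts keyed by window start, list of dropped event times).
    """
    counts = defaultdict(int)
    dropped = []
    max_event_time = None

    for event_time in events:
        # The watermark trails the largest event time seen so far.
        if max_event_time is None or event_time > max_event_time:
            max_event_time = event_time
        watermark = max_event_time - allowed_lateness_s

        # Assign the event to a window by its event time,
        # not by the moment it arrived (processing time).
        window_start = (event_time // window_s) * window_s
        window_end = window_start + window_s

        # Too late: the watermark has already passed this window.
        if window_end <= watermark:
            dropped.append(event_time)
            continue

        counts[window_start] += 1

    return dict(counts), dropped


counts, dropped = windowed_counts([1, 4, 12, 3, 27, 8, 21])
print(counts)   # events per window start
print(dropped)  # events discarded as too late
```

In this run the event at time 3 arrives after the event at time 12 but is still counted, because the watermark (12 - 5 = 7) has not yet passed the end of the 0 to 10 window. The event at time 8 arrives after time 27 and is dropped, because the watermark has by then moved to 22.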

What do you need to start the course?

• Basic Python skills
• Comfort with basic Linux/Bash commands
• Basic understanding of Spark (Python) and how Hadoop works
• A Google Cloud account (if you do not have one, sign up for a free trial account)
• Comfort with setting up the Google SDKs regardless of the operating system
• A desire to learn and eagerness to explore the relevant topics

Who is this course made for?

• Any techie who needs hands-on project expertise in end-to-end batch data processing and real-time streaming
• Aspiring data engineers who find it hard to set up and work practically on distributed processing
• Any techie who is preparing for an interview for a data engineering position and wants hands-on expertise

Are there coupons or discounts for Data Engineering on Google Cloud platform? What is the current price?

The course costs $15.99. There is currently a 20% discount on the original price of the course, which was $19.99, so you save $4 if you enroll in the course now.
The average price of 62 Google Cloud courses is $16.5, so this course is 3% cheaper than the average Google Cloud course on Udemy.
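
The figures above can be checked with a few lines of Python. This is only a sketch of the arithmetic using the prices quoted in this listing. The variable names are illustrative.

```python
original_price = 19.99  # original price quoted above
current_price = 15.99   # current discounted price
average_price = 16.50   # average price of 62 Google Cloud courses

# Absolute saving and discount relative to the original price.
savings = original_price - current_price
discount_pct = savings / original_price * 100

# How far the current price sits below the category average.
below_average_pct = (average_price - current_price) / average_price * 100

print(f"Savings: ${savings:.2f}")
print(f"Discount: {discount_pct:.0f}%")
print(f"Below average: {below_average_pct:.0f}%")
```

Rounded, this gives a saving of $4.00, a discount of 20% and a price about 3% below the average, which matches the numbers quoted.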

Will I be refunded if I'm not satisfied with the Data Engineering on Google Cloud platform course?

YES, Data Engineering on Google Cloud platform has a 30-day money back guarantee. The 30-day refund policy is designed to allow students to study without risk.

Is there any financial aid for this course?

We could not currently find a scholarship for the Data Engineering on Google Cloud platform course, but there is a $4 discount on the original price ($19.99), so the current price is just $15.99.

Who will teach this course? Can I trust Siddharth Raghunath?

Siddharth Raghunath has created 3 courses, which have received 889 reviews that are generally positive. Siddharth Raghunath has taught 7,078 students and has an average rating of 4.4 across those 889 reviews. Based on the information available, we think that Siddharth Raghunath is an instructor you can trust.
Data Engineer / Cloud Data Engineer / Passionate Techie
I am a business-oriented engineering manager with vast experience in software development, distributed processing and data engineering on the cloud. I have worked on different cloud platforms such as AWS and GCP, and also with on-prem Hadoop clusters. I also give seminars on distributed processing using Spark, real-time streaming and analytics, and best practices for ETL and data governance. I am also a passionate coder who loves writing and building optimal data pipelines for robust data processing and streaming solutions.
