<  Back

Data Engineering on Google Cloud

Data Engineering on Google Cloud

BOOK YOUR SEAT

Course Overview

Get hands-on experience with designing and building data processing systems on Google Cloud. This course uses lectures, demos, and hands-on labs to show you how to design data processing systems, build end-to-end data pipelines, analyze data, and implement machine learning. This course covers structured, unstructured, and streaming data.

• Design and build data processing systems on Google Cloud
• Process batch and streaming data by implementing autoscaling data pipelines on Cloud Dataflow
• Derive business insights from extremely large datasets using Google BigQuery
• Leverage unstructured data using Spark and ML APIs on Cloud Dataproc
• Enable instant insights from streaming data
• Understand ML APIs and BigQuery ML, and learn to use AutoML to create powerful models without coding

Audience Profile

This course is intended for developers responsible for:

• Extracting, loading, transforming, cleaning, and validating data
• Designing pipelines and architectures for data processing
• Integrating analytics and machine learning capabilities into data pipelines
• Querying datasets, visualizing query results and creating reports

Prerequisites

To benefit from this course, participants should have completed “Google Cloud Big Data and Machine Learning Fundamentals” or have equivalent experience. Participant should also have:

• Basic proficiency with a common query language such as SQL
• Experience with data modeling and ETL (extract, transform, load) activities
• Experience with developing applications using a common programming language such as Python
• Familiarity with machine learning and/or statistics

Course Outline

Course Outline

  • Introduction to Data Engineering
  • Building a Data Lake
  • Building a Data Warehouse
  • Introduction to Building Batch Data Pipelines
  • Executing Spark on Cloud Dataproc
  • Serverless Data Processing with Cloud Dataflow
  • Manage Data Pipelines with Cloud Data Fusion and Cloud Composer
  • Introduction to Processing Streaming Data
  • Serverless Messaging with Cloud Pub/Sub
  • Dataflow Streaming Features
  • High-Throughput BigQuery and Bigtable Streaming Features
  • Advanced BigQuery Functionality and Performance
  • Introduction to Analytics and AI
  • Prebuilt ML model APIs for Unstructured Data
  • Big Data Analytics Notebooks
  • Production ML Pipelines
  • Custom Model building with SQL in BigQuery ML
  • Custom Model building with Cloud AutoML

COURSE INFORMATION

Code GDE2
Duration 4 Days
Full Price RM 7,200
Early Bird Price RM 6,400
DOWNLOAD TRAINING BROCHURE

    Sign up and enjoy
    early bird discount!