Building Batch Data Pipelines on GCP
Data pipelines typically fall under one of the Extract-Load (EL), Extract-Load-Transform (ELT), or Extract-Transform-Load (ETL) paradigms. This course describes which paradigm to use, and when, for batch data. It also covers several Google Cloud Platform technologies for data transformation, including BigQuery, executing Spark on Cloud Dataproc, pipeline graphs in Cloud Data Fusion, and serverless data processing with Cloud Dataflow. Learners get hands-on experience building data pipeline components on Google Cloud Platform using QwikLabs.
Author Name: Google Cloud
Author Description:
Google Cloud can help solve your toughest problems and grow your business. With Google Cloud, their infrastructure is your infrastructure. Their tools are your tools. And their innovations are your innovations.
Table of Contents
- Introduction (1min)
- Introduction to Batch Data Pipelines (17mins)
- Executing Spark on Cloud Dataproc (53mins)
- Manage Data Pipelines with Cloud Data Fusion and Cloud Composer (45mins)
- Serverless Data Processing with Cloud Dataflow (41mins)
- Summary (4mins)