menu
arrow_back

ETL Processing on GCP Using Dataflow and BigQuery

ETL Processing on GCP Using Dataflow and BigQuery

小时 9 积分

GSP290

Google Cloud Self-Paced Labs

Overview

In this lab you will build several Data Pipelines that will ingest data from a publicly available dataset into BigQuery, using these GCP services:

  • GCS - Google Cloud Storage
  • Dataflow - Google Dataflow
  • BigQuery - BigQuery tables

You will create your own Data Pipeline, including the design considerations, as well as implementation details, to ensure that your prototype meets the requirements. Be sure to open the python files and read the comments when instructed to.

Join Qwiklabs to read the rest of this lab...and more!

  • Get temporary access to the Google Cloud Console.
  • Over 200 labs from beginner to advanced levels.
  • Bite-sized so you can learn at your own pace.
Join to Start This Lab
得分

—/100

Create a Cloud Storage Bucket

Run Step

/ 20

Copy Files to Your Bucket

Run Step

/ 10

Create the BigQuery Dataset (name: lake)

Run Step

/ 20

Build a Data Ingestion Dataflow Pipeline

Run Step

/ 10

Build a Data Transformation Dataflow Pipeline

Run Step

/ 10

Build a Data Enrichment Dataflow Pipeline

Run Step

/ 10

Build a Data lake to Mart Dataflow Pipeline

Run Step

/ 10

Build a Data lake to Mart CoGroupByKey Dataflow Pipeline

Run Step

/ 10