menu
arrow_back

ETL Processing on GCP Using Dataflow and BigQuery

ETL Processing on GCP Using Dataflow and BigQuery

시간 9 크레딧

GSP290

Google Cloud Self-Paced Labs

Overview

In this lab you will build several Data Pipelines that will ingest data from a publicly available dataset into BigQuery, using these GCP services:

  • GCS - Google Cloud Storage
  • Dataflow - Google Dataflow
  • BigQuery - BigQuery tables

You will create your own Data Pipeline, including the design considerations, as well as implementation details, to ensure that your prototype meets the requirements. Be sure to open the python files and read the comments when instructed to.

Join Qwiklabs to read the rest of this lab...and more!

  • Get temporary access to the Google Cloud Console.
  • Over 200 labs from beginner to advanced levels.
  • Bite-sized so you can learn at your own pace.
Join to Start This Lab
점수

—/100

Create a Cloud Storage Bucket

단계 진행

/ 20

Copy Files to Your Bucket

단계 진행

/ 10

Create the BigQuery Dataset (name: lake)

단계 진행

/ 20

Build a Data Ingestion Dataflow Pipeline

단계 진행

/ 10

Build a Data Transformation Dataflow Pipeline

단계 진행

/ 10

Build a Data Enrichment Dataflow Pipeline

단계 진행

/ 10

Build a Data lake to Mart Dataflow Pipeline

단계 진행

/ 10

Build a Data lake to Mart CoGroupByKey Dataflow Pipeline

단계 진행

/ 10