Creating a Data Transformation Pipeline with Cloud Dataprep

1 hour 15 minutes, 7 credits

GSP430

Google Cloud Self-Paced Labs

Overview

Cloud Dataprep by Trifacta is an intelligent data service for visually exploring, cleaning, and preparing structured and unstructured data for analysis. In this lab you explore the Cloud Dataprep UI to build a data transformation pipeline that runs at a scheduled interval and outputs results into BigQuery.

You'll use an ecommerce dataset containing millions of Google Analytics session records for the Google Merchandise Store, loaded into BigQuery. You have a copy of that dataset for this lab and will explore its fields and rows for insights.

Objectives

In this lab, you learn how to perform these tasks:

  • Connect BigQuery datasets to Cloud Dataprep.
  • Explore dataset quality with Cloud Dataprep.
  • Create a data transformation pipeline with Cloud Dataprep.
  • Schedule transformation job outputs to BigQuery.

What you'll need
