Introduction to Cloud Dataproc: Hadoop and Spark on Google Cloud

Checkpoints

Create a Cloud Dataproc cluster (region: us-central1)

Submit a Spark job to your cluster (region: us-central1)

30 hrs. Points: 5

GSP123

Google Cloud Self-Paced Labs

Overview

Cloud Dataproc is a managed Spark and Hadoop service that lets you take advantage of open source data tools for batch processing, querying, streaming, and machine learning. Cloud Dataproc automation helps you create clusters quickly, manage them easily, and save money by turning clusters off when you don't need them. With less time and money spent on administration, you can focus on your jobs and your data.

This lab is adapted from https://cloud.google.com/dataproc/quickstart-console.

What you'll learn

  • How to create a managed Cloud Dataproc cluster (with Apache Spark pre-installed).

  • How to submit a Spark job to the cluster.

  • How to shut down your cluster.
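
The lab itself works through the Cloud Console, but the same three steps can be sketched as gcloud command lines. The Python below is a minimal sketch that only builds and prints those commands; it does not run them. The cluster name example-cluster, the SparkPi example class, the jar path, and the argument 1000 are illustrative assumptions, not values given in this section. Only the region, us-central1, comes from the checkpoints above.

```python
import shlex

REGION = "us-central1"        # region named in the lab checkpoints
CLUSTER = "example-cluster"   # hypothetical cluster name, chosen for illustration


def create_cluster_cmd(cluster, region):
    """Command line that creates a Dataproc cluster with default settings."""
    return ["gcloud", "dataproc", "clusters", "create", cluster,
            f"--region={region}"]


def submit_spark_job_cmd(cluster, region, main_class, jar, job_args):
    """Command line that submits a Spark job; job arguments follow '--'."""
    return ["gcloud", "dataproc", "jobs", "submit", "spark",
            f"--cluster={cluster}", f"--region={region}",
            f"--class={main_class}", f"--jars={jar}",
            "--", *job_args]


def delete_cluster_cmd(cluster, region):
    """Command line that deletes the cluster so it stops incurring charges."""
    return ["gcloud", "dataproc", "clusters", "delete", cluster,
            f"--region={region}"]


# Assumed example job: the SparkPi sample that ships with Spark.
commands = [
    create_cluster_cmd(CLUSTER, REGION),
    submit_spark_job_cmd(
        CLUSTER, REGION,
        "org.apache.spark.examples.SparkPi",
        "file:///usr/lib/spark/examples/jars/spark-examples.jar",
        ["1000"],
    ),
    delete_cluster_cmd(CLUSTER, REGION),
]

# Print each command as a shell-safe string instead of executing it.
for cmd in commands:
    print(shlex.join(cmd))
```

To run the commands for real, each list could be passed to subprocess.run in an environment where the gcloud CLI is installed and authenticated against a project with the Dataproc API enabled.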

What you'll need
