Create a Cloud Dataproc cluster (region: us-central1)
Submit a Spark job to your cluster (region: us-central1)
Introduction to Cloud Dataproc: Hadoop and Spark on Google Cloud
Cloud Dataproc is a managed Spark and Hadoop service that lets you take advantage of open source data tools for batch processing, querying, streaming, and machine learning. Cloud Dataproc automation helps you create clusters quickly, manage them easily, and save money by turning clusters off when you don't need them. With less time and money spent on administration, you can focus on your jobs and your data.
This lab is adapted from https://cloud.google.com/dataproc/quickstart-console.
What you'll learn
How to create a managed Cloud Dataproc cluster (with Apache Spark pre-installed).
How to submit a Spark job
How to shut down your cluster
What you'll need
加入 Qwiklabs 即可阅读本实验的剩余内容…以及更多精彩内容！
- 获取对“Google Cloud Console”的临时访问权限。
- 200 多项实验，从入门级实验到高级实验，应有尽有。