Analyzing Natality Data Using Datalab and BigQuery

Analyzing Natality Data Using Datalab and BigQuery

30 minutes 7 Credits


Google Cloud Self-Paced Labs


In this lab you analyze a large (137 million rows) natality dataset using Google BigQuery and Cloud Datalab.

If you are not yet familiar with Datalab, here is a graphical cheat sheet for the main Datalab functionality:


What you learn

In this lab, you:

  • Launch Cloud Datalab
  • Invoke a BigQuery query
  • Create charts in Datalab
  • Export data for machine learning

This lab illustrates how you can carry out data exploration of large datasets, but continue to use familiar tools like Pandas and Jupyter. The trick is to do the first part of your aggregation in BigQuery, get back a Pandas DataFrame, then work with the smaller Pandas DataFrame locally. Datalab provides a managed Jupyter experience, so you don't need to run notebook servers yourself.

Join Qwiklabs to read the rest of this lab...and more!

  • Get temporary access to the Google Cloud Console.
  • Over 200 labs from beginner to advanced levels.
  • Bite-sized so you can learn at your own pace.
Join to Start This Lab


Launch Cloud Datalab

Run Step

/ 50

Create a notebook

Run Step

/ 50