Exploratory Data Analysis Using AI Platform
In this lab you will learn the process of analyzing a data set stored in Google BigQuery using AI Platform to perform queries and present the data using various statistical plotting techniques. The analysis will help you discover patterns in the data that will allow you to predict probable arrival time delays given the initial flight details and actual departure time.
AI Platform is a powerful interactive tool created to explore, analyze, transform and visualize data and build machine learning models on Google Cloud Platform. It runs on Google Compute Engine and connects to multiple cloud services easily so you can focus on your data science tasks.
Google BigQuery is a RESTful web service that enables interactive analysis of massively large datasets working in conjunction with Google Storage.
The data set that is used provides historic information about internal flights in the United States retrieved from the US Bureau of Transport Statistics website. This data set can be used to demonstrate a wide range of data science concepts and techniques and will be used in all of the other labs in the Data Science on Google Cloud Platform Quest.
加入 Qwiklabs 即可阅读本实验的剩余内容…以及更多精彩内容！
- 获取对“Google Cloud Console”的临时访问权限。
- 200 多项实验，从入门级实验到高级实验，应有尽有。
Create the AI Platform notebook instance
Run Queries in Jupyter notebook
Run Queries in Bigquery
Execute query in AI Platform to visualize thresholded data
Execute query in Jupyter notebook to derive the probability distribution function from the data itself