Evaluating a Data Model
In this lab you will learn how to partition a data set into two separate parts: a training set that is used to develop a model, and a test set that is used to evaluate the accuracy of that model. Splitting the data this way lets you evaluate predictive models independently and in a repeatable manner. You will then re-create the model developed in a previous lab in this quest using the training data set and evaluate it against the test data set. The data is stored in Google BigQuery and the analysis is performed using JupyterLab.
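The idea of a repeatable split can be illustrated with a minimal Python sketch. It is not the lab's actual procedure: the column names (`FL_DATE`, `DEP_DELAY`, `ARR_DELAY`), the 70/30 ratio, the helper names, and the sample rows are all assumptions made up for illustration. The sketch hashes a key (here the flight date) so that the same row always lands in the same partition, no matter how often the split is rerun.

```python
import hashlib


def is_training(key: str, train_percent: int = 70) -> bool:
    """Return True if the key falls in the training partition.

    Hashing the key (instead of drawing a random number) makes the
    assignment deterministic, so the split is repeatable.
    """
    digest = hashlib.md5(key.encode("utf-8")).hexdigest()
    bucket = int(digest, 16) % 100  # stable bucket in the range 0..99
    return bucket < train_percent


def split_by_key(rows, key_field, train_percent=70):
    """Partition rows into (train, test) lists based on a hashed key field."""
    train, test = [], []
    for row in rows:
        if is_training(row[key_field], train_percent):
            train.append(row)
        else:
            test.append(row)
    return train, test


# Hypothetical sample rows; field names and values are invented for illustration.
flights = [
    {"FL_DATE": "2015-01-01", "DEP_DELAY": 5.0, "ARR_DELAY": 12.0},
    {"FL_DATE": "2015-01-01", "DEP_DELAY": 20.0, "ARR_DELAY": 31.0},
    {"FL_DATE": "2015-01-02", "DEP_DELAY": -3.0, "ARR_DELAY": -8.0},
    {"FL_DATE": "2015-01-03", "DEP_DELAY": 0.0, "ARR_DELAY": 4.0},
    {"FL_DATE": "2015-01-04", "DEP_DELAY": 45.0, "ARR_DELAY": 50.0},
]

train_rows, test_rows = split_by_key(flights, "FL_DATE")
```

Because the split is keyed on the date, all flights from a given day end up in the same partition. That is one possible design choice; a different key or a different ratio may suit other data sets better.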
The data set provides historical information about domestic flights in the United States, retrieved from the US Bureau of Transportation Statistics website. It can be used to demonstrate a wide range of data science concepts and techniques, and it is used in all of the other labs in the Data Science on the Google Cloud Platform quest.
Google BigQuery is a RESTful web service that enables interactive analysis of very large datasets and works in conjunction with Google Cloud Storage.