Machine Learning with Spark on Google Cloud Dataproc

No context was given for any of the commands.

Kept getting errors:

    >>> schema = StructType([get_structfield(colname) for colname in header.split(',')])
    Traceback (most recent call last):
      File "<stdin>", line 1, in <module>
      File "<stdin>", line 1, in <listcomp>
    NameError: name 'get_structfield' is not defined

Tried copying and pasting differently and entering the code manually.

This is a useless lab. Covers too much material for any reasonable focus; copy-paste does not work, so we're just copy-typing long function definitions? Why not put it in a notebook?

Great lab. The Spark code needed to be entered one line/command at a time. If I copied a code block of many lines/commands, this error occurred: "SyntaxError: multiple statements found while compiling a single statement". When I entered one line/command at a time it worked fine, but that is not what the directions say to do. The directions encourage copying and pasting code blocks, resulting in the error above.
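
The error quoted in this review can be reproduced outside Spark with Python's built-in compile(). This is a minimal sketch and not the lab's code: the helper name compiles_in_mode and the sample strings are hypothetical. It assumes the pasted block reached the interpreter as one chunk, which "single" (interactive) mode rejects when the chunk holds more than one statement, while "exec" mode accepts it.

```python
def compiles_in_mode(source: str, mode: str) -> bool:
    """Return True if `source` compiles in the given mode ("single" or "exec")."""
    try:
        compile(source, "<stdin>", mode)
        return True
    except SyntaxError:
        return False


# One statement, as typed at the prompt one line at a time.
one_statement = "x = 1\n"

# Two statements delivered together, as a multi-line paste might be.
pasted_block = "x = 1\ny = 2\n"

single_ok = compiles_in_mode(one_statement, "single")      # one statement is accepted
pasted_single_ok = compiles_in_mode(pasted_block, "single")  # rejected in interactive mode
pasted_exec_ok = compiles_in_mode(pasted_block, "exec")      # same text is fine as a script

print(single_ok, pasted_single_ok, pasted_exec_ok)
```

If this is what happened in the lab shell, saving the block to a file and running it as a script, or pasting one statement at a time as this reviewer did, would avoid the error.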

No... The copy-and-paste instructions are not clean. Cannot recommend.

