Spark PySpark and SparkSQL This repository has basics to advanced use to Spark. It is a 6-node cluster maintained on Google Cloud with Ubuntu 16.04 LTA file system