This article will walk you through how to build Apache Spark for usage on your local machine. After that you'll be able to create Spark Clusters or try out Spark on your local computer.
In this easy to follow tutorial, learn the basics of Spark DataFrames, how they're composed of RDDs and what they allow you to do in Scala. They're a similar abstraction to pandas DataFrames or R's DataFrames.
In this short article, we'll go over what Broadcast variables are, some of their uses, and how you should try and leverage them in your projects. We'll be covering topics like the broadcast join to keep your cluster from having to do too much work!