Graphs are a simple way of representing relationships in data and Apache Spark provides a simple way of creating and manipulating them. This tutorial will walk you through the basics of GraphX in Apache Spark using Scala. You'll analyze flight data from 2008 and run algorithms like PageRank to better understand all the flights that took place!
In this easy to follow tutorial, learn the basics of Spark DataFrames, how they're composed of RDDs and what they allow you to do in Scala. They're a similar abstraction to pandas DataFrames or R's DataFrames.
This guide will show you how to read in csv files in Apache Spark. We'll walk through how to use this package in both Python and Scala.