This Git repository features use cases of good and bad practices when using Spark-based tools to process and analyze data.