San Francisco
June 30 - July 2, 2014

Spark Summit 2014 brought the Apache Spark community together on June 30 - July 2, 2014 at the Westin St. Francis in San Francisco. It featured production users of Spark, Shark, Spark Streaming, and related projects.

Spark Summit 2014 Training Archive

Databricks Spark training was offered as part of the 3-day pass to the Spark Summit and comprised an introductory track and an advanced track. Both tracks began at 9am on July 2, 2014 and finished by 5pm. Lunch was included.

You can download the course materials HERE

Course Prerequisites:

  • Laptop with WiFi capabilities
  • Java 6 or 7

TRACK A: Introduction to Apache Spark Workshop

The Introduction to Apache Spark workshop is designed for users who want to learn the core Spark APIs. This session features hands-on technical exercises to get developers up to speed in using Spark for data exploration, analysis, and building big data applications.

The integrated lecture and lab format covers the following topics:

  • Overview of Big Data and Spark
  • Installing Spark Locally
  • Using Spark’s Core APIs in Scala, Java, & Python
  • Building Spark Applications
  • Deploying on a Big Data Cluster
  • Building Applications for Multiple Platforms

TRACK B: Advanced Apache Spark Workshop

The Advanced Apache Spark Workshop will cover advanced topics on architecture, tuning, and each of Spark’s high-level libraries (including the latest features). After the lunch break, attendees will have the opportunity to work through labs on each of the libraries.

Some familiarity with Spark or MapReduce is expected, as this workshop will not cover basic Spark programming.

Topics covered include: