2016 Keynote Speakers

Matei Zaharia

CTO & Co-Founder, Databricks

Jeff Dean

Senior Fellow, Google

Doug Cutting

Chief Architect, Cloudera / Co-founder, Apache Hadoop, Cloudera

Andrew Ng

Chief Scientist, Baidu

Ali Ghodsi

CEO & Co-Founder, Databricks

Marvin Theimer

Distinguished Engineer, Amazon

Spark Summit 2016 by the Numbers

0 Days
0 Hours
0 Minutes
Countdown to Summit
3 Training Courses Check them out
96 Sessions See the full schedule
6 Tracks Data Science, Developer, Enterprise, Research, Spark Ecosystem, Use Cases

Enhance Your Apache Spark Skills

Join more than 2,500 engineers, analysts, scientists, and business professionals for three days of in-depth learning and networking.

With over 90 sessions and five tracks to choose from, there’s content for every level and role.


  • Structured Spark, Spark Streaming, and related projects.
  • The future of Apache Spark.
  • How to use the Spark stack in a variety of applications.
  • Best practices for deploying Spark at Scale.

Apache® Spark™ is a powerful open source processing engine built around speed, ease of use, and sophisticated analytics. It was started at UC Berkeley in 2009 and is now developed at the vendor-independent Apache Software Foundation. Since its release, Spark has seen rapid adoption by enterprises across a wide range of industries. Internet powerhouses such as Yahoo, eBay and Netflix have deployed Spark at massive scale, processing multiple petabytes of data on clusters of over 8,000 nodes. Apache Spark has also become the largest open source community in big data, with over 1000 contributors from 250+ organizations.

Day 1: Spark Training

A 3-day registration pass includes one full day of Apache Spark training from Databricks.

Classes offered at Spark Summit 2016

Apache Spark Essentials (Sold Out)
for beginners

Data Science with Apache Spark (Sold Out)
for data scientists

Exploring Wikipedia with Apache Spark (Sold Out)
for advanced Spark developers


Visit the Training Page

Days 2 and 3: Main Conference

Spark Summit has something for everyone, from developers and data scientists to researchers and business executives.

Find out what’s in store for:

Developer Day
June 7

Who it’s for:

  • Apache Spark Developers
  • Data Scientists
  • Infrastructure or Site Reliability Engineers
  • Researchers

Why you should attend:

  • Learn what’s ahead for the open source Spark project.
  • To hear from Spark committers on performance and memory optimization.
  • To see how others deploy Spark at scale.
  • To learn how data scientists deploy in R and discover machine learning at scale.
  • To hear researchers share their knowledge about spatial analyses and GPU support in Spark.

Enterprise Day
June 8

Who it’s for:

  • Data Practitioners
  • Key Decision Makers
  • Business Executives

Why you should attend:

  • To learn how Apache Spark is deployed in enterprises and best practices.
  • To find out how Spark is employed in a variety of applications.
  • To hear how leading enterprise Spark users solve business problems.


Hilton San Francisco Union Square

Hilton San Francisco Union Square

Conveniently located in the heart of San Francisco, the Hilton Union Square makes it easy to see what the city has to offer after the sessions close.

333 O’Farrell St
San Francisco, CA 94102 USA

+1 (415) 771-1400
+1 (855) 827-1009 (reservations only)

Learn more

Looking for a new gig? Check out the Job Board

Spark Summit 2016 Sponsors

If you have media questions, or would like to find out about sponsoring a Spark Summit, please contact press@spark-summit.org.