Call for Presentations

The CFP is now closed. 

Spark Summits are the world’s largest big data events focused entirely on Apache Spark—assembling the very best engineers, scientists, analysts, and executives from around the globe to share their knowledge and receive expert training on this open-source powerhouse. Since our pioneering summit in 2013, thousands have come to learn how Spark, big data, machine learning, data engineering, and data science are delivering new insights to businesses and institutions worldwide.

Do you have a big idea to share, tips and tricks for community members embarking on the same journey, or a new developer tool or application to showcase? If so, we’d love to put your ideas, case studies, best practices, and technical knowledge in front of the largest gathering of big data professionals interested in all things Spark.

Suggested topics

These are just guidelines and suggestions; we are open to your creativity. The CFP was open from May 15 to June 16, 2017.


In this track, presenters cover technical content on internals and the latest developments in Spark Core and Spark SQL.


This track is dedicated to academic and advanced industrial research. Talks range from systems research involving Spark to research use cases (e.g., genomics, GPUs).

Data Science

This track spotlights the practice of data science using Spark. Sessions cover innovative techniques, algorithms, and systems that refine raw data into actionable insight using visualization, statistics, and machine learning.


This track features use cases on how businesses deploy Spark and the lessons learned. Talks offer an exploration into business use cases across industries, ROI, best practices, relevant business metrics, compliance requirements for specific industries, and customer testimonials.

Data Engineering

Running parallel to the Data Science track, this track focuses on production data pipelines, ETL, and operations.

Spark Ecosystem

This track will feature open source and proprietary applications, libraries, or frameworks in the Spark ecosystem.

AI and Machine Learning

AI and deep learning are all the buzz these days. This track covers algorithms, techniques, models, and platforms for machine learning. You will be able to select a focus area of AI or Machine Learning.

Streaming and Continuous Applications

This track covers streaming use cases and continuous applications that react to data in real time. Topics include Lambda architecture, Kappa architecture, Structured Streaming, Kafka, and Kinesis.

Deep Dives

This track, as the name suggests, offers 60-minute slots that allow a presenter to go deeper into a topic than the normal 30-minute sessions do. The session should be highly technical and include some demonstration. For example, “Deep Dive into Catalyst Optimizer,” followed by “Deep Dive into Catalyst Optimizer Hands-on Lab.” There will be a limited number of talks in this track.

Required information

You’ll need to include the following information for your proposal:

  • Proposed title
  • Presentation overview and extended description
  • Suggested topics
  • Speaker(s): Biography and headshot
  • A video of the speaker

Tips for submitting a successful proposal

Help us understand why your presentation is the right one for Spark Summit. Please keep in mind that this event is by and for professionals. All presentations and supporting materials must be respectful and inclusive.

  • Be authentic. Your peers need original ideas in real-world scenarios, relevant examples, and knowledge transfer.
  • Give your proposal a simple and straightforward title.
  • Include as much detail about the presentation as possible.
  • Keep proposals free of marketing and sales.
  • If you are not the speaker, provide the contact information of the person you’re suggesting. We tend to ignore proposals submitted by PR agencies and require that we can reach the suggested participant directly. Improve the proposal’s chances of being accepted by working closely with the presenter(s) to write a jargon-free proposal that contains a clear value for attendees.
  • Keep the audience in mind: they are professionals and already pretty smart.
  • Limit the scope: in 30 minutes, you won’t be able to cover ‘everything about framework X’. Instead, pick a useful aspect or a particular technique, or walk through a simple program.
  • Explain why people will want to attend and what they’ll take away from it.
  • Don’t assume that your company’s name buys you credibility. If you’re talking about something important that you have specific knowledge of because of what your company does, spell that out in the description.
  • Does your presentation have the participation of a woman, person of color, or member of another group often underrepresented at a tech conference? Diversity is one of the factors we seriously consider when reviewing proposals as we seek to broaden our speaker roster.