Jump to:   Spark Training   Developer Day   Enterprise Day   Live Stream

 

Day 1 • Tuesday, February 7 • Spark Training

 

Day 2 • Wednesday, February 8 • Developer Day

7:00 AM

Registration

9:00 AM

What to Expect for Big Data and Apache Spark in 2017

Big data remains a rapidly evolving field with new applications and infrastructure appearing every year. In this talk, I’ll cover new trends in 2016 / 2017 and how Apache Spark is moving to meet them.… Read more
9:20 AM

Using Apache Spark for Intelligent Services

Salesforce is developing Einstein which is an artificial intelligence (AI) capability built into the core of the Salesforce Platform. Einstein helps power the world’s smartest CRM to deliver advanced AI capabilities to sales, services, and… Read more
9:40 AM

Production-Ready Structured Streaming

In Spark 2.0, we introduced Structured Streaming, which allows users to continually and incrementally update your view of the world as new data arrives, while still using the same familiar Spark SQL abstractions. I talk… Read more
9:55 AM

Scaling Genetic Data Analysis with Apache Spark

In 2001, it cost ~$100M to sequence a single human genome. In 2014, due to dramatic improvements in sequencing technology far outpacing Moore’s law, we entered the era of the $1,000 genome. At the same… Read more
10:15 AM

RISELab: Enabling Intelligent Real-Time Decisions

A long-standing grand challenge in computing is to enable machines to act autonomously and intelligently: to rapidly and repeatedly take appropriate actions based on information in the world around them. To address this challenge, at… Read more
10:30 AM

Break

11:00 AM
11:40 AM
12:20 PM
Spark Ecosystem

Lessons Learned from Dockerizing Spark Workloads

Developer

Cost-Based Optimizer Framework for Spark SQL

Spark Experience and Use Cases

Spark as the Gateway Drug to Typed Functional Programming

Sponsored Sessions

Women In Big Data Lunch

12:50 PM

Lunch

2:00 PM
Developer

Optimizing Apache Spark SQL Joins

Spark Experience and Use Cases

Exploring Spark for Scalable Metagenomics Analysis

Data Science

Tuning and Monitoring Deep Learning on Apache Spark

Sponsored Sessions

Women In Big Data Lunch

2:40 PM
3:20 PM
3:50 PM

Break

4:20 PM
5:00 PM
Spark Ecosystem

Apache Toree: A Jupyter Kernel for Spark

Spark Experience and Use Cases

Migrating from Redshift to Spark at Stitch Fix

Sponsored Sessions

Building the Ideal Stack for Real-Time Analytics

Research

Analysis Andromeda Galaxy Data Using Spark

5:40 PM
6:10 PM

Attendee Reception

8:00 PM

End of Day

 

Day 3 • Thursday, February 9 • Enterprise Day

8:00 AM

Registraion

9:00 AM

Virtualizing Analytics with Apache Spark

In the race to invent multi-million dollar business opportunities with exclusive insights, data scientists and engineers are hampered by a multitude of challenges just to make one use case a reality – the need to… Read more
9:20 AM

Big Data Meets Learning Science

How do we learn and how can we learn better? Educational technology is undergoing a revolution fueled by learning science and data science. The promise is to make a high-quality personalized education accessible and affordable… Read more
9:30 AM

Accelerating Machine Learning and Deep Learning At Scale...With Apache Spark

Deep learning is a fast growing subset of machine learning. There is an emerging trend to conduct deep learning in the same cluster along with existing data processing pipelines to support feature engineering and traditional… Read more
9:40 AM

Artificial Intelligence: How Enterprises Can Crush It With Apache Spark

Artificial intelligence (AI) is not new. It emerged as a computer science discipline in the 50’s and has been a persistent theme in science fiction. What is new is that enterprises now have the prerequisites… Read more
10:00 AM

Data Science Transformation Via Apache Spark on Hybrid Cloud

Most enterprises have their business running on legacy environments on premise. Just picking up and moving everything to the cloud isn’t an option for the vast majority. Cloud migration requires a critical mass of data,… Read more
10:10 AM

Apache Spark in Cloud and Hybrid: Why Security and Governance Become More Important

An Increasing number of Apache Spark deployments are in Cloud and hybrid environments. This often means that Spark workloads are ephemeral but the data exists in a durable storage either in cloud and on-prem. The… Read more
10:20 AM

Break

11:00 AM
Spark Ecosystem

Auto Scaling Systems With Elastic Spark Streaming

Spark Experience and Use Cases

Learnings Using Spark Streaming and DataFrames for Walmart Search

Data Science

Scalable Data Science with SparkR

Research

Sparkler—Crawler on Apache Spark

11:40 AM
12:20 PM
Spark Ecosystem

Kerberizing Spark

Spark Experience and Use Cases

Sparking Up Data Engineering

Research

Large-Scale Text Processing Pipeline with Spark ML and GraphFrames

12:50 PM

Lunch

2:00 PM
2:40 PM
Developer

Spark and Online Analytics

Spark Experience and Use Cases

Fault Tolerance in Spark: Lessons Learned from Production

Data Science

Scaling Apache Spark MLlib to Billions of Parameters

Research

Algorithms and Tools for Genomic Analysis on Spark

3:20 PM
3:50 PM

Break

4:20 PM
5:00 PM
5:40 PM
Developer

SparkSQL: A Compiler from Queries to RDDs

Spark Experience and Use Cases

Keeping Spark on Track: Productionizing Spark for ETL

Data Science

Parallelizing Existing R Packages with SparkR

Enterprise

 

Can’t make it to Spark Summit?

Register to watch Spark Summit East 2017 for FREE via live web streaming.

The Spark Summit live stream will be active from 9:00 PM to 6:00 PM Eastern Time on Wednesday, February 8 through Thursday, February 9, 2017.

Register now