Speaker

Erik Erlandson, Senior Software Engineer at Red Hat

Erik Erlandson

Senior Software Engineer, Red Hat

Erik Erlandson is a Senior Software Engineer at Red Hat, where he investigates analytics use cases and scalable deployments for Apache Spark on clustering and cloud-enabled environments. Erik also consults on internal data science and analytics projects. He is a contributor to Apache Spark and other open source projects in the Spark ecosystem, including Algebird, Scala and Silex.

Sessions

Teaching Apache Spark Clusters to Manage Their Workers Elastically

Devops engineers have applied a great deal of creativity and energy to invent tools that automate infrastructure management, in the service of deploying capable and functional applications. For data-driven applications running on Apache Spark, the… Read more

Sketching Data with T-Digest In Apache Spark

Algorithms for sketching probability distributions from large data sets are a fundamental building block of modern data science. Sketching plays a role in diverse applications ranging from visualization, optimizing data encodings, estimating quantiles, data synthesis… Read more