Evan Chan

Data Architect, TupleJump

Evan loves to design, build, and improve bleeding-edge distributed data and backend systems using the latest in open-source technologies. He has led the design and implementation of multiple big data platforms based on Storm, Spark, Kafka, Cassandra, and Scala/Akka, including a columnar real-time distributed query engine. He is an active contributor to the Apache Spark project, a DataStax Cassandra MVP, and co-creator and maintainer of the open-source Spark Job Server. He is a big believer in GitHub, open source, and meetups, and has given talks at various conferences including Spark Summit, Cassandra Summit, FOSS4G, and Scala Days.


Productionizing Spark and the Spark REST Job Server

This is a two-part talk. The first part covers general deployment, configuration, and application running tips for Apache Spark, from my personal experience setting up and running Spark clusters since the early days of version…