Evan Chan, Data Architect at TupleJump

Evan loves to design, build, and improve bleeding-edge distributed data and backend systems using the latest open-source technologies. He has led the design and implementation of multiple big data platforms based on Storm, Spark, Kafka, Cassandra, and Scala/Akka, including a columnar real-time distributed query engine. He is an active contributor to the Apache Spark project, a DataStax Cassandra MVP, and co-creator and maintainer of the open-source Spark Job Server. He is a big believer in GitHub, open source, and meetups, and has given talks at various conferences including Spark Summit, Cassandra Summit, FOSS4G, and Scala Days.


700 Queries Per Second with Updates: Spark As A Real-Time Web Service

Apache Spark has taken over machine learning and exploratory analytics, but is not often thought of as a platform capable of delivering sub-second, web-speed concurrent queries. Spark DataFrames have in-memory caching, but they cannot…