Eric Liang is a software engineer at Databricks, where he works on Spark’s backend execution as well as storage services. He has previously worked on storage performance at Google, and received his Bachelor’s in EECS from UC Berkeley.
The majority of reported Spark deployments are now in the cloud. In such an environment, it is preferable for Spark to access data directly from services such as Amazon S3, thereby decoupling storage and compute. …
Apache, Apache Spark, Spark, and the Spark logo are trademarks of the Apache Software Foundation. The Apache Software Foundation has no affiliation with and does not endorse the materials provided at this event.