Speaker

Ryan Blue,  at Netflix

Ryan Blue

Netflix

Ryan Blue works on open source projects, including Spark, Avro, and Parquet, at Netflix.

Sessions

Improving Apache Spark with S3

Netflix’s Big Data Platform team manages data warehouse in Amazon S3 with over 60 petabytes of data and writes hundreds of terabytes of data every day. At this scale, output committers that create extra copies… Read more