Ryan Blue,  at Netflix

Ryan Blue


Ryan Blue works on open source projects, including Spark, Avro, and Parquet, at Netflix.


Improving Apache Spark with S3

Netflix’s Big Data Platform team manages data warehouse in Amazon S3 with over 60 petabytes of data and writes hundreds of terabytes of data every day. At this scale, output committers that create extra copies… Read more