Ankur Dave

PhD Student, UC Berkeley

Ankur is a second-year PhD student advised by Ion Stoica in the UC Berkeley AMPLab. He is a Spark committer and a maintainer for GraphX.


IndexedRDD: Efficient Fine-Grained Updates for RDDs

Spark’s core abstraction is the RDD, an immutable distributed dataset. Spark requires immutability to enable dataset reuse, fault tolerance, and straggler mitigation. But new Spark applications like streaming aggregation and incremental graph processing seem to…