Speaker

Amelia Arbisser, Sofware Engineer at Trifacta

Amelia Arbisser

Sofware Engineer, Trifacta

Amelia Arbisser is a software engineer at Trifacta. She works on a system for profiling data in Spark, and also contributes to the job execution stack. Prior to joining Trifacta, Amelia was an engineer at Twitter where she worked on relevance infrastructure for search and trends. She completed her Masters’ in CS at MIT in 2012.

Sessions

Scalable And Incremental Data Profiling With Spark

Data wrangling tools let analysts build workflows to transform large and unstructured datasets into cleaned, well structured columnar data. A key strategy for validating the cleaned data is profiling, which provides value distributions, anomaly counts… Read more