Speaker

Adam Silberstein, Director of Development at Trifacta

Adam Silberstein

Director of Development, Trifacta

Adam Silberstein is a director of development at Trifacta. His main area of interest is large-scale data processing, including in the batch processing and online serving spaces. His work has appeared in top database venues such as SIGMOD, VLDB, and ICDE. Prior to joining Trifacta, Adam was a Staff Software Engineer at LinkedIn in and a Research Scientist at Yahoo! Research. He completed his PhD at Duke University in 2007.

Sessions

Scalable And Incremental Data Profiling With Spark

Data wrangling tools let analysts build workflows to transform large and unstructured datasets into cleaned, well structured columnar data. A key strategy for validating the cleaned data is profiling, which provides value distributions, anomaly counts… Read more