Yelp Ad Targeting at Scale with Apache Spark

Slides PDF Video

From training billions of ad impressions to scaling gradient boosted trees with more than three million nodes, Ad Targeting at Yelp uses Apache Spark in many stages of its large-scale machine learning pipeline.

This session will explore examples of how Yelp employed and tweaked Spark to support big data feature engineering, visualizations and machine learning model training, evaluation and diagnostics. You’ll also hear about the challenges in building and deploying such a large-scale intelligent system in a production environment.

Session hashtag: #SFml7

Inaz Alaei-Novin, Data Mining Engineer at Yelp

About Inaz

Inaz is originally from Iran. She did her M.A.Sc at The University of Toronto in Canada, and worked there for a year at Modiface. Currently she works as a data mining engineer at Yelp.

About Joe

Joe has over 16 years experience in technology, building systems from compilers, streaming media engines, distributed systems, and search engines for, natural language understanding in finance, and now works at Yelp building machine learning for Ad Targeting.