Training: Data Science with Apache Spark


The Data Science with Apache Spark workshop will show how to use Apache Spark to perform exploratory data analysis (EDA), develop machine learning pipelines, and use the APIs and algorithms available in Spark ML and Spark MLlib. It is designed for software developers, data analysts, data engineers, and data scientists.

Photo of Jon Bates

About Jon

Jon is passionate about data science, computer science, and management science. A pragmatist at heart, he enjoys using tools from these fields to build a competitive edge in business. He spent nine years as a proprietary bond trader, where he built portfolio infrastructure and data analysis tools to maximize his and his team’s trading returns. He has a BS in Management Science from MIT and an MS in Predictive Analytics from Northwestern University. He was an assistant instructor for the Scalable Machine Learning MOOC recently offered via edX. Jon lives in Boulder, CO where he runs a consulting business focused on providing data science solutions and training.