Nick Pentreath, Principal Engineer at IBM

Nick is a Principal Engineer at IBM. He is a member of the Apache Spark PMC and the author of Machine Learning with Spark. Previously, he co-founded Graphflow, a startup focused on recommendations and customer intelligence. He has worked at Goldman Sachs and Cognitive Match, and he led the Data Science team at Mxit, Africa’s largest social network. He’s passionate about combining commercial focus with machine learning and cutting-edge technology to build intelligent systems that learn from data to add business value.


Extending Apache Spark ML: Adding Your Own Algorithms and Tools

Apache Spark’s machine learning (ML) pipelines provide a lot of power, but sometimes the tools you need for your specific problem aren’t available yet. This talk introduces Spark’s ML pipelines, and then looks at how…

Feature Hashing for Scalable Machine Learning

Feature hashing is a powerful technique for handling high-dimensional features in machine learning. It is fast, simple, memory-efficient, and well suited to online learning scenarios. While an approximation, it has surprisingly low accuracy tradeoffs in…