Speaker

Seth Hendrickson, Data Scientist at Cloudera

Seth Hendrickson

Data Scientist, Cloudera

Seth Hendrickson is a top Apache Spark contributor. He implemented multinomial logistic regression with elastic-net regularization in Spark’s ML library and has contributed several other performance improvements to linear models in Spark. He has also made extensive contributions to Spark ML decision trees and ensemble algorithms. Prior to joining IBM, Seth was an electrical engineer working on signal processing and IOT. He earned his M.S. in electrical engineering from Georgia Institute of Technology.

Sessions

Extending Spark Machine Learning: Adding Your Own Algorithms and Tools

Apache Spark’s machine learning (ML) pipelines provide a lot of power, but sometimes the tools you need for your specific problem aren’t available yet. This talk introduces Spark’s ML pipelines, and then looks at how… Read more