San Francisco
June 30 - July 2, 2014

Spark Summit 2014 brought the Apache Spark community together on June 30- July 2, 2014 at the The Westin St. Francis in San Francisco. It featured production users of Spark, Shark, Spark Streaming and related projects.


Spark Summit 2014
The Emergence of the Enterprise Data Hub
Mike Olson (Chief Strategy Officer, Cloudera)

At inception, Apache Hadoop was a powerful and scalable new platform for storing and processing data at enormous scale. It was, though, difficult to use and poorly suited for use cases that required real time, advanced analytics capabilities, security, data governance and more. In the ensuing years, the platform has evolved tremendously, with new open source projects enhancing and complementing the original Hadoop core. Apache Spark is a tremendous example of that evolution, and represents the next generation of the MapReduce paradigm in the big data ecosystem. Taken together, these capabilities create a new product category — the enterprise data hub.

Mike (@mikeolson) co-founded Cloudera in 2008 and served as its CEO until 2013 when he took on his current role of chief strategy officer (CSO.) As CSO, Mike is responsible for Cloudera’s product strategy, open source leadership, engineering alignment and direct engagement with customers. Prior to Cloudera Mike was CEO of Sleepycat Software, makers of Berkeley DB, the open source embedded database engine. Mike spent two years at Oracle Corporation as vice president for Embedded Technologies after Oracle’s acquisition of Sleepycat in 2006. Prior to joining Sleepycat, Mike held technical and business positions at database vendors Britton Lee, Illustra Information Technologies and Informix Software. Mike has a Bachelor’s and a Master’s Degree in Computer Science from the University of California, Berkeley.

Slides PDF |Video