Spark Summit 2013 brought the Apache Spark community together on December 2-3, 2013 at the Hotel Nikko in San Francisco. It featured production users of Spark, Shark, Spark Streaming and related projects.
In this talk we will discuss how Adatao has successfully built a full-featured, powerful enterprise analytics solution with Spark. Features include web-based reporting/visualization/publishing (“basic analytics”) as well as real-time, interactive data mining and machine learning (“advanced analytics”) on large data sets. What used to take hours are now routinely accomplished in seconds. We will present architecturally how this was accomplished using Spark/Shark/HDFS and other subsystems, with Python and R scriptable front-ends. We will also discuss some use cases where large enterprises are successfully deploying this solution, and lessons learned.