Speaker

Mingjie Tang, Software Engineer at Hortonworks

Mingjie Tang is an engineer at Hortonworks, where he works on Spark SQL, Spark MLlib, and Spark Streaming. He has broad research interests in database management systems, similarity query processing, data indexing, big data computation, data mining, and machine learning. Mingjie completed his PhD in Computer Science at Purdue University.

Sessions

Spark HBase Connector: Feature Rich and Efficient Access to HBase Through Spark SQL

Both Spark and HBase are widely used, but using them together simply and with high performance is difficult. Spark HBase Connector (SHC) provides feature-rich and efficient access to HBase through…