Speaker

Photo of

Veena Basavaraj

Senior Software Engineer, Uber

Veena recently joined the data engineering team at Uber, focusing on stream processing solutions. She has worked at LinkedIn and at Cloudera in the past on various parts of the stack from front end, services, contributed to a couple of open source projects, and developed a keen interest in distributed systems such as Apache Kafka, Sqoop, and Apache Spark in the past year. As part of the ingest team at Cloudera, Veena was focusing on building solutions for batch and streaming ingestion and discovered the world of Apache Spark.

Sessions

Sqoop on Spark for Data Ingestion

Apache Sqoop has been used primarily for transfer of data between relational databases and HDFS, leveraging the Hadoop Mapreduce engine. Recently the Sqoop community has made changes to allow data transfer across any two data…