Confluent Cloud Site Reliability Engineer (Confluent)

Confluent is an infrastructure company founded and led by the original authors and thought leaders behind Apache Kafka. We are building an open-source, enterprise-class stream data platform for real-time data and stream processing centered around Kafka. We believe that the future of enterprise applications will be built atop a streaming platform, and we want to put it at the heart of every company.
The next big goal for the company is to make it as easy as possible for anyone in the world to use Confluent’s products, including Kafka, to build their next killer streaming application. To do that we need to offer Confluent’s products as a Platform as a Service. (PaaS). In order for this product to be successful we absolutely have to build a world class team of Site Reliability Engineers (SREs) that are passionate about running large scale, multi-tenant distributed data systems for customers that expect a very high level of availability.
As a Confluent SRE, you will be working alongside the rest of the Confluent engineers to build our PaaS product. You, and the rest of the SRE team, will be responsible for the availability, performance, monitoring, emergency response, and capacity planning of the Confluent cloud. If you love the hum of big data systems, thinking about how to make them run as smoothly as possible, and want to have a big influence on the architecture plus operational design points of this new product, then you will fit right in.

What we’re looking for:

    • Strong fundamentals in distributed systems design and operation
    • Experience building automation to operate large-scale data systems
    • Solid experience working with large private or public clouds
    • A self starter with the ability to work effectively in teams
    • Excellent spoken / written communication
    • Required proficiency in Python, shell scripting, and working in/around github
    • Bachelor’s degree in Computer Science or similar field or equivalent

What gives you an edge:

    • Experience using Apache Kafka is a big plus
    • Ability to work with remote teams
    • Interest in evangelism (giving talks at tech conferences, writing blog posts evangelizing the challenges of running large data systems, like Kafka)
    • Nice to have proficiency in Java, Go, and/or C.



Job posted 5/31/2017