Site Reliability Engineer (Platform Infrastructure)

Join Elastic, a company that values open source and innovation. As a Site Reliability Engineer, you will be part of the Infrastructure team, responsible for managing state, building and operating production services, and supporting internal adoption of the Elastic Stack. You will have the opportunity to work with a variety of programming languages and tools, and enjoy benefits such as fully paid health coverage, flexible location and schedule, generous vacation days, and more.

About the role

- Site-Reliability Engineering: We are fundamentally an operations team. We solve problems with code, but our core mission is keeping things running. Experience in an SRE or equivalent role is a strong indicator of fit
- Software Developer: You have a broad development background and are deeply proficient in at least one language. Our team uses Python, JavaScript, Clojure, and Haskell, but we work alongside engineers across the company using everything from Java to Go; the specific language matters less than the depth of your expertise
- Service-Oriented: You have multiple years of hands-on experience administering Linux systems, ideally at scale and in distributed environments. Experience helping operate a SaaS platform is a plus
- Infrastructure-as-code: You're comfortable automating production systems collaboratively: treating configuration as code, managing it through version control, and working with tools such as Docker, Terraform, Puppet, Chef, Ansible, Salt, Packer, Kubernetes, or your own well-crafted shell scripts
- A drive to automate and monitor everything. If it can be automated, you'll find a way
- Comfort with a versioned, Git-based workflow driven by issues and pull requests
- Experience building reusable software components; open source contributions (library, patch, documentation, or otherwise) are a bonus
- Strong Linux fundamentals. You know your way around syscall tracing, TCP internals, init systems (sysvinit/runit/systemd), and aren't afraid to go deep when a problem demands it
- A passion for open source, whether through code, mailing lists, documentation, or community participation
- Experience thriving in a distributed, asynchronous work environment with strong written communication habits
- A genuine appreciation for diverse, globally distributed teams and a collaborative, inclusive approach to getting work done

Key missions

- Design and develop tooling that facilitates building, testing, and shipping the Elastic Stack.
- Build and operate production services that power core aspects of the Elastic business, including downloads, Docker registry, maps service, and more.
- Support internal adoption of the Elastic Stack for software development and analytics use cases.