Staff DevOps Engineer — Developer Infrastructure - US (Remote)

Luxury Presence · United States

Lever · Posted Mar 17, 2026 · First seen May 22, 2026
Luxury Presence is building the AI growth platform for real estate. Backed by Bessemer Venture Partners and other top investors, we're a Series C company that has hit $100M in annual recurring revenue. More than 90,000 real estate professionals, including over 30% of the WSJ Real Trends top 100 agents in the United States, use us to run and grow their business.

The Opportunity

We're hiring a Staff DevOps Engineer to build the AI-native, self-service infrastructure platform that powers how Luxury Presence ships software — a platform where every developer, human or agent, can validate a change in minutes, understand failures immediately, and ship confidently.

You'll join a small, high-leverage Infrastructure team with a clear mandate: multiply what every engineer and agent can do through automation, self-service, and guardrails-by-design. The goal is a platform where self-service is the default and human intervention is the exception — and you'll be central to making that real.

What You'll Do

  • Build and operate the self-service infrastructure platform. Make the right thing the easy thing — and the secure thing the default. Own core layers of the platform that developers and agents build on: a unified CI/CD pipeline, GitOps-based delivery (ArgoCD), an IaC module catalog, and golden-path scaffolding for new services. The goal is for engineers and agents to provision infrastructure and ship changes within guardrails — without hand-writing Terraform/Kustomize or waiting on a ticket.

  • Make agents first-class developers. Help generalize our agent isolation pattern into reusable platform primitives: sandboxed runtime, workload identity, audit trails, and approval gates. Every agent action runs on the same identity, audit, ownership, cost-attribution, and approval rails as a human's.

  • Own the validation platform and quality gate frameworks. Provide the rails teams use to validate changes before production: ephemeral PR environments with auto-teardown, plus the security, performance, and load-testing frameworks that developers rely on. Make gate and test results machine-readable and callable outside the PR flow, so agents and humans alike can act on a failure directly.

  • Build internal tooling, observability, and metrics. Create the dashboards and operational controls that make our pipelines observable and improvable at increasing throughput — gate pass rates, approval-rate and self-service-ratio trends, common failure modes, cost attribution, and DORA metrics surfaced to teams.

What We're Looking For

  • Staff-level ownership in DevOps, Platform Engineering, or SRE — you've built and operated production platforms at scale and can lead a foundational initiative end to end.

  • Active, opinionated use of AI development tools (Claude Code, Codex, etc.) in your own infrastructure workflow: Terraform changes, Kubernetes debugging, automation, and operational investigations. You have a point of view on where these tools help and where they don't.

  • Deep Kubernetes (EKS) and AWS experience (IAM, VPC, ECR, SSM/Secrets Manager, S3, SQS, Lambda, RDS/Aurora).

  • Strong IaC (Terraform) and GitOps experience, including PR-driven apply workflows (Atlantis or similar) and ArgoCD.

  • Identity, access, and policy-as-code experience: workload/service identity (SPIFFE/SPIRE, OIDC), short-lived credentials, secrets management (Vault), and policy enforcement.

  • A proven track record building self-service developer platforms, ephemeral environments, CLIs, scaffolding tools, or internal developer portals (Backstage or custom). You treat infrastructure as a product.

  • CI/CD depth (GitHub Actions), including caching/parallelism, artifact management, test reliability, and pipeline observability.

  • Cost-awareness: you've built cost attribution, budgets, or rightsizing into a platform.

  • Excellent cross-team communication — you can translate platform constraints into developer-friendly solutions and documentation.

Our Tech Stack

  • Infrastructure: AWS, EKS, Terraform (with Atlantis), Vault, Docker, Crossplane.

  • CI/CD: GitHub Actions, ArgoCD + Kustomize (GitOps).

  • Messaging: Kafka (Confluent Cloud).

  • Observability: Datadog, OpenTelemetry.

  • Languages/Apps: Node.js/TypeScript microservices, Python, React front-ends.