Sub banner

Senior DevOps Engineer - USA

Job description.

At a Glance


•    Senior DevOps / Platform Engineering role at a fast-growing agentic AI startup
•    Build and own the infrastructure powering a next-generation AI test automation platform
•    Fully remote — open to candidates across the United States
•    Greenfield infrastructure opportunity with real ownership and architectural influence
•    Base salary $150,000–$190,000 + equity

About Our Client


Ocho People are proud to be partnering with a high-growth AI software business that is redefining how engineering teams test and ship software. Our client has built an agentic AI-powered test automation platform that enables organisations to dramatically reduce manual QA effort, accelerate release cycles, and catch critical defects earlier in the development lifecycle — all without writing a single line of test code. Backed by leading investors and trusted by engineering teams at scale-up and enterprise level, the business is entering an exciting phase of growth. With a product that sits at the intersection of AI, developer tooling, and CI/CD, this is a rare opportunity to join a company that is genuinely changing how software gets built.


The Role
This is a senior individual contributor role for a DevOps or Platform Engineer who wants to build, not maintain. You’ll be joining a lean, high-calibre engineering team and taking ownership of the infrastructure that underpins a production AI platform — from cloud architecture and CI/CD pipelines through to the compute environments that run agentic AI workloads at scale. You’ll work closely with product engineers, ML engineers, and the CTO, and have a genuine voice in architectural decisions. If you’re energised by greenfield challenges, care about developer experience, and want your work to directly shape the trajectory of an early-stage company, this role will suit you well.


Key Responsibilities
•    Design, build, and maintain scalable cloud infrastructure across AWS, GCP, or Azure to support a production agentic AI platform
•    Own CI/CD pipeline architecture end-to-end, enabling fast, safe, and reliable software delivery
•    Build and manage containerised workload infrastructure using Kubernetes, including orchestration of AI/ML compute environments
•    Implement infrastructure-as-code across all environments using Terraform, Pulumi, or equivalent
•    Drive observability and reliability through robust monitoring, alerting, logging, and incident response frameworks
•    Collaborate with product and ML engineers to provision and optimise GPU/compute infrastructure for AI workload execution
•    Champion developer experience — reducing friction in local development, testing, and deployment workflows
•    Define and enforce security best practices across cloud environments, secrets management, and access controls
•    Contribute to engineering culture by documenting systems, sharing knowledge, and supporting a blameless incident process

What You’ll Need


Essential
•    5+ years’ experience in DevOps, Platform Engineering, or Site Reliability Engineering
•    Deep hands-on expertise with at least one major cloud provider (AWS, GCP, or Azure) at production scale
•    Strong Kubernetes experience — cluster management, workload orchestration, and scaling
•    Infrastructure-as-code proficiency with Terraform, Pulumi, or CloudFormation
•    Solid CI/CD experience with tools such as GitHub Actions, CircleCI, ArgoCD, or equivalent
•    Experience building and operating containerised environments with Docker
•    Proficiency in at least one scripting language (Python, Bash, or Go)
•    Right to work in the USA without restriction
Desirable / Nice to Have
•    Experience provisioning and managing GPU compute or AI/ML workload infrastructure
•    Familiarity with agentic AI frameworks, LLM orchestration, or MLOps tooling
•    Background at a high-growth Series A–C software or AI startup

Why Apply?
•    Base salary $150,000–$190,000 depending on experience, plus meaningful equity in a high-growth AI company
•    Fully remote — work from anywhere in the United States with flexible hours
•    20 days PTO plus federal holidays, increasing with tenure
•    Comprehensive medical, dental, and vision coverage; 401(k) with company match
•    Real ownership — greenfield infrastructure built by you, not inherited from someone else
•    Direct access to founders and the CTO — your voice carries weight from day one
•    Work at the frontier of AI and developer tooling in one of the most exciting product categories in tech

How to Apply
Connect with your Ocho People consultant on LinkedIn or submit your CV via the link below. All applications are treated in the strictest confidence.

Submit CV for this Job.

Apply for this job now
Posted
Job Details:
United States$150,000 - 190,000
Job reference:
CR262622
CHRIS RYAN

CHRIS RYAN

Principal Technology Recruiter at Ocho

An accomplished and results-driven recruiter with over 17 years of experience in both agency and internal recruitment across the EMEA and APAC regions, with a strong focus on the financial services and cybersecurity sectors.

Read More