Sub banner

Site Reliability Engineer (SRE)

Job description.

Site Reliability Engineer (AWS)

Belfast - Hybrid

Full-time

Ocho is proud to partner with an exclusive client to recruit a skilled Site Reliability Engineer (SRE) with deep AWS expertise. This is a fantastic opportunity to join a growing global software organisation that powers mission-critical services across government and industry.

As a key member of the engineering team, you'll play a vital role in ensuring the reliability, availability, and performance of complex cloud-based infrastructure in a 24/7 production environment.

Key Responsibilities:

  • Build and manage secure, highly available AWS infrastructure.
  • Automate infrastructure deployment using Terraform, CloudFormation, or Ansible.
  • Implement and maintain monitoring and alerting systems with tools like CloudWatch, Prometheus, and Grafana.
  • Develop CI/CD pipelines using GitHub Actions, GitLab CI, or Jenkins.
  • Respond to incidents, troubleshoot, and perform root cause analysis.
  • Collaborate closely with development, DevOps, and security teams.

Experience:

  • 3+ years in an SRE, DevOps, or related role.
  • Hands-on experience with AWS (EC2, RDS, S3, EKS, etc.).
  • Skilled in infrastructure as code and scripting (Python, Bash, Go).
  • Experience with Docker, Kubernetes, and modern CI/CD workflows.
  • Strong problem-solving and communication skills.
  • Comfortable working in fast-paced, 24/7 production environments.

Additional Benefits:

  • Private Healthcare (BUPA)
  • Company Share Scheme - Buy one share, get one free
  • Generous Parental Leave - 26 weeks full maternity pay
  • Flexible Working Options - 1 day a week onsite (Belfast)
  • Income Protection & Life Assurance - Up to 4x salary

If you meet the above criteria, please apply now, alternatively feel free to reach out to Andrew Harrison directly for a confidential chat.

Submit CV for this Job.