Site Reliability Engineer (AWS)
Belfast - Hybrid
Full-time
Ocho is proud to partner with an exclusive client to recruit a skilled Site Reliability Engineer (SRE) with deep AWS expertise. This is a fantastic opportunity to join a growing global software organisation that powers mission-critical services across government and industry.
As a key member of the engineering team, you'll play a vital role in ensuring the reliability, availability, and performance of complex cloud-based infrastructure in a 24/7 production environment.
Key Responsibilities:
- Build and manage secure, highly available AWS infrastructure.
- Automate infrastructure deployment using Terraform, CloudFormation, or Ansible.
- Implement and maintain monitoring and alerting systems with tools like CloudWatch, Prometheus, and Grafana.
- Develop CI/CD pipelines using GitHub Actions, GitLab CI, or Jenkins.
- Respond to incidents, troubleshoot, and perform root cause analysis.
- Collaborate closely with development, DevOps, and security teams.
Experience:
- 3+ years in an SRE, DevOps, or related role.
- Hands-on experience with AWS (EC2, RDS, S3, EKS, etc.).
- Skilled in infrastructure as code and scripting (Python, Bash, Go).
- Experience with Docker, Kubernetes, and modern CI/CD workflows.
- Strong problem-solving and communication skills.
- Comfortable working in fast-paced, 24/7 production environments.
Additional Benefits:
- Private Healthcare (BUPA)
- Company Share Scheme - Buy one share, get one free
- Generous Parental Leave - 26 weeks full maternity pay
- Flexible Working Options - 1 day a week onsite (Belfast)
- Income Protection & Life Assurance - Up to 4x salary
If you meet the above criteria, please apply now, alternatively feel free to reach out to Andrew Harrison directly for a confidential chat.