Position Description: CGI was recognised in the Sunday Times Best Places to Work List and named one of the World’s Best Employers by Forbes. We offer a competitive salary, excellent pension, private healthcare, and a share scheme (3.5% + 3.5% matching), making you a CGI Partner rather than just an employee. We are committed to inclusivity and building a genuinely diverse community of technology professionals.
CGI’s Space, Defence and Intelligence business unit delivers innovative, mission-critical solutions to some of the UK government’s most complex and challenging problems. We design, build, and operate bespoke, secure systems that help keep the nation safe and secure. Our work spans cloud, on-prem, and hybrid environments, underpinned by strong cyber and engineering best practices.
We are seeking experienced Site Reliability Engineers to join our cross-functional teams supporting secure, cloud-based and big-data platforms. Working closely with clients and internal delivery teams, you will help design, build, operate, and continuously improve highly reliable, scalable systems within a secure environment.
Due to the nature of the work, this role requires onsite attendance and applicants must be solely UK Nationals (not dual nationals and not visa holders) with existing HMG DV / HLC clearance. Your future duties and responsibilities: Architect, build and operate cloud infrastructure
Design and deliver secure, scalable, fault-tolerant infrastructure across AWS (and potentially Azure/GCP), optimised for performance, availability, and cost.
Support and operate live systems
Maintain deployment environments, support production systems, and manage changes to live services in mission-critical environments.
You will work with a modern and diverse technology stack, including:
Languages & Scripting: Java, Python, Go, JavaScript (TypeScript), Bash
Frontend: Vue
CI/CD & Automation: Jenkins, GitLab CI, Ansible, Cucumber
Cloud & Platforms: AWS, OpenShift, Linux
Infrastructure as Code: Terraform
Data & Streaming: Apache NiFi
Monitoring & Logging: Grafana, ELK Stack, SonarQube
Version Control: GitLab
Automation & Infrastructure as Code (IaC)
Automate infrastructure provisioning, configuration, and CI/CD pipelines using Terraform, Ansible, CloudFormation, and scripting languages.
CI/CD & Platform Engineering
Build, maintain, and improve CI/CD pipelines using Jenkins, GitLab CI/CD, and related tooling.
Monitoring, reliability & optimisation
Implement and manage monitoring, logging, alerting, and performance optimisation using tools such as Grafana, ELK Stack, CloudWatch, Prometheus, and SonarQube.
Security & compliance
Implement IAM, encryption, network controls, and secure configurations in line with government and industry standards (e.g. GDPR, ISO ).
Disaster recovery & resilience
Design and test backup, disaster recovery, and self-healing strategies to ensure system reliability and availability.
Collaboration & documentation
Work closely with development, DevOps, security, and operations teams; produce runbooks, architecture documentation, and knowledge-sharing materials. Required qualifications to be successful in this role: Proven experience as a Site Reliability Engineer, DevOps Engineer, or similar role supporting cloud-based or secure systems
Strong automation and scripting skills (Python, Bash, Go, or similar)
Hands-on experience with CI/CD pipelines (Jenkins, GitLab CI/CD, etc.)
Strong knowledge of Infrastructure as Code (Terraform, CloudFormation, Ansible)
Solid understanding of AWS and cloud-native architectures
Experience with Linux-based environments and production troubleshooting
Familiarity with containerisation and orchestration (OpenShift or Kubernetes)
Experience with monitoring, logging, and observability tools
Comfortable supporting live services in production environments
Strong communication skills and a proactive, problem-solving mindset Skills: DevOpsEnglishGitLabKubernetesCloud Native Development
Read Less