eDV DevOps Engineer/Site Reliability Engineer

itecopeople
2 days ago

Role details

Contract type
Temporary contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Compensation
£ 169K

Tech stack

Amazon Web Services (AWS)
Advanced Message Queuing Protocol
Azure
Cloud Computing
Configuration Management
Continuous Integration
Relational Databases
Linux
DevOps
Network Security
Linux Commands
Openshift
RabbitMQ
Reliability Engineering
Ansible
Prometheus
Shell Script
Software Engineering
SQL Databases
Docker Swarm
Grafana
Reliability of Systems
Kubernetes
Infrastructure Automation Frameworks
InfluxDB
Functional Programming
Terraform
Docker
Jenkins

Job description

We are supporting a specialist engineering consultancy delivering secure technology platforms to high-profile UK government organisations. They are seeking an eDV-cleared DevOps Engineer/Site Reliability Engineer (SRE) with strong experience across AWS, Kubernetes, Terraform, CI/CD and Linux environments to support the continued growth of critical cross-domain systems. This contract role will focus on improving platform reliability, automation, infrastructure as code, observability and DevOps practices across both cloud and on-premise environments. You will work closely with software engineers, platform engineers and operations teams to ensure highly secure, scalable and resilient systems supporting sensitive government programmes.

As a DevOps/Site Reliability Engineer, you will be responsible for ensuring the availability, performance, and reliability of services supporting sensitive government programmes. You will collaborate with multiple feature development teams and BAU/support teams to evolve both cloud and on-premise infrastructure, delivery pipelines, and observability tooling. The role will focus on improving system reliability, monitoring, automation, and performance, while proactively identifying and mitigating operational risks. This position may also involve participation in an on-call rota, which could include occasional 24/7 call-out support.

  • Collaborate with software engineering teams to improve subsystem reliability and performance.
  • Work with system administrators to automate operational processes and reduce manual effort.
  • Enhance monitoring and observability capabilities to proactively detect and resolve issues.
  • Support development environments to improve delivery speed and quality.
  • Contribute to the evolution of infrastructure, DevOps practices, and CI/CD pipelines.
  • Research and evaluate new technologies and tools to support engineering decisions.
  • Develop expertise across multiple technical and business domains.

Requirements

  • Active eDV clearance is essential
  • Experience with configuration management tools such as Ansible, Chef, or similar
  • Strong Terraform experience
  • Experience with Docker containers and container orchestration platforms (Kubernetes, OpenShift, Docker Swarm)
  • Experience maintaining and using CI/CD tooling such as Jenkins
  • Monitoring and observability experience with Prometheus, Grafana, or InfluxDB
  • Experience with event-driven integration and messaging systems such as RabbitMQ or other AMQP solutions
  • Strong Linux command line, administration, and Shell Scripting experience
  • Solid understanding of relational databases and SQL
  • Understanding of network security protocols
  • Experience working with cloud platforms, ideally AWS (EC2, RDS, S3, Lambda); Azure is a plus
