Jobs | Jobs Hiring near me

Oscar Technology

Platform Architect Hybrid £70,000-£100,000 About the Role: We're partnering with a growing SaaS business to hire a senior Platform Architect to own the design, security, reliability, and operational management of their AWS platform and internal IT function. This is a hands-on leadership role in a lean organisation where you'll shape cloud architecture, modernise a legacy platform into a cloud-native environment, and provide senior oversight across platform engineering, security, SRE, CI/CD, and operational IT. Key Responsibilities: Own the AWS platform architecture and modernisation roadmap, including migration from a Java monolith to microservices on EKS. Define standards for containers, runtime environments, observability, tenancy, security, and infrastructure automation. Lead SRE practices including SLI/SLOs, incident management, DR/BCP planning, post-mortems, and operational resilience. Own platform security, secure SDLC, CI/CD pipelines, IaC, and software supply chain governance. Drive developer productivity through automation, self-service tooling, and platform standardisation. Provide senior oversight of IT operations including service desk governance, endpoint management, onboarding/offboarding, patching, ITAM, and MSP/vendor management. Act as a senior escalation point for critical incidents, outages, and operational issues. About You: Experience within a platform, infrastructure, or software engineering within SaaS environments. Strong AWS expertise including EKS, IAM, networking, KMS, RDS, and multi-account architecture. Hands-on Kubernetes, CI/CD, Terraform, and cloud security experience. Strong understanding of SRE, observability, incident response, and disaster recovery. Experience operating within regulated environments such as ISO 27001, SOC 2, or GxP. Comfortable balancing strategic leadership with hands-on operational delivery. AWS Solutions Architect - Professional certification required. CKA or CKS certification highly desirable. Platform Architect Hybrid £70,000-£100,000 Oscar Associates (UK) Limited is acting as an Employment Agency in relation to this vacancy. To understand more about what we do with your data please review our privacy policy in the privacy section of the Oscar website.

Jun 11, 2026

Full time

Platform Architect Hybrid £70,000-£100,000 About the Role: We're partnering with a growing SaaS business to hire a senior Platform Architect to own the design, security, reliability, and operational management of their AWS platform and internal IT function. This is a hands-on leadership role in a lean organisation where you'll shape cloud architecture, modernise a legacy platform into a cloud-native environment, and provide senior oversight across platform engineering, security, SRE, CI/CD, and operational IT. Key Responsibilities: Own the AWS platform architecture and modernisation roadmap, including migration from a Java monolith to microservices on EKS. Define standards for containers, runtime environments, observability, tenancy, security, and infrastructure automation. Lead SRE practices including SLI/SLOs, incident management, DR/BCP planning, post-mortems, and operational resilience. Own platform security, secure SDLC, CI/CD pipelines, IaC, and software supply chain governance. Drive developer productivity through automation, self-service tooling, and platform standardisation. Provide senior oversight of IT operations including service desk governance, endpoint management, onboarding/offboarding, patching, ITAM, and MSP/vendor management. Act as a senior escalation point for critical incidents, outages, and operational issues. About You: Experience within a platform, infrastructure, or software engineering within SaaS environments. Strong AWS expertise including EKS, IAM, networking, KMS, RDS, and multi-account architecture. Hands-on Kubernetes, CI/CD, Terraform, and cloud security experience. Strong understanding of SRE, observability, incident response, and disaster recovery. Experience operating within regulated environments such as ISO 27001, SOC 2, or GxP. Comfortable balancing strategic leadership with hands-on operational delivery. AWS Solutions Architect - Professional certification required. CKA or CKS certification highly desirable. Platform Architect Hybrid £70,000-£100,000 Oscar Associates (UK) Limited is acting as an Employment Agency in relation to this vacancy. To understand more about what we do with your data please review our privacy policy in the privacy section of the Oscar website.

Telemetry and Observability Engineer

Oscar Technology

Telemetry and Observability Engineer (Inside IR-35) London / Hybrid (3 days on-site) I'm working with a global organisation building next-generation cloud-native and observability platforms at enterprise scale, and they're looking for a strong Senior Observability Engineer to join the team. This is a high-impact role focused on scalable telemetry pipelines, monitoring, alerting, reliability engineering, and embedding observability across complex distributed systems and Kubernetes environments. Key experience needed: Observability / SRE / Platform Engineering background OpenTelemetry , Prometheus, Grafana, Splunk, Elastic, Loki, or Jaeger Kubernetes, microservices, and cloud-native platforms Python, Go, or Java Terraform, Helm, and IaC SLIs, SLOs, alerting, and reliability engineering Financial services or regulated environment experience is a bonus. Great opportunity to work with cutting-edge technology, influence engineering standards, and help shape observability at enterprise scale. Interested? Drop me a message or send over your CV. Oscar Associates (UK) Limited is acting as an Employment Business in relation to this vacancy. To understand more about what we do with your data please review our privacy policy in the privacy section of the Oscar website.

Jun 11, 2026

Contractor

Telemetry and Observability Engineer (Inside IR-35) London / Hybrid (3 days on-site) I'm working with a global organisation building next-generation cloud-native and observability platforms at enterprise scale, and they're looking for a strong Senior Observability Engineer to join the team. This is a high-impact role focused on scalable telemetry pipelines, monitoring, alerting, reliability engineering, and embedding observability across complex distributed systems and Kubernetes environments. Key experience needed: Observability / SRE / Platform Engineering background OpenTelemetry , Prometheus, Grafana, Splunk, Elastic, Loki, or Jaeger Kubernetes, microservices, and cloud-native platforms Python, Go, or Java Terraform, Helm, and IaC SLIs, SLOs, alerting, and reliability engineering Financial services or regulated environment experience is a bonus. Great opportunity to work with cutting-edge technology, influence engineering standards, and help shape observability at enterprise scale. Interested? Drop me a message or send over your CV. Oscar Associates (UK) Limited is acting as an Employment Business in relation to this vacancy. To understand more about what we do with your data please review our privacy policy in the privacy section of the Oscar website.

Site Reliability Engineer (SRE) - Cloud & Automation

Spencer Rose Ltd

Site Reliability Engineer (SRE) - Cloud & Automation London, Docklands (hybrid) £80,000 - £90,000 per annum + annual discretionary bonus On behalf of a leading financial services organisation, I'm looking for a highly capable Site Reliability Engineer (SRE) to drive the adoption of SRE methodologies across their Cloud-hosted environment and act as the central point of expertise for automation within the Platform Operations function. This role is ideal for someone who thrives in complex, regulated environments and is passionate about building reliable, scalable, and automated cloud platforms. The organisation is pleased to offer the role on a hybrid basis with 2 days per week in their Canary Wharf office, therefore you must be within a reasonable commute of London. Responsibilities: Lead the implementation of SRE practices across the organisation, working closely with infrastructure teams to optimise deployment processes and embed automation and operational excellence. Enhance observability and reliability , defining and implementing SLAs, SLOs and SLIs to improve alerting, monitoring, and capacity planning. Identify and eliminate toil , developing frameworks to analyse recurring issues and automate remediation wherever possible. Develop secure, production-ready code , while reviewing and debugging code produced by others. Build and mature GitOps capabilities using tools such as Terraform and Ansible Automation Platform to support multi-environment, multi-region cloud platforms. Provide on-call support for Cloud and Automation services, ensuring production stability remains the top priority. Drive post-incident improvements , ensuring risks and stability issues are understood and addressed through SRE best practices. Experience/Skills required: Strong operational support experience within an infrastructure services team, including on-call responsibilities, incident ownership, and root-cause analysis. 2+ years applying SRE methodologies, with a solid understanding of service-level metrics and reliability engineering principles. Proficiency in at least one Scripting language - ideally Python or Ansible (PowerShell also beneficial). Experience supporting and building multi-environment, multi-region cloud platforms (AWS or GCP), using IaC and GitOps workflows. Hands-on experience with observability/APM tooling such as Grafana, Datadog or Dynatrace. Background working in regulated financial services or banking environments. Excellent troubleshooting, analytical and communication skills, able to work effectively with both technical and non-technical stakeholders. Nice to have: Software development background. Familiarity with the ITIL framework. Experience with Ansible Automation Platform. Strong service-oriented mindset with the ability to work proactively and keep stakeholders informed.

Jun 11, 2026

Full time

Site Reliability Engineer (SRE) - Cloud & Automation London, Docklands (hybrid) £80,000 - £90,000 per annum + annual discretionary bonus On behalf of a leading financial services organisation, I'm looking for a highly capable Site Reliability Engineer (SRE) to drive the adoption of SRE methodologies across their Cloud-hosted environment and act as the central point of expertise for automation within the Platform Operations function. This role is ideal for someone who thrives in complex, regulated environments and is passionate about building reliable, scalable, and automated cloud platforms. The organisation is pleased to offer the role on a hybrid basis with 2 days per week in their Canary Wharf office, therefore you must be within a reasonable commute of London. Responsibilities: Lead the implementation of SRE practices across the organisation, working closely with infrastructure teams to optimise deployment processes and embed automation and operational excellence. Enhance observability and reliability , defining and implementing SLAs, SLOs and SLIs to improve alerting, monitoring, and capacity planning. Identify and eliminate toil , developing frameworks to analyse recurring issues and automate remediation wherever possible. Develop secure, production-ready code , while reviewing and debugging code produced by others. Build and mature GitOps capabilities using tools such as Terraform and Ansible Automation Platform to support multi-environment, multi-region cloud platforms. Provide on-call support for Cloud and Automation services, ensuring production stability remains the top priority. Drive post-incident improvements , ensuring risks and stability issues are understood and addressed through SRE best practices. Experience/Skills required: Strong operational support experience within an infrastructure services team, including on-call responsibilities, incident ownership, and root-cause analysis. 2+ years applying SRE methodologies, with a solid understanding of service-level metrics and reliability engineering principles. Proficiency in at least one Scripting language - ideally Python or Ansible (PowerShell also beneficial). Experience supporting and building multi-environment, multi-region cloud platforms (AWS or GCP), using IaC and GitOps workflows. Hands-on experience with observability/APM tooling such as Grafana, Datadog or Dynatrace. Background working in regulated financial services or banking environments. Excellent troubleshooting, analytical and communication skills, able to work effectively with both technical and non-technical stakeholders. Nice to have: Software development background. Familiarity with the ITIL framework. Experience with Ansible Automation Platform. Strong service-oriented mindset with the ability to work proactively and keep stakeholders informed.

Site Reliability Engineer

Huxley Associates City, London

Site Reliability Engineer (Cloud & Automation) - London - 2 Days on Site per week. A leading global financial services organisation is seeking a Site Reliability Engineer (SRE) to drive reliability, automation, and performance across its cloud-hosted platforms. The Opportunity This role sits within a high-performing Platform Operations function, acting as a central point of expertise for SRE methodologies and automation. You will play a key role in improving system resilience, scalability, and operational excellence across a complex, regulated environment. Key Responsibilities Lead the implementation of SRE best practices across cloud infrastructure Drive improvements in observability, alerting, and capacity planning (SLA / SLO / SLI) Identify and reduce operational toil through automation and remediation frameworks Build and enhance GitOps and Infrastructure-as-Code capabilities (e.g. Terraform, Ansible) Develop and review production-grade code to support automation initiatives Support incident management and on-call processes, ensuring production stability Contribute to post-incident reviews, embedding SRE principles to reduce risk Requirements Demonstrable experience in SRE or infrastructure operations within cloud environments (AWS / GCP) Strong scripting skills (Python, Ansible, or PowerShell) Experience with Infrastructure as Code and GitOps methodologies Hands-on knowledge of observability / APM tools (e.g. Grafana, Datadog, Dynatrace) Proven experience managing incidents, root cause analysis, and on-call support Understanding of SLA/SLO/SLI frameworks and reliability engineering principles Desirable Background in software development Experience working within regulated financial services environments Familiarity with ITIL and enterprise service management frameworks Relevant certifications (e.g. AWS, Terraform) Why Apply Opportunity to shape cloud reliability strategy in a large-scale environment Work with modern tooling across automation, DevOps, and SRE practices Strong emphasis on engineering excellence and continuous improvement Competitive compensation and long-term career progression To find out more about Huxley, please visit (url removed) Huxley, a trading division of SThree Partnership LLP is acting as an Employment Business in relation to this vacancy Registered office 8 Bishopsgate, London, EC2N 4BQ, United Kingdom Partnership Number OC(phone number removed) England and Wales

Jun 11, 2026

Full time

Site Reliability Engineer (Cloud & Automation) - London - 2 Days on Site per week. A leading global financial services organisation is seeking a Site Reliability Engineer (SRE) to drive reliability, automation, and performance across its cloud-hosted platforms. The Opportunity This role sits within a high-performing Platform Operations function, acting as a central point of expertise for SRE methodologies and automation. You will play a key role in improving system resilience, scalability, and operational excellence across a complex, regulated environment. Key Responsibilities Lead the implementation of SRE best practices across cloud infrastructure Drive improvements in observability, alerting, and capacity planning (SLA / SLO / SLI) Identify and reduce operational toil through automation and remediation frameworks Build and enhance GitOps and Infrastructure-as-Code capabilities (e.g. Terraform, Ansible) Develop and review production-grade code to support automation initiatives Support incident management and on-call processes, ensuring production stability Contribute to post-incident reviews, embedding SRE principles to reduce risk Requirements Demonstrable experience in SRE or infrastructure operations within cloud environments (AWS / GCP) Strong scripting skills (Python, Ansible, or PowerShell) Experience with Infrastructure as Code and GitOps methodologies Hands-on knowledge of observability / APM tools (e.g. Grafana, Datadog, Dynatrace) Proven experience managing incidents, root cause analysis, and on-call support Understanding of SLA/SLO/SLI frameworks and reliability engineering principles Desirable Background in software development Experience working within regulated financial services environments Familiarity with ITIL and enterprise service management frameworks Relevant certifications (e.g. AWS, Terraform) Why Apply Opportunity to shape cloud reliability strategy in a large-scale environment Work with modern tooling across automation, DevOps, and SRE practices Strong emphasis on engineering excellence and continuous improvement Competitive compensation and long-term career progression To find out more about Huxley, please visit (url removed) Huxley, a trading division of SThree Partnership LLP is acting as an Employment Business in relation to this vacancy Registered office 8 Bishopsgate, London, EC2N 4BQ, United Kingdom Partnership Number OC(phone number removed) England and Wales

Senior Site Reliability Engineer

DWP Digital

Site Reliability Engineer Pay up to £80,664 plus 28.97% employer pension contributions, hybrid working, flexible hours, and a truly great work life balance. DWP. Digital with Purpose. We have a fantastic opportunity to join our community of experts at DWP Digital as a Senior Site Reliability Engineer, within one of our SRE teams at the heart of Digital Transformation click apply for full job details

Jun 11, 2026

Full time

Site Reliability Engineer Pay up to £80,664 plus 28.97% employer pension contributions, hybrid working, flexible hours, and a truly great work life balance. DWP. Digital with Purpose. We have a fantastic opportunity to join our community of experts at DWP Digital as a Senior Site Reliability Engineer, within one of our SRE teams at the heart of Digital Transformation click apply for full job details

Azure SRE Engineer

Oscar Technology Glasgow, Lanarkshire

Azure Site Reliability EngineersGlasgow, Scotland (On-site)Up to £625/day Inside IR35Initial 6-Month Contract We're looking for two experienced Azure Site Reliability Engineers to join a major Financial Services programme focused on platform health, reliability, and observability across a large-scale Azure environment. You'll be responsible for building and maintaining Azure platform health infrastructure using Terraform, developing Python-based automation and integrations, and implementing SLOs/SLIs across infrastructure and application layers. The role also involves working with observability tooling, event-driven integrations, and Azure-native services in a highly collaborative environment with engineering and product stakeholders. Required experience: Strong hands-on Azure engineering experience Terraform in production environments (primary IaC tool) Python for automation, integrations, or agent development Designing SLOs / SLIs across distributed systems Observability tooling (e.g., Grafana, alerting, synthetic monitoring) Event-driven architecture (Kafka, Splunk, REST APIs, webhooks) Cloud security best practices (RBAC, encryption, Private Endpoints) Strong communication and stakeholder engagement skills Interested? Please send your CV and a brief overview of your relevant experience. Oscar Associates (UK) Limited is acting as an Employment Business in relation to this vacancy. To understand more about what we do with your data please review our privacy policy in the privacy section of the Oscar website.

Jun 09, 2026

Contractor

Azure Site Reliability EngineersGlasgow, Scotland (On-site)Up to £625/day Inside IR35Initial 6-Month Contract We're looking for two experienced Azure Site Reliability Engineers to join a major Financial Services programme focused on platform health, reliability, and observability across a large-scale Azure environment. You'll be responsible for building and maintaining Azure platform health infrastructure using Terraform, developing Python-based automation and integrations, and implementing SLOs/SLIs across infrastructure and application layers. The role also involves working with observability tooling, event-driven integrations, and Azure-native services in a highly collaborative environment with engineering and product stakeholders. Required experience: Strong hands-on Azure engineering experience Terraform in production environments (primary IaC tool) Python for automation, integrations, or agent development Designing SLOs / SLIs across distributed systems Observability tooling (e.g., Grafana, alerting, synthetic monitoring) Event-driven architecture (Kafka, Splunk, REST APIs, webhooks) Cloud security best practices (RBAC, encryption, Private Endpoints) Strong communication and stakeholder engagement skills Interested? Please send your CV and a brief overview of your relevant experience. Oscar Associates (UK) Limited is acting as an Employment Business in relation to this vacancy. To understand more about what we do with your data please review our privacy policy in the privacy section of the Oscar website.

Site Reliability Engineer

Huxley Associates Bromley, London

My client within Investment Banking is currently seeking for an SRE Lead. I'm working on an SRE Lead role within a banking/payments environment that I thought might be of interest. You'd lead SRE strategy, driving automation, observability, and reliability by design, with a focus on reducing incidents and improving recovery. Looking for someone with 8+ years' experience in SRE, strong resilience engineering background, and the ability to scale operations in complex environments. Logistics: Up to 1000 p/d inside IR35 3 Days a week Bromley 12 month contract Please click here to find out more about our Key Information Documents. Please note that the documents provided contain generic information. If we are successful in finding you an assignment, you will receive a Key Information Document which will be specific to the vendor set-up you have chosen and your placement. To find out more about Huxley, please visit (url removed) Huxley, a trading division of SThree Partnership LLP is acting as an Employment Business in relation to this vacancy Registered office 8 Bishopsgate, London, EC2N 4BQ, United Kingdom Partnership Number OC(phone number removed) England and Wales

Jun 09, 2026

Contractor

My client within Investment Banking is currently seeking for an SRE Lead. I'm working on an SRE Lead role within a banking/payments environment that I thought might be of interest. You'd lead SRE strategy, driving automation, observability, and reliability by design, with a focus on reducing incidents and improving recovery. Looking for someone with 8+ years' experience in SRE, strong resilience engineering background, and the ability to scale operations in complex environments. Logistics: Up to 1000 p/d inside IR35 3 Days a week Bromley 12 month contract Please click here to find out more about our Key Information Documents. Please note that the documents provided contain generic information. If we are successful in finding you an assignment, you will receive a Key Information Document which will be specific to the vendor set-up you have chosen and your placement. To find out more about Huxley, please visit (url removed) Huxley, a trading division of SThree Partnership LLP is acting as an Employment Business in relation to this vacancy Registered office 8 Bishopsgate, London, EC2N 4BQ, United Kingdom Partnership Number OC(phone number removed) England and Wales

Senior Site Reliability Engineer

DWP Digital Leeds, Yorkshire

Site Reliability Engineer Pay up to £80,664 plus 28.97% employer pension contributions, hybrid working, flexible hours, and a truly great work life balance. DWP. Digital with Purpose. We have a fantastic opportunity to join our community of experts at DWP Digital as a Senior Site Reliability Engineer, within one of our SRE teams at the heart of Digital Transformation click apply for full job details

Jun 09, 2026

Full time

Site Reliability Engineer Pay up to £80,664 plus 28.97% employer pension contributions, hybrid working, flexible hours, and a truly great work life balance. DWP. Digital with Purpose. We have a fantastic opportunity to join our community of experts at DWP Digital as a Senior Site Reliability Engineer, within one of our SRE teams at the heart of Digital Transformation click apply for full job details

SRE Lead - Cyber Security

Client Server Cambridge, Cambridgeshire

SRE Lead (Site Reliability Engineer) Cambridge / WFH to £80k Do you have expertise with SRE on AWS and / or Azure? You could be progressing your career at the world's most advanced cybersecurity technology business that uses AI technology to protect clients across the globe from advanced cyber threats, working alongside a team of friendly and supportive people and enjoying a host of perks and benefi click apply for full job details

Jun 09, 2026

Full time

SRE Lead (Site Reliability Engineer) Cambridge / WFH to £80k Do you have expertise with SRE on AWS and / or Azure? You could be progressing your career at the world's most advanced cybersecurity technology business that uses AI technology to protect clients across the globe from advanced cyber threats, working alongside a team of friendly and supportive people and enjoying a host of perks and benefi click apply for full job details

Platform Engineer (GCP)

Hays Specialist Recruitment Manchester, Lancashire

Prestigious opportunity for a talented and experienced Platform Engineer to join a rapidly growing digital engineering team delivering cutting edge solutions across a diverse portfolio of clients.This is hands on applying your deep engineering and architectural expertise to design, build, and evolve scalable platform solutions that drive digital transformation within some of the UK's most exciting organisations. Collaborating closely with highly skilled, cross functional teams, you will work alongside engineers, architects, and product specialists to push boundaries and deliver impactful, high quality outcomes. Key responsibilities: Design, build, and maintain robust cloud platforms and CI/CD pipelines Contribute across the full software development life cycle Provide technical leadership and input into system and architecture design Collaborate with cross functional teams to deliver scalable, reliable solutions. Write clean, well documented code and contribute to technical documentation Proactively monitor, troubleshoot, and resolve production issues Continuously improve platform performance, security, and reliability Stay current with emerging technologies and drive innovation and adoption Communicate complex technical ideas to both technical and non-technical stakeholders If you possess a combination of some of the following skills, then LET'S TALK! Strong experience designing and managing cloud native platforms on Google Cloud Platform (GCP) Hands-on expertise with services such as: Compute Engine, GKE (Kubernetes), Cloud Storage VPC networking, IAM, Cloud Functions Proven experience with Infrastructure as Code (eg, Terraform) Strong background in CI/CD pipeline design (eg, Jenkins, GitLab CI, Cloud Build) Proficiency in Scripting/programming (eg, Python, Bash, Go) Experience with containerisation & orchestration (Docker, Kubernetes) Solid understanding of cloud networking, security, and identity management Experience with monitoring & observability tools (Prometheus, Grafana, ELK, or similar) Strong Git/version control experience Exposure to the following skills is advantageous but not essential: - Experience with multi-cloud or hybrid cloud environments (AWS, Azure, on-premise). Exposure to service mesh technologies (eg, Istio, Linkerd) and configuration management tools (eg, Chef, Puppet). Knowledge of site reliability engineering (SRE) principles and practices. Familiarity with GCP AI/ML services (AI Platform, AutoML). In return, you will be rewarded with ongoing career development and a market leading benefits package in a flexible, hybrid working environment. What you need to do now If you're interested in this role, click 'apply now' to forward an up-to-date copy of your CV, or call us now. Hays Specialist Recruitment Limited acts as an employment agency for permanent recruitment and employment business for the supply of temporary workers. By applying for this job you accept the T&C's, Privacy Policy and Disclaimers which can be found on our website.

Jun 08, 2026

Full time

Prestigious opportunity for a talented and experienced Platform Engineer to join a rapidly growing digital engineering team delivering cutting edge solutions across a diverse portfolio of clients.This is hands on applying your deep engineering and architectural expertise to design, build, and evolve scalable platform solutions that drive digital transformation within some of the UK's most exciting organisations. Collaborating closely with highly skilled, cross functional teams, you will work alongside engineers, architects, and product specialists to push boundaries and deliver impactful, high quality outcomes. Key responsibilities: Design, build, and maintain robust cloud platforms and CI/CD pipelines Contribute across the full software development life cycle Provide technical leadership and input into system and architecture design Collaborate with cross functional teams to deliver scalable, reliable solutions. Write clean, well documented code and contribute to technical documentation Proactively monitor, troubleshoot, and resolve production issues Continuously improve platform performance, security, and reliability Stay current with emerging technologies and drive innovation and adoption Communicate complex technical ideas to both technical and non-technical stakeholders If you possess a combination of some of the following skills, then LET'S TALK! Strong experience designing and managing cloud native platforms on Google Cloud Platform (GCP) Hands-on expertise with services such as: Compute Engine, GKE (Kubernetes), Cloud Storage VPC networking, IAM, Cloud Functions Proven experience with Infrastructure as Code (eg, Terraform) Strong background in CI/CD pipeline design (eg, Jenkins, GitLab CI, Cloud Build) Proficiency in Scripting/programming (eg, Python, Bash, Go) Experience with containerisation & orchestration (Docker, Kubernetes) Solid understanding of cloud networking, security, and identity management Experience with monitoring & observability tools (Prometheus, Grafana, ELK, or similar) Strong Git/version control experience Exposure to the following skills is advantageous but not essential: - Experience with multi-cloud or hybrid cloud environments (AWS, Azure, on-premise). Exposure to service mesh technologies (eg, Istio, Linkerd) and configuration management tools (eg, Chef, Puppet). Knowledge of site reliability engineering (SRE) principles and practices. Familiarity with GCP AI/ML services (AI Platform, AutoML). In return, you will be rewarded with ongoing career development and a market leading benefits package in a flexible, hybrid working environment. What you need to do now If you're interested in this role, click 'apply now' to forward an up-to-date copy of your CV, or call us now. Hays Specialist Recruitment Limited acts as an employment agency for permanent recruitment and employment business for the supply of temporary workers. By applying for this job you accept the T&C's, Privacy Policy and Disclaimers which can be found on our website.

Platform Engineer (GCP)

Hays Technology City, Manchester

Prestigious opportunity for a talented and experienced Platform Engineer to join a rapidly growing digital engineering team delivering cutting edge solutions across a diverse portfolio of clients.This is hands on applying your deep engineering and architectural expertise to design, build, and evolve scalable platform solutions that drive digital transformation within some of the UK's most exciting organisations. Collaborating closely with highly skilled, cross functional teams, you will work alongside engineers, architects, and product specialists to push boundaries and deliver impactful, high quality outcomes. Key responsibilities: Design, build, and maintain robust cloud platforms and CI/CD pipelines Contribute across the full software development lifecycle Provide technical leadership and input into system and architecture design Collaborate with cross functional teams to deliver scalable, reliable solutions. Write clean, well documented code and contribute to technical documentation Proactively monitor, troubleshoot, and resolve production issues Continuously improve platform performance, security, and reliability Stay current with emerging technologies and drive innovation and adoption Communicate complex technical ideas to both technical and non-technical stakeholders If you possess a combination of some of the following skills, then LET'S TALK! Strong experience designing and managing cloud native platforms on Google Cloud Platform (GCP) Hands-on expertise with services such as: Compute Engine, GKE (Kubernetes), Cloud Storage VPC networking, IAM, Cloud Functions Proven experience with Infrastructure as Code (e.g., Terraform) Strong background in CI/CD pipeline design (e.g., Jenkins, GitLab CI, Cloud Build) Proficiency in scripting/programming (e.g., Python, Bash, Go) Experience with containerisation & orchestration (Docker, Kubernetes) Solid understanding of cloud networking, security, and identity management Experience with monitoring & observability tools (Prometheus, Grafana, ELK, or similar) Strong Git/version control experience Exposure to the following skills is advantageous but not essential: - Experience with multi-cloud or hybrid cloud environments (AWS, Azure, on-premise). Exposure to service mesh technologies (e.g., Istio, Linkerd) and configuration management tools (e.g., Chef, Puppet). Knowledge of site reliability engineering (SRE) principles and practices. Familiarity with GCP AI/ML services (AI Platform, AutoML). In return, you will be rewarded with ongoing career development and a market leading benefits package in a flexible, hybrid working environment. What you need to do now If you're interested in this role, click 'apply now' to forward an up-to-date copy of your CV, or call us now. If this job isn't quite right for you, but you are looking for a new position, please contact us for a confidential discussion about your career. Hays Specialist Recruitment Limited acts as an employment agency for permanent recruitment and employment business for the supply of temporary workers. By applying for this job you accept the T&C's, Privacy Policy and Disclaimers which can be found at (url removed)

Jun 07, 2026

Full time

Prestigious opportunity for a talented and experienced Platform Engineer to join a rapidly growing digital engineering team delivering cutting edge solutions across a diverse portfolio of clients.This is hands on applying your deep engineering and architectural expertise to design, build, and evolve scalable platform solutions that drive digital transformation within some of the UK's most exciting organisations. Collaborating closely with highly skilled, cross functional teams, you will work alongside engineers, architects, and product specialists to push boundaries and deliver impactful, high quality outcomes. Key responsibilities: Design, build, and maintain robust cloud platforms and CI/CD pipelines Contribute across the full software development lifecycle Provide technical leadership and input into system and architecture design Collaborate with cross functional teams to deliver scalable, reliable solutions. Write clean, well documented code and contribute to technical documentation Proactively monitor, troubleshoot, and resolve production issues Continuously improve platform performance, security, and reliability Stay current with emerging technologies and drive innovation and adoption Communicate complex technical ideas to both technical and non-technical stakeholders If you possess a combination of some of the following skills, then LET'S TALK! Strong experience designing and managing cloud native platforms on Google Cloud Platform (GCP) Hands-on expertise with services such as: Compute Engine, GKE (Kubernetes), Cloud Storage VPC networking, IAM, Cloud Functions Proven experience with Infrastructure as Code (e.g., Terraform) Strong background in CI/CD pipeline design (e.g., Jenkins, GitLab CI, Cloud Build) Proficiency in scripting/programming (e.g., Python, Bash, Go) Experience with containerisation & orchestration (Docker, Kubernetes) Solid understanding of cloud networking, security, and identity management Experience with monitoring & observability tools (Prometheus, Grafana, ELK, or similar) Strong Git/version control experience Exposure to the following skills is advantageous but not essential: - Experience with multi-cloud or hybrid cloud environments (AWS, Azure, on-premise). Exposure to service mesh technologies (e.g., Istio, Linkerd) and configuration management tools (e.g., Chef, Puppet). Knowledge of site reliability engineering (SRE) principles and practices. Familiarity with GCP AI/ML services (AI Platform, AutoML). In return, you will be rewarded with ongoing career development and a market leading benefits package in a flexible, hybrid working environment. What you need to do now If you're interested in this role, click 'apply now' to forward an up-to-date copy of your CV, or call us now. If this job isn't quite right for you, but you are looking for a new position, please contact us for a confidential discussion about your career. Hays Specialist Recruitment Limited acts as an employment agency for permanent recruitment and employment business for the supply of temporary workers. By applying for this job you accept the T&C's, Privacy Policy and Disclaimers which can be found at (url removed)

Secure Cloud & DevSecOps (GCP / GDC)

RT Consulting Bristol, Somerset

Associate Consultant - Secure Cloud / GCP / GDC DevSecOps Join RT Consulting's Associate Consulting workforce Who we are RT Consulting are a trusted management consultancy and service provider. We are proud to hold the Gold Award under the Armed Forces Employer Recognition Scheme. RT are a member of the Government Digital Sustainability Alliance, bringing government, industry, and academia together to improve digital sustainability outcomes for the UK government and its supply chain We deliver highly capable and effective value for money solutions to our clients as the 'customer friend' and trusted partner across Defence, Policing, Central and Local Government. We deploy consultants who ensure alignment with Government policy, stakeholder expectations, and long-term impact goals. We specialise in the delivery of Cloud & Digital Infrastructure services , including multi-cloud engineering (AWS, Azure, GCP), secure cloud platforms, DevSecOps and automation, Site Reliability Engineering, digital workplace technologies, and resilient, scalable infrastructure operations across complex and regulated environments. Your Invitation: We invite you to join our Cloud & Digital Infrastructure consulting team , where we can align you to current and upcoming demand across cloud engineering, secure platform engineering, DevSecOps/SRE, and modern infrastructure transformation. We are particularly building capability in: Google Cloud Platform (GCP) Google Distributed Cloud (GDC) / air-gapped deployments Secure-by-design cloud engineering for Defence and high-assurance environments Kubernetes, containerisation, and Infrastructure-as-Code (Terraform) This includes supporting surge activity for our defence partners delivering secure cloud services into secure environments. Engagement expectations Vetting: Due to the regulated nature of our work and our significant defence portfolio, a minimum of active SC clearance is required. DV-cleared professionals are also in high demand for secure, air-gapped GDC programmes. Working pattern: Projects typically require 2-3 days per week on-site at Southwest client locations including, Corsham & Bristol, with hybrid flexibility where permitted. Fees: Rates are aligned to engagement scope and seniority. What you'll get You join a community of specialists across Defence, Government, Policing and wider Public Sector programmes, where knowledge sharing, peer support and professional connection are part of the culture. Priority access to new consultancy opportunities, including secure GCP/GDC, DevSecOps and platform engineering workstreams. Dedicated relationship support, Ongoing contact with a Relationship Manager who provides guidance, check-ins and forward planning to help minimise gaps between assignments. An invitation to Society events, meetups and community touchpoints, we aim to ensure you feel supported, valued and engaged throughout your consultancy journey. A consultancy environment that reflects our Group Values - Integrity & Respect, Accountability, Collaboration, High Performance, Innovation, Agility, Client Centricity & People Focused. Who you are An experienced Cloud / Platform Engineering professional with capability in one or more of the following: Google Cloud Platform (GCP) or Google Distributed Cloud (GDC) DevOps, DevSecOps or Site Reliability Engineering (SRE) Platform Engineering and secure cloud design Kubernetes and container platforms (GKE / secure clusters) Infrastructure-as-Code (Terraform) Secure cloud operations (IAM, RBAC, networking, secrets management) You are comfortable working within secure, regulated environments and collaborating directly with users and stakeholders to deliver cloud capability at pace. You will need to be well versed in the direction of travel from Government, focused on digital transformation to enhance public services, improve efficiency, and meet the evolving expectations of its citizens. This shift involves modernising outdated systems, leveraging data effectively, and adopting new technologies like Artificial Intelligence (AI). The goal is a more agile, responsive, and citizen-centric government. You are comfortable operating in high-assurance, regulated environments, capable of working independently within secure delivery teams, and adept at designing, deploying and maintaining secure, modern cloud platforms. How to express interest Contact us to arrange a confidential conversation.

Jun 07, 2026

Contractor

Associate Consultant - Secure Cloud / GCP / GDC DevSecOps Join RT Consulting's Associate Consulting workforce Who we are RT Consulting are a trusted management consultancy and service provider. We are proud to hold the Gold Award under the Armed Forces Employer Recognition Scheme. RT are a member of the Government Digital Sustainability Alliance, bringing government, industry, and academia together to improve digital sustainability outcomes for the UK government and its supply chain We deliver highly capable and effective value for money solutions to our clients as the 'customer friend' and trusted partner across Defence, Policing, Central and Local Government. We deploy consultants who ensure alignment with Government policy, stakeholder expectations, and long-term impact goals. We specialise in the delivery of Cloud & Digital Infrastructure services , including multi-cloud engineering (AWS, Azure, GCP), secure cloud platforms, DevSecOps and automation, Site Reliability Engineering, digital workplace technologies, and resilient, scalable infrastructure operations across complex and regulated environments. Your Invitation: We invite you to join our Cloud & Digital Infrastructure consulting team , where we can align you to current and upcoming demand across cloud engineering, secure platform engineering, DevSecOps/SRE, and modern infrastructure transformation. We are particularly building capability in: Google Cloud Platform (GCP) Google Distributed Cloud (GDC) / air-gapped deployments Secure-by-design cloud engineering for Defence and high-assurance environments Kubernetes, containerisation, and Infrastructure-as-Code (Terraform) This includes supporting surge activity for our defence partners delivering secure cloud services into secure environments. Engagement expectations Vetting: Due to the regulated nature of our work and our significant defence portfolio, a minimum of active SC clearance is required. DV-cleared professionals are also in high demand for secure, air-gapped GDC programmes. Working pattern: Projects typically require 2-3 days per week on-site at Southwest client locations including, Corsham & Bristol, with hybrid flexibility where permitted. Fees: Rates are aligned to engagement scope and seniority. What you'll get You join a community of specialists across Defence, Government, Policing and wider Public Sector programmes, where knowledge sharing, peer support and professional connection are part of the culture. Priority access to new consultancy opportunities, including secure GCP/GDC, DevSecOps and platform engineering workstreams. Dedicated relationship support, Ongoing contact with a Relationship Manager who provides guidance, check-ins and forward planning to help minimise gaps between assignments. An invitation to Society events, meetups and community touchpoints, we aim to ensure you feel supported, valued and engaged throughout your consultancy journey. A consultancy environment that reflects our Group Values - Integrity & Respect, Accountability, Collaboration, High Performance, Innovation, Agility, Client Centricity & People Focused. Who you are An experienced Cloud / Platform Engineering professional with capability in one or more of the following: Google Cloud Platform (GCP) or Google Distributed Cloud (GDC) DevOps, DevSecOps or Site Reliability Engineering (SRE) Platform Engineering and secure cloud design Kubernetes and container platforms (GKE / secure clusters) Infrastructure-as-Code (Terraform) Secure cloud operations (IAM, RBAC, networking, secrets management) You are comfortable working within secure, regulated environments and collaborating directly with users and stakeholders to deliver cloud capability at pace. You will need to be well versed in the direction of travel from Government, focused on digital transformation to enhance public services, improve efficiency, and meet the evolving expectations of its citizens. This shift involves modernising outdated systems, leveraging data effectively, and adopting new technologies like Artificial Intelligence (AI). The goal is a more agile, responsive, and citizen-centric government. You are comfortable operating in high-assurance, regulated environments, capable of working independently within secure delivery teams, and adept at designing, deploying and maintaining secure, modern cloud platforms. How to express interest Contact us to arrange a confidential conversation.

Lead Site Reliability Engineer (SRE)

McGregor Boyall Bromley, Kent

Lead Site Reliability Engineer (SRE) - Banking and Payments Contract, 12 months + Based in Bromley (Hybrid working - 3 days office) Global financial services client is seeking an SRE to lead the designs and reliability engineering across banking and payments, establishing SRE standards, automation, and learning practices to improve resilience, reduce incidents, and scale engineering led operations. The successful candidate will be skilled in resilient engineering, risk control, and scaling operations across complex banking and payments environments. Demonstrate flexibility, navigate ambiguity, and quickly establish credibility among technical peers Excellent written and verbal communication skills. Responsibilities include: SRE strategy ownership Banking/payments resilience Reliability engineering transformation SLO/SLI/error budget adoption Incident reduction and operational scaling Senior stakeholder influence If this is of interest and you have the required skills, please submit your CV over for immediate consideration. McGregor Boyall is an equal opportunity employer and do not discriminate on any grounds.

Jun 07, 2026

Contractor

Lead Site Reliability Engineer (SRE) - Banking and Payments Contract, 12 months + Based in Bromley (Hybrid working - 3 days office) Global financial services client is seeking an SRE to lead the designs and reliability engineering across banking and payments, establishing SRE standards, automation, and learning practices to improve resilience, reduce incidents, and scale engineering led operations. The successful candidate will be skilled in resilient engineering, risk control, and scaling operations across complex banking and payments environments. Demonstrate flexibility, navigate ambiguity, and quickly establish credibility among technical peers Excellent written and verbal communication skills. Responsibilities include: SRE strategy ownership Banking/payments resilience Reliability engineering transformation SLO/SLI/error budget adoption Incident reduction and operational scaling Senior stakeholder influence If this is of interest and you have the required skills, please submit your CV over for immediate consideration. McGregor Boyall is an equal opportunity employer and do not discriminate on any grounds.

Senior Site Reliability Engineer

DWP Digital Newcastle Upon Tyne, Tyne And Wear

Site Reliability Engineer Pay up to £80,664 plus 28.97% employer pension contributions, hybrid working, flexible hours, and a truly great work life balance. DWP. Digital with Purpose. We have a fantastic opportunity to join our community of experts at DWP Digital as a Senior Site Reliability Engineer, within one of our SRE teams at the heart of Digital Transformation click apply for full job details

Jun 07, 2026

Full time

Site Reliability Engineer Pay up to £80,664 plus 28.97% employer pension contributions, hybrid working, flexible hours, and a truly great work life balance. DWP. Digital with Purpose. We have a fantastic opportunity to join our community of experts at DWP Digital as a Senior Site Reliability Engineer, within one of our SRE teams at the heart of Digital Transformation click apply for full job details

Senior Site Reliability Engineer

DWP Digital Sheffield, Yorkshire

Site Reliability Engineer Pay up to £80,664 plus 28.97% employer pension contributions, hybrid working, flexible hours, and a truly great work life balance. DWP. Digital with Purpose. We have a fantastic opportunity to join our community of experts at DWP Digital as a Senior Site Reliability Engineer, within one of our SRE teams at the heart of Digital Transformation click apply for full job details

Jun 07, 2026

Full time

Site Reliability Engineer Pay up to £80,664 plus 28.97% employer pension contributions, hybrid working, flexible hours, and a truly great work life balance. DWP. Digital with Purpose. We have a fantastic opportunity to join our community of experts at DWP Digital as a Senior Site Reliability Engineer, within one of our SRE teams at the heart of Digital Transformation click apply for full job details

Azure Platform Engineer

Opus Recruitment Solutions

Platform Engineer 6 month contract 400- 550 InsideIR35 Hybrid 2/3 days onsite london Design pipelines, Build pipelines, Automate deployments Design, build, and maintain CI/CD pipelines for secure, automated, and reliable deployments Develop and manage infrastructure-as-code (Terraform, YAML) across cloud and hybrid environments Implement monitoring, alerting, and logging to ensure platform performance, availability, and reliability Collaborate with cross-functional teams to enhance platform capabilities and enable self-service engineering Embed DevOps, SRE, and DevSecOps best practices, driving automation, security, and continuous improvement Support incident management, root cause analysis, cost optimisation, and ensure scalable, resilient environments supporting modern architectures Platform Engineer 6 month contract 400- 550 InsideIR35 Hybrid 2/3 days onsite london This role does not offer sponsorship.

Jun 06, 2026

Contractor

Platform Engineer 6 month contract 400- 550 InsideIR35 Hybrid 2/3 days onsite london Design pipelines, Build pipelines, Automate deployments Design, build, and maintain CI/CD pipelines for secure, automated, and reliable deployments Develop and manage infrastructure-as-code (Terraform, YAML) across cloud and hybrid environments Implement monitoring, alerting, and logging to ensure platform performance, availability, and reliability Collaborate with cross-functional teams to enhance platform capabilities and enable self-service engineering Embed DevOps, SRE, and DevSecOps best practices, driving automation, security, and continuous improvement Support incident management, root cause analysis, cost optimisation, and ensure scalable, resilient environments supporting modern architectures Platform Engineer 6 month contract 400- 550 InsideIR35 Hybrid 2/3 days onsite london This role does not offer sponsorship.

Lead Site Reliability Engineer (SRE)

McGregor Boyall

Lead Site Reliability Engineer (SRE) - Transformation - SRE Adoption - Banking and Payments Contract, 12 months + Based in London (Hybrid working - 3 days office) Global financial services client is seeking an SRE to lead the designs and reliability engineering across banking and payments, establishing SRE standards, automation, and learning practices to improve resilience, reduce incidents, and scale engineering led operations. The successful candidate will be skilled in resilient engineering, risk control, and scaling operations across complex banking and payments environments. Demonstrate flexibility, navigate ambiguity, and quickly establish credibility among technical peers Excellent written and verbal communication skills. Responsibilities include: SRE strategy ownership Banking/payments resilience Reliability engineering transformation SLO/SLI/error budget adoption Incident reduction and operational scaling Senior stakeholder influence If this is of interest and you have the required skills, please submit your CV over for immediate consideration. McGregor Boyall is an equal opportunity employer and do not discriminate on any grounds.

Jun 05, 2026

Contractor

Lead Site Reliability Engineer (SRE) - Transformation - SRE Adoption - Banking and Payments Contract, 12 months + Based in London (Hybrid working - 3 days office) Global financial services client is seeking an SRE to lead the designs and reliability engineering across banking and payments, establishing SRE standards, automation, and learning practices to improve resilience, reduce incidents, and scale engineering led operations. The successful candidate will be skilled in resilient engineering, risk control, and scaling operations across complex banking and payments environments. Demonstrate flexibility, navigate ambiguity, and quickly establish credibility among technical peers Excellent written and verbal communication skills. Responsibilities include: SRE strategy ownership Banking/payments resilience Reliability engineering transformation SLO/SLI/error budget adoption Incident reduction and operational scaling Senior stakeholder influence If this is of interest and you have the required skills, please submit your CV over for immediate consideration. McGregor Boyall is an equal opportunity employer and do not discriminate on any grounds.

Site Reliability Engineer - BACLJP

Huxley Associates

My client within Investment Banking is currently seeking for an SRE Lead. I'm working on an SRE Lead role within a banking/payments environment that I thought might be of interest. You'd lead SRE strategy, driving automation, observability, and reliability by design, with a focus on reducing incidents and improving recovery. Looking for someone with 8+ years' experience in SRE, strong resilience engineering background, and the ability to scale operations in complex environments. Logistics: 600 p/d inside IR35 3 Days a week Bromley 12 month contract Please click here to find out more about our Key Information Documents. Please note that the documents provided contain generic information. If we are successful in finding you an assignment, you will receive a Key Information Document which will be specific to the vendor set-up you have chosen and your placement. To find out more about Huxley, please visit (url removed) Huxley, a trading division of SThree Partnership LLP is acting as an Employment Business in relation to this vacancy Registered office 8 Bishopsgate, London, EC2N 4BQ, United Kingdom Partnership Number OC(phone number removed) England and Wales

Jun 05, 2026

Contractor

My client within Investment Banking is currently seeking for an SRE Lead. I'm working on an SRE Lead role within a banking/payments environment that I thought might be of interest. You'd lead SRE strategy, driving automation, observability, and reliability by design, with a focus on reducing incidents and improving recovery. Looking for someone with 8+ years' experience in SRE, strong resilience engineering background, and the ability to scale operations in complex environments. Logistics: 600 p/d inside IR35 3 Days a week Bromley 12 month contract Please click here to find out more about our Key Information Documents. Please note that the documents provided contain generic information. If we are successful in finding you an assignment, you will receive a Key Information Document which will be specific to the vendor set-up you have chosen and your placement. To find out more about Huxley, please visit (url removed) Huxley, a trading division of SThree Partnership LLP is acting as an Employment Business in relation to this vacancy Registered office 8 Bishopsgate, London, EC2N 4BQ, United Kingdom Partnership Number OC(phone number removed) England and Wales

SRE Technical Lead

Adecco Reading, Berkshire

SRE Technical Lead Reading/Hybrid (UK-based - mix of home, office, and client site) Must be eligible for SC Clearance We are seeking an experienced SRE Technical Lead to act as the technical authority for Site Reliability Engineering across complex, large-scale platforms. This is a senior, client-facing leadership role where you will be responsible for driving reliability, availability, and operational excellence across multi-team and multi-vendor environments. You will combine hands-on engineering expertise with strategic leadership, ensuring SRE practices are Embedded across the full service life cycle-from design through to production operations. As the SRE Technical Lead, you will: Define and implement SRE strategy, standards, and best practices, including SLAs, SLOs, and error budgets Embed reliability principles into platform and service design from the outset Lead key SRE practices such as reliability reviews, operational readiness, and toil reduction Drive automation across monitoring, incident response, and remediation Act as the technical escalation point for major incidents and high-risk releases Lead blameless post-incident reviews and ensure continuous improvement Establish observability and capacity management practices using modern tooling Identify and eliminate systemic reliability risks and operational inefficiencies Collaborate with engineering, platform, security, and operations teams across multiple vendors Provide coaching and mentorship to engineers, raising SRE capability across the organisation Essential experience: Deep expertise in Kubernetes and/or OpenShift Experience working in multi-cloud or hybrid cloud environments Strong understanding of SRE principles (SLOs, SLAs, error budgets, reliability engineering) Hands-on experience with observability tooling (eg, Prometheus, Grafana, OpenTelemetry, Loki, Tempo) Strong knowledge of Infrastructure as Code and GitOps (eg, Helm, Kustomize, ArgoCD, Tekton) Experience with CI/CD pipelines and automation Proven ability to operate as a technical leader in complex, multi-team environments

Jun 05, 2026

Full time

SRE Technical Lead Reading/Hybrid (UK-based - mix of home, office, and client site) Must be eligible for SC Clearance We are seeking an experienced SRE Technical Lead to act as the technical authority for Site Reliability Engineering across complex, large-scale platforms. This is a senior, client-facing leadership role where you will be responsible for driving reliability, availability, and operational excellence across multi-team and multi-vendor environments. You will combine hands-on engineering expertise with strategic leadership, ensuring SRE practices are Embedded across the full service life cycle-from design through to production operations. As the SRE Technical Lead, you will: Define and implement SRE strategy, standards, and best practices, including SLAs, SLOs, and error budgets Embed reliability principles into platform and service design from the outset Lead key SRE practices such as reliability reviews, operational readiness, and toil reduction Drive automation across monitoring, incident response, and remediation Act as the technical escalation point for major incidents and high-risk releases Lead blameless post-incident reviews and ensure continuous improvement Establish observability and capacity management practices using modern tooling Identify and eliminate systemic reliability risks and operational inefficiencies Collaborate with engineering, platform, security, and operations teams across multiple vendors Provide coaching and mentorship to engineers, raising SRE capability across the organisation Essential experience: Deep expertise in Kubernetes and/or OpenShift Experience working in multi-cloud or hybrid cloud environments Strong understanding of SRE principles (SLOs, SLAs, error budgets, reliability engineering) Hands-on experience with observability tooling (eg, Prometheus, Grafana, OpenTelemetry, Loki, Tempo) Strong knowledge of Infrastructure as Code and GitOps (eg, Helm, Kustomize, ArgoCD, Tekton) Experience with CI/CD pipelines and automation Proven ability to operate as a technical leader in complex, multi-team environments

Site Reliability Engineer

Harvey Nash Glasgow, Lanarkshire

Here at Harvey Nash my client based in Glasgow is looking to recruit a Site Reliability Engineer to join their team on a full-time basis working on a hybrid working model ( 2 days per week onsite ). You'll join a public cloud team, supporting application migrations and working across the full development lifecycle, from solution design through to production support and continuous improvement. You'll play a key role in shaping and advancing the organisation's SRE capability. The role involves leading technical design discussions, engaging with senior stakeholders, and applying software engineering, automation, and incident response best practices to ensure the reliability, availability, and scalability of critical systems and platforms. Responsibilities include: Ensure system availability, performance, and scalability through proactive monitoring and capacity planning. Investigate and resolve outages, implementing measures to prevent recurrence. Build tools and scripts to automate processes, improving efficiency and resilience. Monitor performance, identify bottlenecks, and apply optimisation best practices. Collaborate with development teams to embed reliability and scalability across the SDLC. I'm looking to speak with self-starting candidates with: Strong cloud environment experience (AWS, Azure or GCP) Solid experience in observability and monitoring tools . You should be proficient in a development language such as Python , with experience using it to automate tasks, build tools, and support infrastructure management. Terraform or Cloud Foundation . Experience with CI/CD Pipelines You should also have a strong understanding of the SDLC or a software engineering background If you would be interested in learning more then please apply with your CV for review Always use these settings

Jun 05, 2026

Full time

Here at Harvey Nash my client based in Glasgow is looking to recruit a Site Reliability Engineer to join their team on a full-time basis working on a hybrid working model ( 2 days per week onsite ). You'll join a public cloud team, supporting application migrations and working across the full development lifecycle, from solution design through to production support and continuous improvement. You'll play a key role in shaping and advancing the organisation's SRE capability. The role involves leading technical design discussions, engaging with senior stakeholders, and applying software engineering, automation, and incident response best practices to ensure the reliability, availability, and scalability of critical systems and platforms. Responsibilities include: Ensure system availability, performance, and scalability through proactive monitoring and capacity planning. Investigate and resolve outages, implementing measures to prevent recurrence. Build tools and scripts to automate processes, improving efficiency and resilience. Monitor performance, identify bottlenecks, and apply optimisation best practices. Collaborate with development teams to embed reliability and scalability across the SDLC. I'm looking to speak with self-starting candidates with: Strong cloud environment experience (AWS, Azure or GCP) Solid experience in observability and monitoring tools . You should be proficient in a development language such as Python , with experience using it to automate tasks, build tools, and support infrastructure management. Terraform or Cloud Foundation . Experience with CI/CD Pipelines You should also have a strong understanding of the SDLC or a software engineering background If you would be interested in learning more then please apply with your CV for review Always use these settings

35 jobs found

Modal Window