Site Reliability Engineer (UK Remote)

Posted 10 April 2025
Salary Negotiable
LocationUnited Kingdom
Job type Contract
DisciplineGames Programming
Reference19890
Contact NameMargaret Smith

Job description

We’re looking for an innovative and enthusiastic Site Reliability Engineer to a well-known, UK games studio who are highly respected throughout the industry.

As a Site Reliability Engineer your main purpose is solving for scale through collaboration and automation, bringing engineering principles to infrastructure and operational problems.

You will work closely with the different teams to help improve manual tasks, operational processes, lower complexities & risks, break down team silos through improved communication and really get involved with them to reinvent how they work to help them succeed.

You will work with a scaling platform, maintaining its programmable infrastructure and maximising the availability of the workloads that run on it, both at a live production & deliverable lifecycle level.

With constant improvement and automation as core principles, a lot of this role is thinking about inefficient and time-consuming things that are happening and putting a stop to them as soon as possible.

Your responsibilities…

  • Minimising downtime to products and services.

  • Ensuring the platform is stable, scalable and completely automated.

  • Helping to improve and shorten development/process lifecycles.

  • Applying effective monitoring and alerting in place.

  • Supporting release through stable and automated pipeline processes.

The skills and experience you will bring to the role…

  • Knowledge of languages such as PowerShell, C#.

  • Managed/implemented large scale distributed server systems within Azure.

  • Worked on modern release pipelines – CI/CD (Octopus Deploy/Azure DevOps/TeamCity).

  • Knowledge of Azure monitoring, alerting, message queues.

  • Understand or worked within an Incident Management Process (ITSM).