Site Reliability Engineer

bet365
Stoke-on-Trent
3 months ago
Applications closed

bet365 Stoke-On-Trent, England, United Kingdom

Who we are looking for

A Site Reliability Engineer who will enhance system reliability, observability and performance through a strong engineering approach, and assist with incident resolution and best practices.

You will have software engineering skills, focusing on system reliability and observability. You will monitor the health, performance and availability of critical systems, directly impacting operational efficiency.

Using your engineering expertise, you will implement solutions that enhance reliability, including service instrumentation with tools such as OpenTelemetry, improved logging practices and features that aid maintainability. You will also help engineer tools and automation for effective service management.
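As an illustration only, "improved logging practices" often means emitting structured, machine-parseable log lines that tools such as Splunk or Grafana can index. The sketch below uses only the Python standard library; the `JsonFormatter` class, the `build_logger` helper and the `service` field are hypothetical names chosen for this example, not part of the role description or of any bet365 tooling.

```python
import json
import logging


class JsonFormatter(logging.Formatter):
    """Render each log record as a single JSON object (hypothetical helper)."""

    def format(self, record: logging.LogRecord) -> str:
        payload = {
            "level": record.levelname,
            "logger": record.name,
            "message": record.getMessage(),
            # "service" is an assumed field, supplied by callers via `extra=`.
            "service": getattr(record, "service", "unknown"),
        }
        return json.dumps(payload)


def build_logger(name: str) -> logging.Logger:
    """Return a logger that writes JSON lines to stderr."""
    logger = logging.getLogger(name)
    handler = logging.StreamHandler()
    handler.setFormatter(JsonFormatter())
    logger.handlers = [handler]   # replace any handlers from earlier calls
    logger.setLevel(logging.INFO)
    logger.propagate = False      # avoid duplicate output via the root logger
    return logger


log = build_logger("checkout")
log.info("payment authorised", extra={"service": "payments-api"})
```

Keeping every record to one JSON object per line lets a log pipeline filter on fields such as `service` without fragile text parsing.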

Collaboration is key: you will work across multiple functions to integrate reliability and observability best practices into the software development life cycle. By supporting governance standards set by the central teams, you will foster a culture where these principles are integral to development. Your contributions will ensure our systems meet user demands and enhance overall service performance.

This role is eligible for inclusion in the Company’s hybrid working from home policy.

Preferred skills and experience

  • Excellent knowledge of Site Reliability Engineering principles, including the creation and management of effective Service Level Indicators (SLI) and Service Level Objectives (SLO) for reliability and customer satisfaction.
  • Knowledge of contemporary observability tools, techniques and best practice, including Splunk, New Relic, Grafana and PagerDuty.
  • Knowledge and experience of modern software development techniques and lifecycles.
  • Experience with Infrastructure as Code (IaC) automation and orchestration tools such as Ansible and Terraform.
  • Prior experience working in a large-scale, 24/7 enterprise where system uptime and stability are of paramount importance to the Business.
  • Keen interest in industry trends, particularly Platform Engineering.
  • Proficiency in shell scripting for automation and system management tasks.
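
For readers less familiar with the SLI and SLO terminology in the list above, the relationship can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions: the `availability_report` function, the `SloReport` type and the request counts are hypothetical, and a real SLI would be derived from production telemetry rather than hard-coded numbers.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SloReport:
    sli: float              # observed fraction of good requests
    error_budget: float     # allowed fraction of bad requests (1 - target)
    budget_consumed: float  # share of the error budget already used


def availability_report(good: int, total: int, slo_target: float) -> SloReport:
    """Compare an availability SLI against its SLO target."""
    if total <= 0:
        raise ValueError("total must be positive")
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    sli = good / total
    error_budget = 1.0 - slo_target
    budget_consumed = (1.0 - sli) / error_budget
    return SloReport(sli, error_budget, budget_consumed)


# Assumed example figures: 999,500 good requests out of 1,000,000, target 99.9%.
report = availability_report(good=999_500, total=1_000_000, slo_target=0.999)
print(f"SLI={report.sli:.4%} budget consumed={report.budget_consumed:.0%}")
```

With these assumed figures the service has used roughly half of its error budget; a value above 100% would mean the SLO has been breached for the measurement window.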

Main Responsibilities

  • Writing and contributing to code that enhances the reliability and observability of services, including telemetry, operational APIs and tooling.
  • Developing and maintaining tools that facilitate effective management of our systems, ensuring they are operationally efficient and resilient.
  • Working with automation and orchestration platforms to automate manual activity and reduce toil.
  • Building sophisticated dashboards using a range of telemetry data and dashboarding technologies like Grafana, Splunk and New Relic.
  • Maintaining and administering existing monitoring and analytic toolsets.
  • Mentoring colleagues in the use of new technologies or practices.
  • Actively participating in live incident resolution and post-mortem analysis, providing effective remediation strategies to improve overall system health and prevent future issues.
  • Driving initiatives to enhance system reliability and observability, contributing to a culture of continuous improvement.
  • Collaborating with the central Site Reliability Engineering and Observability teams to establish and uphold standards for reliability and observability, assisting teams in adhering to these practices.
  • Working with IT Operations, providing and supporting the use of critical tooling to enable increasing levels of value to the Business.

Seniority level

Mid-Senior level

Employment type

Full-time

Job function

Information Technology

Industries

Gambling Facilities and Casinos
