Site Reliability Engineer


Job details
  • bet365
  • Manchester
  • 1 week ago

Who we are looking for

A Site Reliability Engineer who will develop software solutions, consult with development teams and work with modern telemetry data to maintain and improve the performance of key systems.


The site reliability team provide an increasingly important service to our technology department.


Focusing on application performance, reliability, availability, capacity and health, you will work with other teams across the platform department to help ensure our critical systems are reliable and observable. You will build solutions that minimise toil and deliver operational efficiency at scale for the teams that operate those systems.


You will work with a wide range of technologies developing solutions, consulting with development teams and working with contemporary observability and incident management tools to assist the Business. You will be required to make effective decisions to improve the health and maintain the availability and performance of some of our most critical systems.


This role is eligible for inclusion in the Company’s hybrid working from home policy.


Preferred skills and experience

  • Excellent knowledge of SRE principles, including the creation and management of effective SLIs and SLOs for reliability and customer satisfaction.
  • Knowledge of contemporary observability tools, techniques and best practice, including Splunk, New Relic, Grafana and PagerDuty.
  • Excellent knowledge of programming languages including Python, Golang and JavaScript.
  • Knowledge and experience of modern software development techniques and lifecycles.
  • Experience with automation and orchestration platforms such as Ansible and Jenkins.
  • Prior experience working in a large-scale, 24/7 enterprise where system uptime and stability are of paramount importance to the business.
  • Keen interest in industry trends, particularly DevOps.


Main Responsibilities

  • Developing bespoke in-house tooling using a range of technologies to provide effective operational support capabilities for our colleagues in IT Operations.
  • Working with automation and orchestration platforms to automate manual activity and reduce toil.
  • Building sophisticated dashboards using a range of telemetry data and dashboarding technologies such as Grafana, Splunk and New Relic.
  • Maintaining and administering existing monitoring and analytic toolsets.
  • Mentoring colleagues in the use of new technologies and practices.
  • Contributing to the evolution of team processes and approaches.
  • Collaborating with colleagues in the wider platform teams to determine requirements and solutions, to solve problems and progress work.
  • Working with IT Operations to provide and support the use of critical tooling that will enable increasing levels of value to the Business.


By applying to us you are agreeing to share your Personal Data in accordance with our Recruitment Privacy Policy - http://www.bet365careers.com/privacypolicy.pdf.

