bet365

Observability Engineer

Posted 5 Hours Ago
Hybrid
Manchester, Greater Manchester, England, GBR
Senior level
Design, build and evolve a company-wide observability platform: instrument applications (OpenTelemetry), integrate APM/monitoring tools, create dashboards using logs/metrics/traces, manage incident response (PagerDuty), automate onboarding and workflows with IaC/automation, maintain monitoring tooling, set observability standards, mentor colleagues, and collaborate across teams to reduce MTTD and MTTR.
The summary above was generated by AI
Company Description

We’re one of the world’s leading online gambling companies, revolutionising the industry since 2000. Founded by Denise Coates CBE, we now employ over 10,000 people and serve over 120 million customers in 26 languages.

We empower our employees to push boundaries and explore new ideas, cultivating a culture that celebrates and rewards creativity. This offers employees a wealth of growth opportunities and the chance to make a real impact in the world of online gambling. As a forward-thinking company, we’re breaking new ground in software innovation too, redefining what’s possible for our customers worldwide.

Our focus on In-Play betting has solidified our market-leading position, featuring more than 1.38 million In-Play sporting events a year. With over 750 concurrent sporting fixtures at peak and more live sports streamed than anyone else in Europe (750,000), we handle over 6 million HTTP requests daily and process more than 1.5 million bets per hour at peak.

Job Description

As an Observability Engineer, you will build and evolve our modern observability platform, ensuring our systems stay healthy and performant for millions of users. 

We’re moving from simple monitoring to an observability-first mindset, and you’ll be at the heart of this shift. You’ll design solutions that give us deep insights into system health, helping us reduce MTTD and MTTR. You’ll work with a comprehensive toolkit to provide analytics, alerting, and remediation strategies for our cloud and on-premises applications.
 
This role is about more than just keeping the lights on; it’s about building a platform that lets us truly understand our systems. You’ll set the standards for observability, ensuring it’s baked into every new system we build.  

This role is eligible for inclusion in the Company’s hybrid working from home policy. 

Qualifications

  • Excellent knowledge of contemporary monitoring, analytics tooling and best practice. 
  • Strong experience integrating systems and applications with monitoring and APM tools. 
  • Demonstrable experience instrumenting applications for observability, ideally with OpenTelemetry. 
  • Experience with IaC, automation and orchestration tools such as Ansible and Terraform. 
  • Basic programming experience, ideally with Python, Golang or JavaScript.
  • Basic scripting ability with PowerShell and Bash.
  • Strong experience working in a large-scale, 24/7 enterprise where system uptime is paramount.
  • Experience with public and private cloud.
  • Proficiency with the Linux operating system.
  • Ability to work with autonomy and collaborate well within a wider team. 

Additional Information

  • Building sophisticated monitoring dashboards using log data, metrics, traces and profiles from sources like New Relic, Grafana, Splunk, Kibana and Pyroscope. 
  • Administering an incident response platform, like PagerDuty, to enable fast and efficient resolution of incidents.
  • Working with service owners on integrations while supporting the onboarding of telemetry data. 
  • Using automation and orchestration platforms to streamline manual processes and workflows. 
  • Promoting an observability-first mindset and encouraging best practices across teams.
  • Contributing to the development of standards for monitoring, logging and tracing. 
  • Evolving team processes and approaches. 
  • Mentoring colleagues in the use of new technologies or practices. 
  • Maintaining and administering existing monitoring and analytics tools.
  • Collaborating across teams to solve complex challenges and prevent recurrence. 

By applying to us you are agreeing to share your Personal Data in accordance with our Recruitment Privacy Notice - https://www.bet365careers.com/privacy-policy

At bet365, we're committed to creating an environment where everyone feels welcome, respected and valued. Where all individuals can grow and develop, regardless of their background. We're Never Ordinary, and we're always striving to be better. If you need any adjustments or accommodations to the recruitment process, at either application or interview, please don’t hesitate to reach out.

bet365 Manchester, England Office

Spring Gardens, Manchester, United Kingdom, M2 1AB
