Factset Logo

Factset

Lead Site Reliability Engineer

Job Posted 17 Days Ago Posted 17 Days Ago
Be an Early Applicant
Remote
Hiring Remotely in United Kingdom
Senior level
Remote
Hiring Remotely in United Kingdom
Senior level
Lead the design and maintenance of reliable systems, automate processes, troubleshoot issues, and collaborate with various teams to enhance system performance and scalability.
The summary above was generated by AI

We are seeking a highly skilled and motivated Site Reliability Engineer (SRE) to join our growing team. As an SRE, you will play a critical role in ensuring the reliability, scalability, and performance of our software systems and infrastructure. The ideal candidate possesses a strong background in coding, automation, and system administration, combined with a passion for continuously improving system reliability.

 

Responsibilities:

  • Collaborate with development, operations, and product teams to define, review, and implement reliability standards and best practices.
  • Design, implement, and maintain highly available and scalable architectures for our applications and infrastructure.
  • Develop and enhance automated tools and frameworks to optimize system monitoring, deployment, and recovery.
  • Troubleshoot and resolve complex issues throughout the entire software stack, including networking, databases, and distributed systems.
  • Conduct performance analysis and capacity planning to ensure system scalability and resource optimization.
  • Take a proactive approach to continuously improving reliability.
  • Participate in incident response, root cause analysis, and postmortem activities to identify and rectify system failures.
  • Collaborate with cross-functional teams to implement and improve CI/CD pipelines, ensuring reliable and efficient software releases.
  • Stay up-to-date with emerging technologies and industry trends, actively contributing to ongoing system improvements.
  • Participate in on-call rotation.

 

Requirements:

  • Bachelor's degree in Computer Science, Engineering, or equivalent practical experience.
  • Proven experience deploying and managing large-scale distributed systems successfully.
  • Understanding of SRE concepts (error budgets, SLIs/SLOs, blameless postmortems)
  • Proficiency in programming languages such as Python, C++, or Go
  • Familiarity with monitoring and observability tools.
  • Excellent problem-solving skills and ability to troubleshoot complex issues efficiently.
  • Strong organizational and communication skills, with the ability to collaborate effectively in a cross-functional team environment.

 

Desirable Qualifications:

  • Familiarity with security best practices and experience implementing security measures in a production environment.
  • Experience with modern infrastructure technologies and tools, including cloud platforms (AWS, Azure, GCP), containers (Docker, Kubernetes), and orchestration (Ansible, Chef, Puppet).
  • Solid understanding of networking protocols and technologies (TCP/IP, DNS, load balancing).
  • Demonstrated experience with infrastructure as code (IaC) and automation tools (e.g., Terraform, GitHub Actions).

 

Join our team and contribute to creating and maintaining a highly reliable and performant infrastructure that supports our growing platform. Help shape the future of our systems architecture while working in a collaborative and innovative environment.

Top Skills

Ansible
AWS
Azure
C++
Chef
Docker
GCP
Github Actions
Go
Kubernetes
Puppet
Python
Terraform

Similar Jobs

An Hour Ago
Remote
Hybrid
London, Greater London, England, GBR
Senior level
Senior level
Information Technology • Software • Analytics • Business Intelligence • Consulting
As a Software Engineer, you'll design and develop Cloud Contact Center solutions, conduct unit testing, and mentor junior staff, while collaborating with teams in a remote setting.
Top Skills: AcdCcaasCloud Contact CenterCrm IntegrationIvrNice Cxone
An Hour Ago
Remote
Hybrid
London, Greater London, England, GBR
Senior level
Senior level
Information Technology • Software • Analytics • Business Intelligence • Consulting
The CCaaS Application Architect designs complex Contact Center applications, managing client projects, leading teams, and troubleshooting technical issues while leveraging extensive knowledge of CXone and cloud migration.
Top Skills: Amazon ConnectAws CloudpipelineAzure DevopsC++CicdCxoneDevOpsGenesys CloudGitJavaJavaScriptJenkinsPython
7 Hours Ago
Remote
Hybrid
GB
Senior level
Senior level
Productivity • Sales • Software
Lead the Reporting team at monday.com, shaping engineering standards and fostering a culture of innovation while delivering impactful solutions.
Top Skills: AWSK8SMySQLNode.jsReactRedisRuby On Rails

What you need to know about the Manchester Tech Scene

Home to a £5 billion digital ecosystem, including MediaCity, which consists of major players like the BBC, ITV and Ericsson, Manchester is one of the U.K.'s top digital tech hubs, at the forefront of advancements in film, television and emerging sectors like as e-sports, while also fostering a community of professionals dedicated to pushing creative and technological boundaries.
By clicking Apply you agree to share your profile information with the hiring company.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account