Valstro

Site Reliability Engineer (SRE)

Posted 9 Days Ago
In-Office or Remote
Hiring Remotely in London, Greater London, England
Mid level
The Site Reliability Engineer will ensure system reliability and performance of a cloud-based trading platform, automate tasks, and improve integration and operations.
The summary above was generated by AI

Valstro is looking for a Site Reliability Engineer (SRE) to join our team! This person will help ensure the reliability, availability, and performance of our cloud-native trading platform. The role entails building and maintaining infrastructure, automating processes, and working closely with the Development and Platform teams to ensure seamless integration and deployment of the service.

The successful candidate will serve as an essential link between the wider organization, executive leadership, and external vendors. Their responsibilities will include ensuring system reliability, building and maintaining monitoring solutions for both production and UAT systems, automating operational tasks, responding to incidents, and continuously improving systems and processes.

This is a remote position that will report to the Site Reliability Lead.

What will you be doing?

· Act as a key intermediary between engineering, executive leadership, and external vendors.

· Ensure the reliability, availability, and performance of our cloud-based trading solutions.

· Develop and maintain monitoring solutions to track system performance and reliability.

· Automate operational tasks to improve efficiency and reduce manual intervention.

· Collaborate with development teams to ensure seamless integration and deployment.

· Respond to incidents and troubleshoot issues to minimize downtime.

· Continuously improve systems and processes to enhance reliability and performance.

· Participate in on-call rotations to provide 24/7 support for critical systems.


Requirements

· 3+ years of experience supporting production-level systems.

· Strong experience in site reliability engineering, systems engineering, or a related field.

· Proficiency in cloud-based infrastructure (e.g., AWS, Azure, or Google Cloud).

· Experience with monitoring and logging tools (e.g., ELK, LGTM, Prometheus, Datadog).

· Expertise in automation and scripting (e.g., Golang, Python, Bash, Terraform).

· Knowledge of containerization and orchestration (e.g., Docker, Kubernetes).

· Ability to effectively communicate and liaise between stakeholders, including internal teams, executive management and external vendors.

· Strong troubleshooting and problem-solving skills.

· Experience in establishing and enhancing reliability engineering practices and processes.

· Capable of operating effectively in a dynamic organizational environment with high delivery and quality expectations.

Fintech experience is a bonus.

Technical

· A recent bachelor's degree in Computer Science, Software Engineering, or a related field.

· Knowledge of SRE practices.

· Knowledge of observability and tooling, particularly the Grafana stack.


Benefits

Valstro offers an excellent benefits package, including pension or 401(k) plans, unlimited PTO, and highly competitive compensation. Our leadership team brings a wealth of experience and deep industry knowledge, and despite being a young company, we believe we have carefully dialed in our product-market fit.
