Similar Jobs
As industries race to embrace AI, traditional database solutions fall short of rising demands for versatility, performance, and affordability. Couchbase is leading the way with Capella, the developer data platform for critical applications in our AI world. By uniting transactional, analytical, mobile, and AI workloads into a seamless, fully managed solution, Couchbase empowers developers and enterprises to build and scale applications with unmatched flexibility, performance, and cost-efficiency—from cloud to edge. Trusted by over 30% of the Fortune 100, Couchbase is unlocking innovation, accelerating AI transformation, and redefining customer experiences. Come join our mission.
Role Overview
At Couchbase, Site Reliability Engineers are hybrid software and systems engineers. They are the glue holding things together, whether that’s infrastructure / platform, tooling support for our cloud business or managing Observability posture for Couchbase. In this role the candidate we are looking for is for the Observability team which is responsible for maintaining Reliability, Availability and Serviceability for the entire Couchbase cloud offerings. You will be working as a Software Engineer developing and maintaining Couchbase monitoring stack which includes metrics pipeline, alerting, notifications and the likes. You will be working with many software engineers and teams to ensure our cloud offerings meet the monitoring needs. You will have an immediate impact on the day-to-day efficiency of cloud operations and an ongoing impact on growth.
Responsibilities
- Develop/maintain software features in the Observability stack which includes metrics pipeline, alerting, logging and notifications
- Create/maintain monitoring dashboards which gives insights to our customer cluster health
- Develop control plane features requiring observability needs
- High level of ownership and responsibility as we work on production environments to debug a problem
- Working knowledge of AWS services like EC2/EKS/SQS etc.
- Collaborate with the Engineering teams to gather requirements for monitoring and implementation
- Be creative and bring in efficient processes and solutions to enhance team productivity
- Demonstrate exceptional problem-solving skills, with an ability to identify and solve issues before they affect business productivity
- Roll up your sleeves to be a full stack engineer as we build end-end software solutions in the Observability domain
Requirements
- 2+ years experience as a software developer
- Proficiency with programming and scripting languages like Go, Python, Java, or Ruby
- Strong ability to write code, understands basic DSA concepts
- Proficiency with Linux operating systems
- Exposure to any one of the CSPs like AWS/Azure/GCP
- Working experience in Grafana, Prometheus, Thanos and/or DataDog
- Strong debugging skills to mitigate a production issue
Preferred skills and qualifications
- Experience with on-call rotations & incident management
- Experience in developing and managing Kubernetes clusters both self-managed (vanilla/plain k8s) & managed (preferably AWS EKS) will be a plus
- Proficiency with Databases such as Couchbase is a plus
- Exposure to front end development like ReactJS will be a plus
- Generous Time Off Program - Flexibility to care for you and your family
- Wellness Benefits - A variety of world class medical plans to choose from, along with dental, vision, life insurance, and employee assistance programs*
- Financial Planning - RSU equity program*, ESPP program*, Retirement program* and Business Travel Insurance
- Career Growth - Be valued, Create value approach
- Fun Perks - An ergonomic and comfortable in-office / WFH setup. Food & Snacks for in-office employees.
- And much more!
News and Press Releases
Couchbase Capella
Couchbase Blog
Investors
Couchbase Manchester, England Office
1a Tariff Street, Manchester, United Kingdom, M1 2FF