About the department
Cloudflare's Resiliency Engineering Team builds and runs the systems and software behind our solutions, which handle trillions of requests per month. Resiliency Engineering ensures that all of the new and existing features and functionality that Cloudflare offers can be managed at scale and meet the needs of our rapidly growing customer base. The Infrastructure Tooling Team within the Resiliency Engineering organization is responsible for defining, building and supporting the tools that the rest of the Infrastructure Engineering team uses to manage our infrastructure at scale.
What you'll do
At most companies, you build applications "in" the Cloud - here at Cloudflare we're building a Cloud; a uniquely performant, globally distributed and highly available Cloud. In this role, you will work with several teams of passionate and talented engineers who are building the internal Control Plane used by our SREs and Infrastructure Operations teams to manage our internal DCaaS and IaaS platforms. You will be responsible for tools that support the management of a growing, globally distributed fleet of servers, storage, and network gear spread across over a thousand colos worldwide. You will play an active part in shaping the future of the infrastructure that propels Cloudflare's scale and growth. Along the way you will have the opportunity to write code to bring these designs to fruition as well as to mentor high-potential engineers on their distributed systems journey.
You will be working alongside engineers who have presented at DevOpsDays, Config Management Camp 2024 & 2025, Monitorama, OSMC, KubeCon and PromCon. Together you will deliver on the key Health Mediated Deployment projects that are tracked by Cloudflare's senior leadership, up to the founders.
Examples of desirable skills, knowledge and experience
- Minimum 10 years of experience working with distributed systems.
- Experience designing, building and managing high volume software applications.
- Expertise in at least one modern, strongly typed programming language
- Experience debugging, measuring, optimizing and identifying failure modes in a large-scale distributed system.
- Excellent collaboration skills
- Proven ability to convey ideas effectively through verbal and written communication
- Ability to translate business needs into requirements, design documents and technical solutions
- Knowledge of API design standards, patterns and best practices
- Proven ability to use data to drive business outcomes
- Proven experience in developing architects and lead engineers
- Solid understanding of computer science fundamentals including data structures, algorithms, and object-oriented or functional design.
Bonus Points
- Experience with optimizing and scaling infrastructure provisioning, repair, and decommissioning processes and automations.
- Experience with scaling and simplifying Configuration Management systems managing hundreds of thousands of nodes
Compensation
Compensation may be adjusted depending on work location.
- For New York City, Washington, Washington D.C. and California (excluding Bay Area) based hires: Estimated annual salary of $230,000 - $281,000
Equity
This role is eligible to participate in Cloudflare's equity plan.
Benefits
Cloudflare offers a complete package of benefits and programs to support you and your family. Our benefits programs can help you pay health care expenses, support caregiving, build capital for the future and make life a little easier and more fun! The following describes our benefits for employees in the United States; benefits may vary for employees based outside the U.S.
Health & Welfare Benefits
- Medical/Rx Insurance
- Dental Insurance
- Vision Insurance
- Flexible Spending Accounts
- Commuter Spending Accounts
- Fertility & Family Forming Benefits
- On-demand mental health support and Employee Assistance Program
- Global Travel Medical Insurance
Financial Benefits
- Short and Long Term Disability Insurance
- Life & Accident Insurance
- 401(k) Retirement Savings Plan
- Employee Stock Participation Plan
Time Off
- Flexible paid time off covering vacation and sick leave
- Leave programs, including parental, pregnancy health, medical, and bereavement leave