posted Jun 21

Site Reliability Engineering (SRE) Manager

Cloud Distributed Systems Google Cloud Platform Kubernetes mid

Job Location: San Francisco, California

Salary: $160,000 - $225,000 a year

Job Description

• Censys is looking for an SRE Manager to join our organization to lead our Infrastructure & Ops team. Our engineers need a manager who will push them to self-organize, to grow technically, to grow professionally, and to be there to help them remove roadblocks. Managers at Censys understand that only by trying to distribute responsibility and obsolete ourselves do we actually empower our teams and gain enough time so we can effectively identify new opportunities to improve the organization. • Our Infrastructure & Ops team is responsible for managing basic developer experience, strategizing with development teams on projects, operating and maintaining our cloud and applications environment, and operating our on-premise co-located data centers for our scanning and data teams. • The role of the SRE Manager is to participate in the daily execution of the team, from daily standup to planning, refining, and prioritizing of engineering requirements and deliverables. They seek to empower success by ruthlessly prioritizing and stakeholding with the other engineering teams and business partners to achieve the goals of the greater Censys product roadmap. They will build trust with stakeholders and partners through diplomacy, discussion, and follow-through. This is a broad cross-organization role with high visibility, collaborating with multiple teams. They are expected to invest in and build good relations with key partners. Their collaboration with internal customers, product engineering, and development groups is critical to success.

Qualifications

• 5+ years of experience supporting large-scale, distributed systems, combining hardware and cloud experience. • 3+ years of experience building and leading engineering teams; ideally SRE, Infrastructure, or Production engineering teams. • Superb interpersonal skills, capable of working with multi-functional technical and business teams and varying levels of management, influencing decision-making. • Understanding of SRE principles, including monitoring, alerting, error budgets, fault analysis, and other common reliability engineering concepts, with a keen eye for opportunities to eliminate toil by code and process improvements. • Experience supporting Software Engineering teams through a Platform methodology, embracing a Dev-Sec-Ops culture to allow development teams to own their services end-to-end from development to production. • Emphasis on SRE as an engineering subject area, preferably with proficiency in software engineering and development practices. • Experience running or facilitating a production outage escalation and incident response program, and coordinating vulnerability and patch management with IT and Corporate Security teams. • Optional experience operating hybrid on-premise and cloud environments, with experience managing data center operations and bare metal environments, including high availability data center design, hardware capacity planning, L2-L3 network design and security, ISP peerings, and disaster recovery. • Optional experience with Kubernetes-based application environments, including maintaining and scaling clusters that contain several hundreds of nodes, and facilitating complex distributed systems and software through mechanisms such as auto-scaling. • Optional experience managing a GCP-based cloud architecture, including efforts around reducing unnecessary cloud spend on under-utilized or poorly optimized infrastructure.

Benefits

• Our target salary range for this role is between $160,000 USD and $225,000 USD + bonus eligibility and equity. • Our roots are in Ann Arbor, Michigan with location hubs in Seattle, the Bay Area, Washington D.C., and Dublin, Ireland. Our innovation is fueled by the team’s global perspectives and diverse backgrounds.

Related Jobs

logo
Company
Henry Schein One
Post Date
New
Title
.NET Staff Software Engineer
Type
$120,000 - $160,000 a year
Location
Remote
logo
Company
KUBRA
Post Date
New
Title
Senior Security Architect
Location
Unknown, California
logo
Company
Okta
Post Date
New
Title
Staff Site Reliability Engineer (Customer Identity Cloud)
Type
$160,000 - $240,000 a year
Location
Remote
logo
Company
Kiddom
Post Date
New
Title
Senior Software Engineer, Infrastructure
Location
Remote
logo
Company
OwnBackup
Post Date
New
Title
Team Lead, Production Engineer
Type
$160,000 - $210,000 a year
Location
Unknown, California