posted May 31

Senior Site Reliability Engineer

Ansible AWS Bash Cloud Docker EC2 Go Grafana Java Jenkins Kubernetes Packer Prometheus Python Spinnaker Terraform Unix senior

Job Location: Remote

Job Description

• Keep people safe and businesses running.
• Be an integral member of the team implementing our platform in a DoD IL4 cloud environment.
• Own and maintain the Kubernetes infrastructure within AWS from conception to completion, including services such as VPCs, EC2, Transit Gateways, IAM roles and policies, Route53, S3, SGs, and NACLs.
• Build upon the operational availability, security, scalability, efficiency, monitoring, instrumentation, and overall service reliability of Everbridge's Kubernetes solutions.
• Collaborate across Agile teams with Architects, Developers, Quality, Data, Security, and other engineers to design and implement highly reliable solutions.
• Research and implement SRE and Kubernetes best practices, using automation, cross-functional collaboration, and data-driven decisions to reinforce the integrity and reliability of our systems.
• Participate in an on-call rotation to resolve production escalations.

Qualifications

• 3+ years of technical AWS experience managing and owning systems in a production environment
• 2+ years of Kubernetes experience (EKS, AKS, GKE, or self-managed)
• 3+ years of experience with Terraform or a similar IaC tool
• An active Secret clearance, or the ability to obtain one
• Experience with the following tooling: GitLab CI/CD, Packer, Docker, EKS, Kubernetes, Spinnaker, Helm, Argo, Jenkins
• Experience with telemetry tools such as Datadog, SumoLogic, Grafana, Prometheus
• Experience writing automation in languages such as Python, Go, Bash, Java
• Experience with configuration management tools such as Salt, Ansible, AWS user_data
• Experience with a DevOps/SRE production environment
• Experience with Agile practices
• Large-scale production UNIX/Linux experience
• Experience working on DoD IL4 programs

Benefits

• 11 paid holidays
• Generous accrued time off, increasing with years of service
• Generous paid sick time
• Annual day of service
