Job Description
Job Overview
Checkr is seeking a Site Reliability Engineering Manager to lead and mentor a team of SREs focused on improving the observability and reliability of cloud-based applications. The role involves defining service level objective metrics, managing incident response, and ensuring the performance of application endpoints. The ideal candidate will have extensive knowledge of AWS, Kubernetes, and observability tools, and will contribute to Checkr’s mission of building a trusted data platform.
Technical Requirements
Required Skills
- Leadership
- Observability
- Incident Management
- Mentoring
- Automation
Preferred Skills
- Datadog
- Terraform
- Kubernetes
- Python
Experience Level
8+ years in a relevant role with 4+ years in technical leadership
Responsibilities
- Lead and mentor a team of Site Reliability Engineers
- Define and track metrics related to SLOs, SLIs, and SLAs
- Operationalize incident management and communication
- Collaborate with Engineering Managers to define metrics and dashboarding requirements
- Assist in planning for the growth of Checkr’s infrastructure and reliability
Benefits & Perks
- Learning and development allowance
- 100% medical, dental, and vision coverage
- Up to $25K in reimbursement for fertility, adoption, and parental planning services
- Flexible PTO policy
- Monthly wellness stipend, home office stipend
Additional Information
- Location: Denver, CO; San Francisco, CA
- Type: Hybrid work environment
- Compensation: Base salary range of $197,000 to $232,000 in Denver, CO; $233,000 to $274,000 in San Francisco, CA