Senior Manager - Site Reliability Engineering, Datacenter Hardware and IaaS
Posted 2025-04-24Description:
?? GEICO is seeking an experienced Senior Manager with a passion for building high performance, low-latency platforms, and applications.
?? You will build and manage a team of engineers with a deep focus on delivering enterprise-wide product to operate in a highly performant and efficient way.
?? The ideal candidate has deep technical expertise to improve application performance, capacity benchmarking, improve availability and reliability, design and evolve cloud infrastructure and architecture.
?? A Senior Manager will lead strategy and execution of a technical roadmap that will increase the velocity of delivering products and unlock new engineering capabilities.
Requirements: ?? Strong knowledge in modern at-scale datacenter architectures. ?? Experience with OCP hardware and related technologies (eg. OpenBMC, Redfish), bonus for knowledge in low level driver development. ?? Focus on leveraging infrastructure as code as a primary means of control. ?? Building CI/CD chains for datacenter operations ?? Experience in building IaaS systems based on OpenStack ?? Knowledge of cloud computing technologies and concepts (SaaS, PaaS, IaaS, etc) ?? Working knowledge of object-oriented development, Gang of Four (GOF) Design Patterns, Microservices, Dependency Injection with IOC containers, and both frontend and backend unit testing ?? Proven ability to concentrate and demonstrate a capacity for learning technical concepts and adapting to new technologies quickly ?? Strong Cloud (AWS, GCP, Azure etc.) platform knowledge ?? Proficiency in Project Management and work item management tools such as Azure DevOps and Portfolio ?? Strong foundation in algorithms, data structures, and core computer science concepts ?? Experience in existing Operational Portals such as Azure Portal ?? Fluency with Python, Golang, JSON, and RESTful Web Services ?? Experience with application monitoring tools and performance assessments ?? Experience in PowerShell Scripting ?? Constructing, interpreting, and applying metrics to your work and decision making, able to use those metrics to identify correlation between drivers and results, and using that information to drive prioritization and action ?? Strong understanding of Site Reliability Engineering and DevOps principles ?? Strong technical acumen in Cloud Architecture, Performance Benchmarking, and Capacity planning ?? Expert in Container orchestration (e.g., Kubernetes), container runtimes and optimization ?? Experience with driving cultural change in technical excellence, quality, and efficiency ?? Experience managing and growing technical leaders and teams ?? In-depth knowledge of CS data structures and algorithms
Benefits:
?? Premier Medical, Dental and Vision Insurance with no waiting period**
?? Paid Vacation, Sick and Parental Leave
?? 401(k) Plan
?? Tuition Reimbursement
?? Paid Training and Licensures
Apply Job!