Senior Site Reliability Engineer
The Role
The Senior Site Reliability Engineer is the heart of our IT infrastructure and Development Operations. This unique role architect, builds, optimizes, and maintains our cloud systems. This role is responsible for providing architecture and DevOps services to our engineering team. At the same time, you continue to own the environment, ensuring it is available, reliable, secure, and optimized to meet the needs of our staff and members.
This role reports to the Senior Cloud Operations Manager and has minimal off-hours or on-call support requirements.
Roles and Responsibilities
- Architect, design, build, maintain, and optimize our cloud environments
- Assess and remediate security issues and steer our organizational security initiatives
- Develop proactive systems and performance monitoring
- Provide DevOps services to our engineering teams
- Develop build and release automation (CI/CD Pipelines, Environment Builds, Config Mgmt, etc.)
- Develop and maintain our containers infrastructure and infrastructure as code
- Support engineering teams during development and releases
- Automate routine work, manual tasks, and remediation efforts
- Own the physical and cloud networking
- Develop and support our Disaster Recovery program
Skill or Technology Requirements
Minimum of 5+ years hands-on experience in all the following:
- PowerShell Scripting
- Designing and building highly available, reliable, and secure Azure cloud environments
- CI/CD pipelines, environment builds, code check-ins, and branching via Azure DevOps
- Designing and building containers using Kubernetes and Docker
- Managing roles, rules, and permissions in various services
- Networking (Cisco hardware, Physical, Virtual, TCP/IP, DNS, firewalls, SDN, Wi-Fi, etc.)
- Windows and Linux Administration (98% Windows)
- Systems Virtualization
- Infrastructure as Code Tooling (Terraform)