Cyber Recovery Infrastructure Site Reliability Engineer (SRE)
At Allstate, great things happen when our people work together to protect families and their belongings from life’s uncertainties. And for more than 90 years our innovative drive has kept us a step ahead of our customers’ evolving needs, from advocating for seat belts, air bags and graduated driving laws to being an industry leader in pricing sophistication, telematics and, more recently, device and identity protection.
We have one position on this team focused on networking and can hire at the Lead or Expert level, depending on qualifications and interview evaluation. This opportunity is fully remote and based in the US.
The Cyber Recovery Infrastructure Site Reliability Engineer (SRE) is part of a team that manages and operates the Allstate Cyber Recovery Environment. The Cyber Recovery Environment consists of hardware and software platforms required to support the most critical business functions needed when normal production systems are not accessible or available. This role encompasses managing detailed system design as well as implementation, testing and coordination of operational maintenance of infrastructure associated with the Cyber Recovery Environment, with a focus on network systems and technologies. The SRE with a networking focus works with other SREs as well as production infrastructure engineers to support the Cyber Recovery Environment platforms, manage the lifecycle of Cyber Recovery Environment assets, and orchestrate and regularly test restoration functions.
Accountabilities
- Build long-term relationships within team and amongst peers by creating an environment of safety, innovation, integrity, and honest open communication.
- Set and execute platform strategy, including objectives, plans and policies to develop and deliver innovative solutions and systems in support of enterprise goals.
- Coordinate with decision makers in other departments to identify, recommend, develop, implement and support cost-effective technology solutions for all aspects of the organization.
- Drive decisions that have an impact on quality, availability and effectiveness of business activities beyond immediate teams.
- Initiate and implement continuous improvements in the Cyber Recovery Environment, including automation of complex tasks and delivery of cyber-attack recovery-related services.
- Influence direct reports, clients, service providers and peers to successfully deliver on cyber resiliency goals and commitments.
- Partner with others in the organization to set and manage expectations associated with Allstate's Cyber Recovery Environment, and continually seek opportunities to be a thought partner and deepen relationships.
- Adapt communication approach for audiences at multiple internal and external levels.
Education
- Bachelor’s degree preferred
Required Experience and Qualifications
- 5+ years of experience managing computing platforms.
- Experience working in a geographically dispersed team environment.
- Experience working with enterprise level IT infrastructure platforms including Network, Network Security, Storage, Compute.
Preferred Experience and Qualifications
- Experience in the delivery and support of Cyber Resiliency and Recovery Capabilities
- Basic understanding of network technologies, capabilities and systems (route/switch, DDI, load balancing, networking in CSPs, etc.).
- Exposure to and experience with both on-premises and cloud technology infrastructures. Knowledge of Azure and/or AWS is desirable, with knowledge of GCP a plus.
- Detailed understanding of network switching and routing (OSPF/BGP) in on-premises and CSP environments, DDI, load balancing and network troubleshooting.
- Understanding of network security infrastructure constructs, such as firewall operation
- Exposure to virtual compute infrastructure constructs.
- Understanding of compute, storage and network infrastructure delivery through automation (IaaS).
- The ideal candidate would be familiar with network technologies in both on-premises data centers and cloud environments, and have experience creating and executing orchestrated recovery playbooks for complex infrastructure and application environments that consist of multiple systems.
Skills: Automation Solutions, Incident Management, Infrastructure Design, Root Cause Analysis (RCA), Security Monitoring
Compensation offered for this role is $92,560.00 - $166,465.00 annually and is based on experience and qualifications.
The candidate(s) offered this position will be required to submit to a background investigation.