Site Reliability Engineering Lead

Build and lead a global SRE team to ensure 24/7 cloud service reliability
Cardiff, Wales, United Kingdom
Expert
yesterday
RELX

A global provider of information-based analytics and decision tools for professional and business customers.

Site Reliability Engineering Lead

Location: The position is open to applicants based in the Cardiff region (Wales, United Kingdom) or to those seeking a home-based / fully remote position within the United Kingdom.

About Role: LexisNexis Risk Solutions is seeking a Site Reliability Engineering Lead with proven industry experience to join our global engineering team. This role will be a key player in determining and refining ongoing DevOps practices, agile support and deployment processes, as we continue to develop and improve the reliability of our public-cloud-based services. You will enjoy working in a friendly environment and benefit from our investment in staff. The role also requires participation in an on-call rotation during off-peak hours to maintain 24/7/365 system availability.

About Team: Our teams are collaborative and forward-thinking; the successful candidate will help shape the operations and support for critical applications, customers and projects, working closely with Development, QA, IT Operations and Customer Operations teams. You will be a collaborative team player, with the ability to influence, communicate and solve problems effectively, whilst handling a fast-paced working environment.

Main Responsibilities:

  • Lead SRE teams to build / maintain "Infrastructure as Code", software services (PaaS and SaaS), security policies and continuous integration / deployment processes
  • Removal of technical debt, security hardening and continued optimisation of our cloud-based environments
  • Maintaining critical production services with a view to providing the best possible uptime, with a strong focus on reliability for tier 1 / mission-critical 24/7 services
  • Working with diverse technical and non-technical teams, including Development, QA, IT Operations, Customer Operations and Project Management teams
  • Emphasis on leading FinOps processes for continuous review and ongoing cost optimisation

Essential Skills and Attributes:

People 40% - Leading SRE Team/s to ensure:

  • Rotating staff within the teams across products / platforms to ensure broad experience / skill-set
  • Sharing and collaboration across other SRE Managers/Leads/Cloud Centre of Excellence to ensure best of breed practices are adopted
  • A passion for mentoring and developing direct reports, together with the ability to develop your own skills and keep up to date in a fast-paced sector
  • Maintenance of systems / application documentation for technical and non-technical audiences
  • Appropriate goal setting, ongoing objective setting and general line management of direct reports

Financial 5% - Has strong business acumen and financial budgeting skills

  • Maintain and forecast cloud OPEX spend
  • Ensure that cloud spend is reasonable and constantly optimised via FinOps processes

Customer 20%:

  • Ultimate accountability for the uptime and resiliency of a group of products
  • Ensure 24/7 technical support is provided and Service Level Agreements for customers are met

Technical 30%:

  • Responsibility for examining complex releases to ensure system resilience
  • Driving automation to aid productivity, minimising traditional operational effort and maximising the use of Infrastructure as Code
  • Good understanding of defensive, corrective, detective controls and general application troubleshooting
  • Delivery of resilient application stacks via "Infrastructure as Code" and other DevOps practices
  • Monitoring and ongoing support of critical, high-revenue business applications
  • Diagnosis and resolution of complex system and application issues
  • Relevant experience of hosting mission-critical apps within public cloud providers such as AWS and / or Azure through services such as EC2, ECS, AKS or ACA
  • Experience working with containerised workloads such as Docker and orchestration via Kubernetes

Other 5%:

  • Manage Customer Reliability Engineering Activities driving Application Monitoring, Metrics, Incident Reviews and Long Term Actions
  • Support BISO activities and the implementation of InfoSec changes, with experience of working with security-based tooling such as Qualys, Wiz, Trufflehog and GitHub Advanced Security

Qualifications:

  • 10+ years' experience and a proven background working in a technical, cyber-security-related position.
  • 2+ years' management experience (including people management)
  • BSc Engineering/Computer Science/IT Security or relevant experience.
  • Desirable – AWS / Azure Certifications

Benefits of this role:

  • The opportunity to work on a full range of challenging and interesting technologies and help to conquer some of the next generation of financial crime and compliance related challenges
  • The chance to make a real difference to our customers and society
  • Help us to continually evolve and modernise our Technology stack, including contributing to our technology radar and the evolution of our products
  • The opportunity to learn or improve skillsets around a multitude of technologies, including but not limited to: AWS, Azure, Docker, Kubernetes, and Terraform

What is it like to work here? Outstanding - you have probably already got a feel for what we do and the technology we are involved with, but what really makes us stand out from the crowd is our culture. We are an agile, dynamic and forward-thinking organisation that understands the importance of looking after our staff. We pride ourselves on delivering high-quality products and providing our employees with interesting challenges for their personal and career development, whilst also striking the right balance between work and family life.

Why Work for LexisNexis Risk Solutions (RSG)? Explore our passion for discovery. Global companies and governmental entities rely on us to solve their most complex data challenges. Our employees collaborate to reduce risks and create opportunities for customers in more than 100 countries. We are adaptable, curious, and ambitious. That is why here, you will have the freedom to drive change, the trust to find your own path, and the space to explore more.

We are committed to providing a fair and accessible hiring process. If you have a disability or other need that requires accommodation or adjustment, please let us know by completing our Applicant Request Support Form or by contacting 1-855-833-5120.

Criminals may pose as recruiters asking for money or personal information. We never request money or banking details from job applicants. Learn more about spotting and avoiding scams here. Please read our Candidate Privacy Policy.

We are an equal opportunity employer: qualified applicants are considered for and treated during employment without regard to race, color, creed, religion, sex, national origin, citizenship status, disability status, protected veteran status, age, marital status, sexual orientation, gender identity, genetic information, or any other characteristic protected by law. USA Job Seekers: EEO Know Your Rights.
