View All Jobs 138208

Site Reliability Engineer - Incident Management/resiliency (hybrid)

Lead real-time incident response and facilitate post-incident reviews to improve system reliability
Chicago
Mid-Level
$57,000 – 104,000 USD / year
yesterday
Enova

Enova

Provides online consumer and small business loans using data analytics and technology-driven underwriting platforms.

Site Reliability Engineer - Incident Management/Resiliency (Hybrid)

We are interested in every qualified candidate who is eligible to work in the United States. However, we are not able to sponsor visas or take over sponsorship at this time.

About the Role

Resilience Engineering is a subset of the Site Reliability Engineering team that strives to foster a culture of continuous improvement through incident analysis, process evolution, and problem-solving. We work closely with teams across Tech, Product, and Operations through our Production Incident process to uncover system weaknesses, learn from failures, and make our technology more reliable.

In this role, you'll play a key role in enhancing the resiliency of our systems. Your work will focus on our incident response, reporting and analysis processes, enabling the organization to better prepare for and respond to complex system failures.

You'll drive efforts to optimize how we manage unexpected outages, from leading real-time incident response to facilitating post-incident reviews.

What You'll Be Doing

  • Lead production incidents as part of our PI PIC (or Incident Commander) rotation after completing training, ensuring clear communication and resolution.
  • Capture and maintain detailed documentation of incidents, contributing factors, and learnings in formal incident reports.
  • Deliver documentation that is clear, comprehensive, and accessible to different types of audiences in a timely manner within the established SLAs.
  • Facilitate and document blameless post-incident reviews that promote learning and continuous improvement.
  • Collect and analyze incident data to identify systemic issues, risks, and trends.
  • Work on improvements to how we collect, analyze, and learn from system failures.

Requirements

  • 2+ years experience in a technology or analyst role (e.g., Software Engineering, Systems, Operations, SRE, or Product).
  • A strong interest in how complex distributed systems operate—and how to make them more reliable.
  • Analytical and problem-solving skills with a systems-thinking mindset.
  • Strong communication skills, both verbal and written, with the ability to tailor messaging to technical and non-technical audiences.
  • Comfort with ambiguity, and the ability to turn vague problems into actionable insights.
  • Demonstrated maturity, sound judgment, and organizational awareness.
  • Ability to coordinate the resolution of major incidents and post-incident reviews following Enova's Incident Management Process
  • Ability to seamlessly shift between high-urgency incident response and structured project work, with strong organizational skills and the capacity to manage projects independently.

Nice to Have

  • Experience leading resolution of major system outages or production incidents.
  • Experience driving large-scale technical or process changes.

Compensation

This position includes various levels within our career ladder. The actual annual salary will be determined based on qualifications, skills, experience, and level assessed during the hiring process and may fall outside of the ranges shown.

Budgeted annual salary ranges:

Site Reliability Engineer I: $57,000 - $77,000

Site Reliability Engineer II: $72,000 - $104,000

Additional compensation for this role may include a bonus. All full-time employees are eligible to participate in Company benefits.

Benefits & Perks

  • Our hybrid roles require in-office work Tuesday through Thursday, with remote flexibility on Mondays and Fridays.
  • Health, dental, and vision insurance including mental health benefits
  • 401(k) matching plus a roth option (U.S. Based employees only)
  • PTO & paid holidays off
  • Sabbatical program (for eligible roles)
  • Summer hours (for eligible roles)
  • Paid parental leave
  • DEI groups (B.L.A.C.K. @ Enova, HOLA @ Enova, Women @ Enova, Pride @ Enova, South Asians @ Enova, APEX @ Enova, and Parents @ Enova)
  • Employee recognition and rewards program
  • Charitable matching and a paid volunteer day…Plus so much more!

About Enova

Enova International is a leading financial technology company that provides online financial services through our AI and machine learning-powered Colossus™platform. We serve non-prime consumers and businesses alike, while offering world-class technology and services to traditional banks—in order to create accessible credit for millions.

Being a values-driven organization is at the core of Enova's success. We live our values by listening to our customers, challenging assumptions, thinking big, setting high expectations, and hiring and developing the best. Through our values and our commitment to making Enova an awesome place to work, we maintain an environment of inclusion and culture where our employees can thrive.

+ Show Original Job Post
























Site Reliability Engineer - Incident Management/resiliency (hybrid)
Chicago
$57,000 – 104,000 USD / year
Engineering
About Enova
Provides online consumer and small business loans using data analytics and technology-driven underwriting platforms.