Senior Site Reliability Engineer I
Join our diverse and inclusive team delivering high-quality software worldwide.
Are you someone who enjoys working with others, solving problems creatively, and making a meaningful difference?
About the Business
At ICIS, our purpose is to optimize the world's resources and empower strategic, sustainable decisions by bringing market transparency to all. We support organizations of every size and background with accessible and actionable insights across global value chains. Our team values diverse perspectives and welcomes candidates from all walks of life.
About the Team
Our teams, called Squads, are made up of people with a variety of skills and experiences, including Squad Leads, Business Analysts, Developers, and Testers. We work collaboratively, learn from each other, and support one another to achieve shared goals. Everyone's contributions are valued, and we encourage open communication and continuous learning.
About the Role
As a Senior Site Reliability Engineer I, you will play a key role in ensuring our applications and infrastructure are reliable, scalable, and secure. You will work closely with development, architecture, and service management teams, using your skills to configure, maintain, monitor, and improve our systems. This role is ideal for someone who enjoys tackling challenges, sharing knowledge, and driving innovation that benefits our customers and colleagues.
Responsibilities
- Lead efforts to enhance system reliability and scalability, designing solutions that support our evolving business needs while maintaining security and quality.
- Work collaboratively with software engineers and other teams to design and implement deployment approaches using automated processes for continuous integration and delivery.
- Help design, develop, test, and implement solutions that improve availability, reliability, and scalability of our applications.
- Implement infrastructure, configuration, and network as code for supported applications and platforms.
- Partner with Infrastructure, DevOps, Development, and SRE teams to resolve complex technical challenges.
- Monitor service levels and proactively address issues before they affect our customers.
- Promote and support best practices in site reliability engineering across the team.
Requirements
- Experience working with cloud platforms (such as Amazon Web Services) and Infrastructure as a Service (IaaS).
- Background in DevOps, site reliability engineering practices, or related areas. We value a variety of experiences and encourage you to apply even if you don't meet every listed qualification.
- Understanding of site reliability principles and how they can improve the software development process.
- Familiarity with continuous integration and delivery tools (e.g., Jenkins, GitLab) and infrastructure-as-code tools (e.g., Terraform).
- Experience with containers (such as Docker) and orchestration tools (such as ECS or Kubernetes) is helpful but not required.
- Ability to diagnose and resolve networking, performance, and optimization issues in distributed systems.
- Basic knowledge of Linux, networking, and storage fundamentals.
- Interest in sharing knowledge, supporting teammates, and contributing to a positive team environment.
- Understanding of release, integration, and deployment processes.
- Familiarity with monitoring, logging, and alerting tools (such as Grafana, Prometheus, or similar) is a plus.
- Ability to use scripting languages (such as Python, Bash, or PowerShell) to automate tasks is welcome but not required.
- Strong communication skills and ability to work collaboratively within a diverse team.
Our Commitment to Diversity, Equity, and Inclusion
We believe that diversity, equity, and inclusion make us stronger. We welcome applications from individuals of all backgrounds, experiences, and abilities. If you require an accommodation at any stage of the application process, please let us know—we are happy to support you.
Even if you are unsure whether you meet every qualification, we encourage you to apply. If you have a passion for reliability, collaboration, and continuous improvement, we want to hear from you!