Join us at Kyndryl, where we design, build, manage, and modernize the mission-critical technology systems that the world depends on every day. As a Site Reliability Engineer (SRE) at Kyndryl, you'll ensure reliability, resiliency, and innovation in our information systems and ecosystems. You'll drive continuous improvement and deliver exceptional service to our customers.
Your role will go beyond traditional engineering, as you'll analyze business needs, tackle complex problems, and provide strategic advice and designs. You'll be involved in every stage of the software lifecycle, from building and testing to deploying changes and maintaining robust systems. We're looking for a visionary who can think strategically and help shape the future of our services. Your expertise in building trusted relationships with customers and partnering with them for success will be instrumental in driving our growth.
As an SRE, you'll have the unique opportunity to work on end-to-end services, spanning customer sites and platforms. Collaboration and proactivity are key as you work alongside a talented team of professionals, eager to make a difference. You'll embrace an entrepreneurial mindset, taking ownership of your responsibilities and constantly seeking innovative solutions. With an unwavering focus on quality, robustness, and security, you'll be a driving force in implementing cutting-edge tools that enhance our operations, improve reliability, and gather valuable feedback on our platforms. Your ability to identify and mitigate common operational issues will play a crucial role in delivering seamless experiences to our customers.
If you're passionate about pushing the boundaries of technology, thrive in a collaborative environment, and are motivated by the opportunity to shape the future of reliability engineering, then we want to hear from you. Join our team and be part of a dynamic and forward-thinking organization that values innovation and excellence in everything we do.
Kyndryl has a global footprint, which means you will have opportunities to work on projects and collaborate with colleagues from around the world. This role is dynamic and influential – offering a wide range of professional and personal growth opportunities that you won't find anywhere else.
•1 year of experience in operational management, including incident management and escalations
•Experience with design and implementation of application monitoring to ensure reliability and performance meet or exceed business goals
•Experience implementing strategies to cap operations load and to handle overflow using appropriate tooling and metrics; defining service level indicators and objectives in collaboration with stakeholders, business, development, DevSecOps and Operations teams
•Solution and design experience in an enterprise environment: Windows Server, Linux server (RHEL preferred), UNIX (AIX, Solaris), storage, and hyperscaler public cloud platforms (AWS, Azure, Google Cloud Platform), as well as OpenShift
•Experience working with data formats and scripting languages such as JSON, YAML, Bash, and/or PowerShell
•BS degree in Computer Science, Engineering, or another highly technical or scientific discipline
•Expertise with Ansible, Terraform, and Python
•Experience with distributed technologies as well as dynamic resource management frameworks such as Kubernetes
•Expertise in leveraging open-source tooling such as Prometheus, Grafana, or Loki
Diversity is a whole lot more than what we look like or where we come from; it's how we think and who we are. We welcome people of all cultures, backgrounds, and experiences. Our Kyndryl Inclusion Networks are just one of many ways we create a workplace where all Kyndryls can find and provide support and advice. This dedication to welcoming everyone into our company means that Kyndryl gives you – and everyone next to you – the ability to bring your whole self to work, individually and collectively, and support the activation of our equitable culture. That's the Kyndryl Way.
With state-of-the-art resources and Fortune 100 clients, every day is an opportunity to innovate and to build new capabilities, new relationships, new processes, and new value. Kyndryl cares about your well-being and prides itself on offering benefits that give you choice, reflect the diversity of our employees, and support you and your family through the moments that matter – wherever you are in your life journey. Our employee learning programs give you access to the best learning in the industry to receive certifications, including Microsoft, Google, Amazon, Skillsoft, and many more. Through our company-wide volunteering and giving platform, you can donate, start fundraisers, volunteer, and search over 2 million non-profit organizations. At Kyndryl, we invest heavily in you. We want you to succeed so that, together, we will all succeed.