Site Reliability Engineer
Working at Infobip means being part of something truly global. With 75+ offices across six continents, we're not just building technology — we're shaping how more than 80% of the world connects and communicates.
As employees, we take pride in contributing to the world's largest and only full-stack cloud communication platform. But it's not just what we do, it's how we do it: with curiosity, passion, and a whole lot of collaboration.
If you're looking for meaningful work and challenges that grow you in a culture where people show up with purpose, this is your opportunity.
Let's build what's next, together.
Why is this position important at Infobip?
We are looking for engineers who enjoy solving problems, have a passion for quality, and are data-driven and analytical. The role we are hiring for is Site Reliability Engineer, whose primary focus is on driving reliability initiatives and promoting reliability practices across the organization.
SREs are:
1. Owners of Incident Management and its lifecycle
- Working on process alignment with the rest of the company
- Working on streamlining and process automation
- Driving process adoption and monitoring adoption
- Creating objective and actionable incident reporting (monthly, quarterly, yearly incident reports for management; on-demand product reliability reports)
2. Advisors/Experts on reliability topics (advocating platform reliability)
- Providing onboarding and education on reliability topics and on how to improve reliability
- Driving reliability community/mindset
- Promoting a blameless incident culture
- Identifying risks and promoting a systematic approach to problems and long-term solutions
3. Helping teams define, develop, monitor and maintain SLOs/SLIs for their products
4. Providing a client-centric point of view on incidents and products
5. Shortening incident response times (detect, engage, fix) by helping with troubleshooting and improvements
6. Developing internal tooling and automation to reduce toil and speed up incident response
7. Providing objective quality insights
- Raising awareness of areas of improvement based on historical data
- Providing actionable insights based on quality trends and metrics
- Providing guidance on outliers that fall outside the expected baseline
What will the main responsibilities be?
- Discovering problems, and defining and solving tasks under the guidance of more senior engineers
- Designing and implementing automation and tooling (scripts, services, dashboards) to improve incident detection and response, reduce manual work (toil), and provide better reliability insights
- Overseeing incidents in the production environment and helping others with complex incident response using incident response strategies
- Troubleshooting platform-wide problems: understanding the big picture and guiding teams towards resolution of reliability-related problems
- Collaborating with product and development teams to integrate reliability improvements into code and architecture
- Actively investing in learning about the development process, technologies, system architecture, platform, products, and client segmentation in the Infobip context
- Actively participating in incident reviews and learning from incidents
- Communicating about problems and solutions at the right level of abstraction for the audience (people with different backgrounds take part in the incident management process)
- Sharing problem-solving knowledge with the team/requirement area
More about you:
- High school, Bachelor's, or Master's degree
- 3+ years in positions such as Software Engineer/Backend Engineer/Full Stack Engineer, or DevOps/SRE with strong scripting/programming experience
Deep technical knowledge in:
- Programming/scripting: solid experience in at least one of Java, Go, Python, Bash/PowerShell, or similar
- Plus familiarity with some of the following:
- Monitoring, logging and alerting tooling
- Network and Linux fundamentals
- System/architectural design and distributed systems
- Databases and querying languages (SQL/NoSQL)
Proficient in English, with a good understanding of the development process, risk analysis, and problem solving
Focus on clients, strong teamwork skills, curiosity and eagerness to learn, technical skills, execution efficiency, continuous improvement mindset, and great analytical and communication skills
When you become a part of Infobip you can expect:
- Awesome clients – We serve and partner with the majority of the leading mobile operators, OTTs, brands, banks, social networks, aggregators and many more. Seriously, our clients are really cool. Work with the world's leading companies and impact how they communicate with their users!
- Opportunity knocks. Often. – Being a part of a growing company in a growing industry – we challenge you not to grow! Whether it's horizontal, vertical, or angular, we want to support the path that you want to carve.
- Learn as you grow – From a fantastic onboarding program to internal education, educational resources, e-learning, and external education, we invest heavily in employee learning and development.
- Connect globally – Work with people from all over the world. We put the "global" in globalization.
- Pay & Perks – Competitive salary, a team taking care of all the equipment you need, free lunch in the office, ESOP, team building and other organized activities
Diversity drives connection. Infobip is built on diverse backgrounds, perspectives, and talents. We're proud to be an equal-opportunity employer and are committed to fostering an inclusive workplace.
No matter your race, gender, age, background, or identity — if you have the passion and skills to thrive, there's a place for you here.
All qualified applicants will receive consideration for employment without regard to race, color, ancestry, religion, age, sex, sexual orientation, gender, gender identity, national origin, citizenship, disability, veteran status or any other part of one's identity.