View All Jobs 158939

Site Reliability Engineer - Remote Eligible

Automate operational activities to improve platform resilience and reduce downtime
Sofia, Bulgaria
Senior
8 hours agoBe an early applicant
Pay Retailers

Pay Retailers

A payment platform offering a variety of online payment solutions across Latin America.

1 Similar Job at Pay Retailers

Database Administrator

We're PayRetailers, a company offering cutting-edge payment solutions that empower businesses to succeed in Latin America & Africa. Our collaborative and inclusive work environment encourages creativity and growth, where every employee's contribution is valued. We've got big plans to expand into new markets and make a significant impact on the world of payments. To help us get there, our Technology team is looking for a new Database Administrator.

Responsibilities

  • Increase automation of operational activities to reduce downtime risk, in collaboration with Platform Engineering and Domain Squads.
  • Drive systemic improvements across engineering teams based on incident RCAs and telemetry insights.
  • Implement non-functional improvements (resilience, performance, reliability) directly in code, with Domain Squads reviewing and approving changes.
  • Promote adoption of SRE best practices across development teams (integration patterns, monitoring, alerting, real-time tracing).
  • Provide cross-platform observability capabilities above and beyond what the Domain Squads provide. Investigate issues and incidents and propose/implement changes as deemed necessary.
  • Continuously review logs, metrics, and alerts to identify and/or implement continuous improvements.
  • Design non-functional test and continuously run them to ensure that we build quality up to and including production.

About You

  • Proactive attitude, always on the lookout for improvement opportunities.
  • Expert knowledge of Grafana, Application Insights, OpenTelemetry, Prometheus.
  • Experience with non-functional and production testing.
  • Analytical mindset, being able to connect the dots and establish cause and effect.
  • Software engineering skills in any of .NET 8 (C#), Go, Java, TypeScript.
  • Expert experience with containers and container orchestration platforms.
  • Understanding of APIs and asynchronous distributed software architectures.
  • Working knowledge of AI-enabled tools like VS Code, Claude Code, etc.
  • Demonstrable experience with applying AI to Site Reliability Engineering.
  • Knowledge with process automation tools like N8N.
  • Working experience with chaos engineering.

What We Offer

  • Fully remote - but we have offices in Sofia City and open for whenever you need!
  • 26 vacation days a year
  • Language classes & professional courses
  • Free catering & snacks in the office
  • Private health insurance
  • Gym pass.
  • Afternoon off on your birthday

If you're passionate about tech, innovation, and want to thrive in an environment that values collaboration and diversity, this role might be the perfect fit for you! Apply today and help us shape the future of the PayTech industry!

+ Show Original Job Post
























Site Reliability Engineer - Remote Eligible
Sofia, Bulgaria
Engineering
About Pay Retailers
A payment platform offering a variety of online payment solutions across Latin America.