View All Jobs 158939

Site Reliability Engineer

Develop cross-platform observability tools to improve system reliability and incident response
Barcelona
Senior
8 hours agoBe an early applicant
Pay Retailers

Pay Retailers

A payment platform offering a variety of online payment solutions across Latin America.

1 Similar Job at Pay Retailers

Database Administrator

We're PayRetailers, and we offer cutting-edge payment solutions that empower businesses to succeed in Latin America & Africa. Our collaborative and inclusive work environment encourages creativity and growth, where every employee's contribution is valued. We've got big plans to expand into new markets and make a meaningful impact on the world of payments. To help us get there, our Technology team is on the lookout for a new Database Administrator.

About the Role

Site Reliability Engineers are the guardians of our reliability promise. They deliver a highly reliable, resilient, and cost-efficient platform that consistently meets business and customer expectations for availability and performance.

Your Responsibilities

  • Increase automation of operational activities to reduce downtime risk, in collaboration with Platform Engineering and Domain Squads.
  • Drive systemic improvements across engineering teams based on incident RCAs and telemetry insights.
  • Implement non-functional improvements (resilience, performance, reliability) directly in code, with Domain Squads reviewing and approving changes.
  • Promote adoption of SRE best practices across development teams (integration patterns, monitoring, alerting, real-time tracing).
  • Provide cross-platform observability capabilities above and beyond what the Domain Squads provide. Investigate issues and incidents and propose/implement changes as deemed necessary.
  • Continuously review logs, metrics, and alerts to identify and/or implement continuous improvements.
  • Design non-functional test and continuously run them to ensure that we build quality up to and including production.

About You

  • Proactive attitude, always on the lookout for improvement opportunities.
  • Expert knowledge of Grafana, Application Insights, OpenTelemetry, Prometheus.
  • Experience with non-functional and production testing.
  • Analytical mindset, being able to connect the dots and establish cause and effect.
  • Software engineering skills in any of .NET 8 (C#), Go, Java, TypeScript.
  • Expert experience with containers and container orchestration platforms.
  • Understanding of APIs and asynchronous distributed software architectures.
  • Working knowledge of AI-enabled tools like VS Code, Claude Code, etc.
  • Demonstrable experience with applying AI to Site Reliability Engineering.
  • Knowledge with process automation tools like N8N.
  • Working experience with chaos engineering.

What Do We Offer?

  • Hybrid working model: 2 days from home.
  • 26 vacation days a year
  • Language classes & professional courses
  • Free catering & snacks in the office
  • Private health insurance
  • Afternoon off on your birthday

If you're passionate about tech, innovation, and want to thrive in an environment that values collaboration and diversity, this role might be the perfect fit for you! Apply today and help us shape the future of the PayTech industry!

+ Show Original Job Post
























Site Reliability Engineer
Barcelona
Engineering
About Pay Retailers
A payment platform offering a variety of online payment solutions across Latin America.