View All Jobs 169001

System Reliability Engineer - Remote Eligible

Lead the migration of Redis services to Kafka-based architecture for scalability
Remote
Senior
yesterday
Zipdev

Zipdev

A tech firm specializing in building remote development teams for businesses seeking software engineering and design talent.

System Reliability Engineer

We're looking for a passionate and experienced System Reliability Engineer to play a key role in designing, implementing, and maintaining our evolving cloud-native platform. You’ll be instrumental in shaping our reliability practices, automating operational tasks, and driving continuous improvement across our systems. This is an exciting time to join us as we embark on significant refactoring efforts and continue to leverage cutting-edge technologies.

What You'll Do:

  1. Design, build, and maintain highly available, scalable, and resilient systems on Google Cloud Platform (GCP).
  2. Proactively monitor system health, performance, and capacity, identifying and resolving issues before they impact users.
  3. Develop and implement automation for infrastructure provisioning, deployment, and operational tasks (e.g., CI/CD pipelines, disaster recovery).
  4. Collaborate with development teams to ensure new features are designed and implemented with reliability and operational excellence in mind.
  5. Manage and optimize our MongoDB Atlas instances, ensuring data integrity, performance, and security.
  6. Lead the refactoring effort of our Redis services to a more scalable and resilient Pub/Sub or Kafka-based architecture.
  7. Participate in on-call rotations and incident response, conducting thorough post-mortems and implementing preventative measures.
  8. Contribute to the development of best practices, runbooks, and documentation for system operations.
  9. Identify and implement opportunities for cost optimization without compromising reliability.
+ Show Original Job Post
























System Reliability Engineer - Remote Eligible
Remote
Engineering
About Zipdev
A tech firm specializing in building remote development teams for businesses seeking software engineering and design talent.