View All Jobs 153621

Senior Customer Reliability Engineer (US) - Remote Eligible

Support customers in deploying and troubleshooting Kubernetes applications in customer-managed environments
Remote
Senior
11 hours agoBe an early applicant
Replicated

Replicated

A platform that enables cloud-based applications to be deployed and managed on-premises in customers' private data centers or VPCs.

1 Similar Job at Replicated

Customer Reliability Engineering (CRE)

The Customer Reliability Engineering (CRE) team, a group of dedicated global engineers focused on helping our vendors successfully deliver and support Kubernetes applications in customer-managed environments. As a CRE, you'll be on the front lines, working directly with customers to solve complex technical challenges related to application deployment, management, and troubleshooting. You'll gain deep expertise in Kubernetes, the Replicated product suite, and the intricacies of customer-managed deployments, including scenarios where cluster installation is required. This role prioritizes exceptional support and customer success, collaborating closely with Sales and Product Engineers.

This role is perfect for you if you are passionate about problem-solving, enjoy helping people, and thrive on diving deep into technical challenges. You'll leverage your operational knowledge to build best practices and contribute to tooling that empowers both our internal teams and our vendors. This is an excellent opportunity to extend a strong foundation in Kubernetes, Linux, and the broader cloud-native ecosystem, while learning from experienced engineers on a successful, growing team.

What You'll Be Doing

Provide expert support to customers, resolving issues related to Kubernetes, Linux, and Replicated products. This includes troubleshooting failures, identifying root causes, and implementing solutions. Every day will present new and unique challenges.

Enable Customer Success: Work proactively with customers to ensure they are successfully deploying, managing, and scaling their applications using Replicated. This includes providing guidance, best practices, training, and assisting with onboarding new applications.

Collaborate with Engineering: Proactively work closely with CREs and product engineers to share customer feedback, identify product improvements, and contribute to the overall Replicated product roadmap. While this role doesn't require implementing code changes on day one, you'll be a key contributor in identifying areas for improvement, and the team regularly makes code contributions to enhance our products and tools. As you grow within the team, you'll have opportunities to develop your coding skills and contribute directly to these improvements.

Continuous Learning: Invest in your personal and professional growth. Replicated is committed to supporting your development through courses, certifications, and other learning opportunities.

To Be Successful In This Role, You Will Need To Bring

Preferably 3 or more years of professional experience in the following areas:

Experience with Linux system administration. You have the knowledge and ability to troubleshoot complex system and network issues, at an advanced level, as well as clearly explain the findings to customers.

Experience with Kubernetes and Helm. You have the knowledge and ability to diagnose complex issues with Kubernetes on bare metal, develop and troubleshoot advanced Helm charts, and guide customers in designing scalable deployment strategies.

Exceptional technical and non-technical communication and interpersonal skills. You must be able to clearly explain complex technical concepts to both technical and non-technical audiences in English.

Strong problem-solving skills, the ability to think critically, and act quickly under pressure.

A customer-centric mindset and a genuine desire to help others succeed.

Experience working remotely with teams across various time zones.

Nice To Haves

Experience with CNCF tools

Familiarity with Go and the ability to debug Go programs

Customer facing experience

Your Growth Journey At Replicated

In your first 30 days:

Immerse Yourself: Dedicate yourself to learning about Replicated - the company, the global CRE team, our products, and our customers (vendors).

Hands-on Training: Complete comprehensive hands-on training with the Replicated platform, working through a structured onboarding checklist.

Team Connections: Meet with team members across Replicated, including senior CREs, product engineers, and other departments, to build relationships and understand different perspectives.

Onboarding Improvement: As you go through the onboarding process, actively identify areas for improvement and suggest changes to make it even better for future CREs.

Active Support Participation: Begin working on real support cases from the queue, with direct oversight and guidance from senior CREs. This hands-on approach will accelerate your learning and understanding of customer issues and troubleshooting techniques.

In your first 60 days:

Deeper Support Immersion: Continue working on support cases, increasing the complexity and variety of issues you handle. Focus on understanding the "why" behind customer problems and the solutions implemented.

Process Improvement: Proactively suggest improvements to the support process, both technical (e.g., tooling, diagnostics) and procedural (e.g., communication workflows, escalation paths).

Product Knowledge Expansion: Deepen your understanding of how Replicated's products are developed, how different services interact, and how they are used in customer-managed environments.

Vendor Interaction: Begin to participate in some supervised customer interactions, gradually taking on more responsibility under the guidance of senior CREs.

Documentation Review: Review existing support documentation and training materials, identifying areas for updates or improvements.

In your first 90 days:

Independent Support: Take on full responsibility for handling support issues from the queue, working independently to diagnose, resolve, and prevent recurrence.

On-Call Rotation: Join the on-call rotation, providing 24/7 support coverage (primarily weekends due to the global team) for specific Replicated products. Remember, you're never alone - the team is always available to support you.

Customer Success Engagement: Begin actively participating in proactive customer success activities, such as assisting with onboarding new applications or providing best-practice guidance.

Feedback Loop: Become a key contributor to the feedback loop between customers and engineering, sharing insights and identifying areas for product improvement.

Continued Learning: Continue to invest in your personal and professional growth, leveraging Replicated's resources (like the curiosity budget) to expand your skills in Kubernetes, Linux, and other relevant technologies. Begin exploring opportunities to develop your Go coding skills.

At Replicated, we value our teammates as individuals who are stronger together. We offer a robust pay and benefits package that rewards employees for their contributions to our success, supports their well-being, and helps all of us create a great remote work environment.

For team members outside of the US, our salary ranges are at localized rates for the countries we support. This is dependent on several factors, including level, qualifications, and experience. We also offer stock options, as well as a unique home office allowance & a professional development budget.

In the US, the salary range for this role is as follows: Software Engineer II: $127,500 - $165,000

Sr. Software Engineer II: $149,500 - $192,500

We invest in our team and love candidates who are eager to learn and grow. We have a fantastic team of highly collaborative individuals who enjoy learning, growing, and mentoring others.

Our core values are:

Care Deeply: Care deeply about the work that you do. Because of that you are constantly learning and willing to go out on a limb, challenge assumptions, go back to first principles, etc.

Longterm: Treat every interaction as part of a 30 year relationship, you'll see everyone down the road again as customers, partners, coworkers, etc.

Curious: We're always learning and we approach everyone and every problem with curiosity. When needed we challenge assumptions, and go back to first principles.

We offer strong benefits to help you stay healthy and productive. For the US, our benefits are listed below:

  • Health/Dental/Vision
  • Life/AD&D
  • LTD/STD
  • FSA
  • 401K
  • Stock options
  • Partner perk programs
  • Generous time off, we expect you to take a minimum of 3 weeks of per year
  • Laptop+accessories you need to get set up
  • Generous home office set up allowance or co-working space allowance - up to $10,000 per year!
  • Curiosity Budget to help you keep learning and growing!

Replicated

+ Show Original Job Post
























Senior Customer Reliability Engineer (US) - Remote Eligible
Remote
Engineering
About Replicated
A platform that enables cloud-based applications to be deployed and managed on-premises in customers' private data centers or VPCs.