View All Jobs 114391

Cloud Infrastructure Engineer (kubernetes, Redhat Openshift)

Design and implement scalable GPU-accelerated Kubernetes infrastructure for language models
Singapore
Mid-Level
1 month ago
Assurity Trusted Solutions

Assurity Trusted Solutions

A Singapore-based company providing digital security services such as secure digital identities and authentication for individuals and businesses.

34 Similar Jobs at Assurity Trusted Solutions

Cloud Infrastructure Engineer

Assurity Trusted Solutions (ATS) is a wholly owned subsidiary of the Government Technology Agency (GovTech). As a Trusted Partner over the last decade, ATS offers a comprehensive suite of products and services ranging from infrastructure and operational services, authentication services, governance and assurance services as well as managed processes. In a dynamic digital and cyber landscape, where trust & collaboration are key, ATS continues to drive mutually beneficial business outcomes through collaboration with GovTech, government agencies and commercial partners to mitigate cyber risks and bolster security postures.

We are looking for a Cloud Infrastructure Engineer (Kubernetes, Redhat Openshift) to join us! This will be on a 2 year contract (subjected to extension/rolling).

You will be working on:

  • Design, deploy, and optimize Kubernetes clusters using the Nvidia software stack to support large language model applications.
  • Collaborate with cross-functional teams to integrate Nvidia GPU resources effectively within Kubernetes environments, ensuring optimal performance.
  • Implement and manage infrastructure as code (IaC) for Nvidia GPU configurations, focusing on scalability and high availability.
  • Monitor, troubleshoot, and resolve issues related to both Kubernetes clusters and Nvidia GPU resources to maintain a reliable and performant infrastructure.
  • Stay abreast of industry best practices and emerging technologies related to Kubernetes and the Nvidia GPU ecosystem.
  • Work closely with development teams to automate deployment processes, leveraging Nvidia GPU capabilities, and streamline workflows.
  • Implement security best practices to safeguard Kubernetes environments, Nvidia GPU resources, and sensitive data.
  • Participate in on-call rotation and provide timely response to incidents, minimizing downtime for language model applications.
  • Contribute to capacity planning and performance tuning activities, considering the demands of large-scale language model applications utilizing Nvidia GPU acceleration.
  • Document infrastructure configurations, processes, and procedures, facilitating knowledge sharing and team member onboarding.
+ Show Original Job Post
























Cloud Infrastructure Engineer (kubernetes, Redhat Openshift)
Singapore
Engineering
About Assurity Trusted Solutions
A Singapore-based company providing digital security services such as secure digital identities and authentication for individuals and businesses.