MLOps/Platform Engineer

Build a Kubernetes-based GPU infrastructure to support AI research and production workflows
Istanbul
Senior
Posted 16 hours ago
Codeway

Codeway is a technology company specializing in software development and digital transformation services.

MLOps/Platform Engineer

We are seeking an MLOps / Platform Engineer to design, build, and scale our GPU-enabled AI/ML infrastructure. This role will empower our AI researchers, data engineers, and product teams by providing them with robust, Kubernetes-based platforms for containerized workloads.

The ideal candidate has a platform engineering mindset—focused on enabling teams through self-service tooling, automation, and reliability. You will take ownership of our GPU infrastructure, ensuring workloads scale efficiently, pipelines run smoothly, and deployments are reliable across environments.

What You'll Be Doing

  • Design, build, and operate Kubernetes-based GPU infrastructure for AI/ML research and production workloads.
  • Provide self-service deployment capabilities to AI researchers, data engineers, and product teams.
  • Implement and manage continuous delivery pipelines using Kubernetes-native deployment tools (e.g., ArgoCD).
  • Configure and optimize GPU workloads (CUDA, drivers, device plugins, and scheduling strategies).
  • Implement autoscaling strategies for both CPU and GPU workloads on Kubernetes clusters.
  • Partner with data teams to ensure platforms are tailored to data pipelines, training workflows, and ML model lifecycle needs.
  • Contribute to the adoption of ML orchestration and experiment tracking platforms (e.g., Kubeflow, MLflow, ClearML, Anyscale).
  • Support notebook environments for experimentation (e.g., JupyterHub).
  • Act as a platform engineering lead for AI/ML workloads, championing best practices, reliability, and cost efficiency.

What You'll Bring

  • Strong hands-on experience with Kubernetes (designing, operating, and troubleshooting clusters).
  • Proven experience deploying workloads with ArgoCD or similar GitOps tools.
  • Exposure to ML platforms such as Kubeflow, MLflow, ClearML, or Anyscale.
  • Prior experience working with data engineering teams or ML model lifecycle management.
  • Experience enabling notebook-based workflows (JupyterHub, SageMaker Studio, etc.).
  • Knowledge of cloud platforms (AWS, GCP, or Azure) and their GPU offerings.
  • Strong collaboration and communication skills—able to partner effectively with researchers, engineers, and product teams.

Nice To Haves

  • Hands-on experience with GPU configuration, CUDA, or NVIDIA ecosystem tools (NCCL, Triton Inference Server).
  • Knowledge of autoscaling patterns for AI/ML workloads on Kubernetes (KEDA, HPA, VPA, custom controllers).
  • Experience with observability and monitoring tools (Prometheus, Grafana, Loki, ELK, etc.).
  • Solid understanding of containerization (Docker, OCI) and CI/CD pipelines.
  • Familiarity with infrastructure as code (Terraform, Helm, Kustomize).
  • Background in platform engineering, SRE, or DevOps with a focus on enabling other teams.

What's In It For You?

Great Place to Work. As our top team member, you'll be a part of a fast-growing startup and have the privilege to enjoy our accredited "BEST WORKPLACES" environment, ranked first among all workplaces.

A Competitive Compensation Package. Long story short, we take care of you.

Meal Compensation. An allowance that is actually enough for a decent & nutritious lunch; we're not following the industry standard.

Full Health Benefits. To keep you away from all the trouble, we provide unlimited private health insurance and cover the HPV vaccine.

Pet Adoption Support. We cover the primary healthcare expenses, parasite vaccinations, and microchip costs that may arise within the first year after employees adopt a pet.

Cool Tech Stack. MacBook, iPhone 15 Pro, Magic Mouse, Magic Keyboard; an adjustable desk with a 4K screen and any other gadget you may need for your job.

Sport Activities Support. We care about your physical wellbeing and support your gym membership.

Learning Never Ends! Training budget to help you grow in your role, gain new skills, and learn new things!

Udemy Free Pass. An unlimited Udemy subscription.

Flexible Schedule & Unlimited Vacations. This isn't a "clock in, clock out" company. We care about your productivity, not tracking every minute you're on site. It's up to you to always be responsible with your work, no matter where you are or what schedule you're keeping.

English Course Support. Be more global and perform best at your work.

Comprehensive Well-being Support. Through our partnership with Meditopia, we now offer free psychological counseling, along with additional services like dietitian, personal trainer, parenting consultant, physiotherapist, child & adolescent therapist, and face-to-face counseling for employees in Istanbul.

A Top-Notch Office. Located at the heart of Istanbul, right next to Levent Metro Station at Ferko Signature.

Codebrew. Yes, you've heard it right: We love coffee so much that we've built our own coffee shop inside our office, where we also provide healthy snacks at certain hours.

Free Breakfast. At Codebrew - every morning at 9:30 AM.

No Dress Code. Dress as you like.

Happy Hours Every Friday. Every. Single. Friday. When it's 5 PM: Screens off, party on! (Check out Codeway on Instagram)

Dream Team. The average Codeway member is young, talented, and passionate, which makes our working environment extremely dynamic. Need proof? Take a look at our Instagram.

Gaming Area. Whenever you need to take a break from hard work and relax with your favorite games, our PS5 corner will be waiting for you.

Software Support. Subscription to any software you might need to perform at your best.

Public Transportation Support. Daily public transportation support is on us, covered by an additional monthly compensation.

Parent Support. Becoming a parent is a big moment, and Codeway celebrates it with you. We offer a one-time grant to support our new parents.

Massage Room. When you need a moment to recharge and unwind, our massage chair room will be ready to offer you a relaxing break.

The Recruiting Process

We are committed to keeping our recruitment process short and transparent. Here's what it looks like:

  1. Application: Send us your CV or LinkedIn profile. You can also just write a few words about yourself.
  2. Case Study: We might send you a task to solve.
  3. Talent & Culture Interview: Let's talk about your experience & expectations and see what we can achieve together.
  4. Technical Interview: You will meet with your future team lead to go over the case and your technical capabilities in detail.
  5. Reference Check: We will contact your references to gain insights into your past experiences and work ethic.
  6. Welcome Aboard! You are now a part of the team.