View All Jobs 118380

Data Center Network Deployment Engineer

Build and deploy scalable AI data center networking infrastructure for HPC clusters
Yokneam Ilit, North District, Israel
1 week ago
NVIDIA

NVIDIA

Designs advanced GPUs, AI computing platforms, and related technologies powering graphics, data centers, autonomous machines, and high-performance computing.

Data Center Network Deployment Engineer

NVIDIA is looking for a Data Center Network Deployment Engineer to join the Networking Clusters Solutions HPC/AI Infrastructure team. We are building supercomputers and AI clusters based on groundbreaking technologies. We are looking for a network/system engineer to be a key player to the most exciting computing hardware and software to contribute to the latest breakthroughs in artificial intelligence and GPU computing.

You will work with the latest accelerated computing and deep learning software and hardware platforms, and with many scientific researchers, developers, and customers to craft improved workflows and develop new, leading differentiated solutions. You will interact with HPC, OS, GPU compute, and systems specialist to architect, develop and bring up large scale performance platforms. Does this sound like you? If so, we would love to hear from you!

What You'll Be Doing:

  • Deploy, manage and maintain large scale AI Data Centers - control, network and storage stack
  • Work with multiple software and hardware teams to optimize the clusters networking health and performance
  • Develop and implement automation scripts for network, compute and storage operations and deployments
  • Supporting Research & Development activities and engaging in POCs/POVs for future improvements

What We Need To See:

  • B.Sc. in Engineering or CCNP certificate
  • 3+ years of proficiency in networking fundamentals, configuring ethernet switches, understanding the TCP/IP stack, and data center architecture.
  • Excellent knowledge of Windows and Linux (Redhat/CentOS and Ubuntu) networking (sockets, firewalls, iptables, wireshark, etc.) and internals, ACLs and OS level security protection and common protocols e.g. TCP, DHCP, DNS, etc.
  • Proactive individual with the ability to work independently, prioritizing tasks to optimize technology and enhance customer experience.
  • Provides ad-hoc knowledge transfers, develops handover materials, and offers deployment support for engagements.

Ways To Stand Out From The Crowd:

  • Combination of interpersonal skills and technical competence
  • Knowledge of HPC and AI solution technologies from CPUs and GPUs to high speed interconnects and supporting software
  • Experience with multiple storage solutions such as Lustre, GPFS, and newer and emerging storage technologies.
  • Automation tooling background (Ansible, Salt, Puppet etc.).

NVIDIA is widely considered to be one of the technology world's most desirable employers! We have some of the most forward-thinking and hardworking individuals in the world working for us. If you're creative and autonomous, we want to hear from you!

+ Show Original Job Post
























Data Center Network Deployment Engineer
Yokneam Ilit, North District, Israel
Engineering
About NVIDIA
Designs advanced GPUs, AI computing platforms, and related technologies powering graphics, data centers, autonomous machines, and high-performance computing.