Job location: Springfield, VA
High Performance Computing (HPC) Engineer
Strong experience working on Linux systems
Experience with building and deploying containerized, GPU-enabled applications in Docker, Singularity, or Kubernetes
Experience with orchestration and cluster management tools, including Slurm, Mesos, or Moab
Experience with AI and Machine Learning Development Tool Sets, including Jupyter, Keras, TensorFlow, MPI, OpenMP, OpenCL, or CUDA
Lustre and InfiniBand maintenance and troubleshooting. InfiniBand/fibre/network plumbing, configuration, and maintenance
Experience with deploying systems in both on-premises and cloud environments, including AWS, Azure, or Google
Server hardware maintenance and troubleshooting
Creating and maintaining system documentation
RHEL and CentOS administration, and ACE cluster administration for HPC clusters
Experience with supporting environments for massively parallel computation
Experience with certification and accreditation of containers
Experience with programming and implementing scientific and physics modeling and simulation (M&S) algorithms, Big Data, and Data Science
Experience with optimizing applications to use AI and ML toolsets