View All Jobs 138652

Senior Linux System Administrator - Support Engineer (with High Performance Computing Focus)

Own the HPC Linux infrastructure, designing and optimizing across on-prem, virtual, and cloud environments.
Canberra, Australian Capital Territory, Australia
Senior
18 hours agoBe an early applicant
Hewlett Packard Enterprise

Hewlett Packard Enterprise

Provides enterprise IT solutions including servers, storage, networking, cloud services, and edge computing for businesses and organizations worldwide.

64 Similar Jobs at Hewlett Packard Enterprise

Senior Linux System Administrator - Support Engineer (with High Performance Computing Focus)

Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work. We help companies connect, protect, analyze, and act on their data and applications wherever they live, from edge to cloud, so they can turn insights into outcomes at the speed required to thrive in today's complex world. Our culture thrives on finding new and better ways to accelerate what's next. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good. If you are looking to stretch and grow your career our culture will embrace you. Open up opportunities with HPE.

Job Description:

We are seeking an experienced Senior Linux System Administrator/System Support Engineer with expertise supporting High Performance Computing (HPC) environments to join our HPC support team. In this role, you will design, implement, maintain, and optimize Linux-based infrastructure—ensuring high availability, security, and performance for mission-critical systems and services, including complex HPC platforms. You will provide advanced technical support, troubleshoot challenging issues across hardware and software, and act as a trusted advisor to both internal teams and external customers. On-site presence is mandatory to deliver exceptional customer support and maintain system performance.

Key Responsibilities:

  • Deploy, configure, maintain, and troubleshoot Linux servers and HPC cluster systems (Red Hat, CentOS, Ubuntu, or others) across physical (primarily), virtual, and cloud environments.
  • Support, maintain, and optimize HPC systems, including cluster manager, operating system and network fabric installation, servicing, and advanced technical troubleshooting of hardware/software and parallel file systems (e.g., Lustre, GPFS).
  • Monitor system performance, availability, and security using industry-standard tools and practices; ensure compliance with organizational policies and external regulations.
  • Plan and execute upgrades, patches, enhancements, and migrations to ensure systems are current, secure, and optimized.
  • Automate system administration tasks using scripting languages (Bash, Python, Perl, etc.) and configuration management tools (Ansible, Puppet, Chef, Terraform).
  • Implement and maintain backup/recovery strategies, disaster recovery plans, and system documentation.
  • Collaborate with development, network, and security teams to support application deployments and troubleshoot issues, particularly in multi-technology HPC environments.
  • Provide technical consulting, mentoring, and guidance to junior team members and contribute to internal knowledge sharing.
  • Ensure compliance with strict security protocols in sensitive environments (e.g., government, research); TSPV clearance will be required.
  • Participate in on-call rotation and respond to system incidents and outages.
  • Assist with technical proposals, solution design, and enterprise-level architecture for new projects and customer engagements.

About You:

  • Bachelor's degree in Computer Science, Information Technology, or related field, or equivalent work experience.
  • At least 5 years of hands-on experience managing Linux systems in production environments, including HPC systems.
  • Expertise in Linux/Unix operating systems, parallel file systems (Lustre, GPFS), and networking technologies.
  • Proficiency in scripting/programming languages (Bash, Python, Perl, C++).
  • Experience with automation/configuration management tools (Ansible, Puppet, Chef, Terraform).
  • Strong understanding of networking concepts (TCP/IP, DNS, DHCP, firewalls, VPNs).
  • Familiarity with monitoring/logging tools (Nagios, Grafana, ELK Stack).
  • Experience with containerization technologies (Docker, Kubernetes).
  • Excellent problem-solving, analytical, and communication skills; able to diagnose complex technical problems to root cause.
  • Demonstrated ability to work independently in multi-technology environments and collaborate across teams.
  • Relevant certifications (RHCE, LFCS, AWS Certified SysOps Administrator, etc.) are a plus.
  • TSPV Government Security clearance (mandatory).

What We Can Offer You:

Health & Wellbeing

We strive to provide our team members and their loved ones with a comprehensive suite of benefits that supports their physical, financial and emotional wellbeing.

Personal & Professional Development

We also invest in your career because the better you are, the better we all are. We have specific programs catered to helping you reach any career goals you have — whether you want to become a knowledge expert in your field or apply your skills to another division.

Unconditional Inclusion

We are unconditionally inclusive in the way we work and celebrate individual uniqueness. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good.

+ Show Original Job Post
























Senior Linux System Administrator - Support Engineer (with High Performance Computing Focus)
Canberra, Australian Capital Territory, Australia
Support
About Hewlett Packard Enterprise
Provides enterprise IT solutions including servers, storage, networking, cloud services, and edge computing for businesses and organizations worldwide.