View All Jobs 124604

Linux Systems Engineer

Manage and optimize high-volume satellite data processing on HPC Linux systems
Monterey, California, United States
Senior
yesterday
Federated IT

Federated IT

A provider of customized IT and cybersecurity solutions to government and commercial clients.

1 Similar Job at Federated IT

Hpc Linux System Administrator

Federated IT seeks a highly qualified HPC Linux System Administrator to conduct Satellite Data Management Support in support of the Fleet Numerical Meteorology and Oceanography Center (FNMOC) in Monterey, California. The objective of this position is to provide system administration support for all aspects of the FNMOC systems that support high-volume processing of satellite data. The position requires the ability to integrate new types of satellite-related data flows (various formats, file sizes, addressing storage space issues, and file quantities) and to ensure down-stream applications can handle and manipulate the data. The performer shall provide maintenance and support to multicore HPC and virtualized and hardware-based infrastructure (typically running mail, Domain Name System (DNS), an identity management system (Red Hat Identity Management System [IdM] or Light Weight Directory Protocol [LDAP]), and data routing) systems, to include all associated subsystems. Support also extends to supporting FNMOC's backup processes. The performer shall participate in the investigation of relevant systems/subsystems performance including interconnecting networks, storage, patch management, the data flow on the networks, the performance of computers and the overall flow and handling of information across the hardware and software enterprise. Work will consist of the effort required to support the operations and maintenance of HPC systems at FNMOC and shall include operational testing, validation, and documentation necessary for transition to operations. Work will extend to Cyber Security compliance and ensuring system backups occur correctly and on a regular basis. The performer shall be familiar with the High Performance Computing techniques, Linux Systems, system provisioning, Cyber Security Compliance, and be capable of addressing general system administration issues including addressing capacity issues that may affect application and processing performance. The performer shall participate in the investigation of relevant systems/subsystems including interconnecting networks, the data flow on the networks, the performance of computers and the overall flow and handling of information across the hardware and software enterprise.

Essential Duties and Responsibilities:

  1. All work will be performed on site (remote support is not an option)
  2. SYSTEM INTEGRATION & SUPPORT
    • Provide end-to-end system integration and provide recommendations to support the performance of systems and their associated subsystems in support of HPC, satellite, and related government requirements. The contractor shall participate in the investigation of relevant systems/subsystems including interconnecting networks, the data flow on the networks, the performance of computers and the overall flow and handling of information across the hardware and software enterprise. Infrastructure support extends to identity management (LDAP/IdM) and node-provisioning via Red Hat Satellite Server.
    • Performance Standard: Provide system integration for systems and sub-systems as they relate to management of satellite data.
    • Assessment Method: Direct observation and review of system integration contributions for relevant area.
  3. ARCHITECTURE GUIDANCE
    • Provide technical assistance to HPC Linux customer base regarding system/code/performance issues. This work may require problem resolution or working with an escalation team for problem resolution.
    • Performance Standard: Provide architectural guidance for systems and subsystems as they relate to management of satellite data.
    • Assessment Method: Direct observation and review of architectural contributions for relevant area.
  4. NETWORK ATTACHED STORAGE SUPPORT
    • Provide system administration support to Network Attached Storage [NAS] (currently NetApp systems) to ensure optimal NAS file system support for alpha/beta/operations systems for NIPR/SIPR environments. Resolve issues related to file system performance with subject matter expertise.
    • Performance Standard: Ability to support standard NAS performance issues; routine file system mounting support; and hardware support (e.g. disk drive replacement).
    • Assessment Method: Direct observation and review of supported systems.
  5. BACKUP SUPPORT
    • Provide daily backup support to NIPR/SIPR backup systems (currently managed by the IBM Spectrum Protect [a.k.a. Tivoli] product). Ensure regular configuration and verification of backups by management of backup policies to address system and user file backups. Support maintenance of backups, backup-media rotation, and support hardware/firmware/software upgrades.
    • Performance Standard: Ability to keep backup systems are functional and cyber compliant, revision current, and policies are correct allowing recoverable data.
    • Assessment Method: Inspection of backup processes and SW/firmware versions based on Cyber Security requirements.
  6. DISA STIG REQUIREMENTS
    • Ensure work performed under this task order complies with applicable Defense Information Systems Agency (DISA) Application Security and Development Security Technical Implementation Guides (STIG) and/or Application Services STIG requirements.
    • Performance Standard: Work shall conform to DISA guidelines.
    • Assessment Method: Observation of work performed.
  7. HPC ENGINEERING SUPPORT
    • Provide related engineering support to implement the NIPR/SIPR A2 HPC/Infrastructure systems at FNMOC. This support shall include operational testing, validation, and documentation necessary for transition to FNMOC IT environment.
    • Performance Standard: Ability to provide meaningful engineering support for the complexity of the utilized HPC systems...
    • Assessment Method: Technical review of provided engineering input.
  8. TECHNICAL INPUT TO PRODUCTION RELATED MEETINGS
    • Participate in production-related technical meetings and provide input to plans, schedules, documents, and engineering tasks as required facilitating system upgrade efforts.
    • Performance Standard: Sound engineering and technical input to production related meetings.
    • Assessment Method: Observation of ability to provide meaningful and useful input to technical meetings.

Required Qualifications, Education, and Experience:

  • Active TOP SECRET Clearance
  • DoD 8570 IAT/IAM II (e.g., Security +)
  • Experience with supporting Linux systems and subsystems, including storage and backup solutions, in an operational environment
  • Experience with provisioning Linux systems and performing operational testing
  • Experience with securing Linux systems using tools, including STIGS and vulnerability scanners
  • Knowledge of project management processes and documentation
  • Ability to draft technical documentation targeted at various reader levels, including users, operators, and system analysts

The Successful Candidate will Possess:

  • Prospective candidates should have strong risk management skills, excellent communication, teamwork, and conflict management skills.
  • The candidate must be analytical and effectively able to prioritize needs, requirements, and other issues.
  • Ability to communicate and interact effectively at all levels of staff and management.
  • Ability to exercise independent judgment, develop relationships, and obtain consensus among interested parties.
  • Critical thinker with strong technical skills, diagnostic skills and problem-solving ability
  • Solid written and verbal communication skills to negotiate direction, drive projects to successful conclusion and deliver knowledge to team members verbally and via clear designs, runbooks and technical engineering and exchange sessions
  • Self-starter, flexible, adaptable, collaborative and motivated to champion continuous improvement
  • Ability to develop peer networks across an enterprise to maintain technology awareness and to support resolution of problems
  • Ability to operate across traditional technical boundaries, comfortable working in the compute space as well as the storage space in an operational capacity
  • Technically curious and driven to learn new skills.

General Factors:

  • Depending on project requirements, may be required to work within a compressed schedule; overtime should be expected when schedules demand it.
  • Willing to travel, if needed.
  • No Relocation.
+ Show Original Job Post
























Linux Systems Engineer
Monterey, California, United States
Engineering
About Federated IT
A provider of customized IT and cybersecurity solutions to government and commercial clients.