Location: Springfield, VA
Eligibility: Candidate must possess an active TS/SCI clearance
Job Description:
- Administration, management, and troubleshooting for physical and virtual platforms
- Proactive monitoring concepts, including experience configuring and deploying Network and systems monitoring (i.e. SNMP, Nagios, Splunk, SolarWinds, etc.)
- Python scripting
- Performing trend analysis on overall system health, performance, and capacity management with regard to utilization and growth
- Develop and maintain capacity metric
- Performs software upgrades, patch installs, firmware upgrades then test for functionality on a periodic basis
- Performs: Fault tolerance, High availability, Scalability and Capacity planning, Data center organization, Backup / Recovery
- Creates shell and Perl scripts in various shells to automate daily and periodic tasks
- Maintain server configuration baselines and configuration compliance against baseline/benchmarks
- Collaborate with Application Teams to perform system maintenance and patch management tasks
- Document work for leadership, update/create Standard Operating Procedures, and brief staff and customers various tasks
- Interfaces with other engineering teams to adapt performance management tool capabilities to meet operational requirements
- Assists with analysis using enterprise tool solutions and other tools to detect and respond to IT events, incidents, and outages
- Performing systems hardening to DoD Standards
- Apply vendor patches and new designs to keep products up-to-date and meet security requirements
- Work with other Service Providers to support areas of common interest
- Working with software and hardware vendors to resolve issues and share requirements
- Assume other duties/projects as they arise and be responsive to the needs of the department