View All Jobs 126878

Site Reliability Engineer (SRE) - CA

Design and implement end-to-end observability across enterprise systems to improve reliability.
Concord, California, United States
Mid-Level
$79 – 85 USD / hour
23 hours agoBe an early applicant
Apex Systems

Apex Systems

Provides IT staffing, consulting, and workforce solutions, connecting organizations with technology talent and project-based services.

74 Similar Jobs at Apex Systems

Software Engineer 4 / Site Reliability Engineer (SRE)

Client: Financial Services

Location: Concord, CA – Hybrid (3 days onsite; Mon & Tues preferred)

Contract Length: 12 months (possible extension or conversion)

Pay Rate: $79 - $85

Top Requirements

  1. 5+ years of experience with observability and monitoring tools (Grafana, Splunk, ThousandEyes, AppDynamics)
  2. Experience with Kubernetes/OpenShift (OCP) and containerized environments
  3. Strong understanding of databases (Postgres, MySQL) and system monitoring/analysis

Plusses

  1. Experience with object storage (S3, NAS)
  2. Ability to analyze and monitor network traffic end-to-end
  3. Experience building monitoring strategies and alerting frameworks
  4. Exposure to Skan.AI or similar third-party platforms
  5. Experience in enterprise-grade environments with governance and reliability standards

Job Summary

In this contingent resource assignment, you may: Consult on complex initiatives with broad impact and large-scale planning for Software Engineering. Review and analyze complex multi-faceted, larger scale or longer-term Software Engineering challenges that require in-depth evaluation of multiple factors including intangibles or unprecedented factors. Contribute to the resolution of complex and multi-faceted situations requiring solid understanding of the function, policies, procedures, and compliance requirements that meet deliverables. Strategically collaborate and consult with client personnel.

Day-to-Day Responsibilities

  • Design and implement end-to-end observability and monitoring strategies for enterprise systems
  • Build dashboards, alerts, and monitoring solutions using tools like Grafana, Splunk, and AppDynamics
  • Monitor and analyze system performance, latency, and data flow across platforms
  • Identify bottlenecks, thresholds, and performance issues across distributed systems
  • Work with Kubernetes/OpenShift environments to monitor containerized applications
  • Analyze network traffic and collaborate with networking teams to improve visibility
  • Monitor and support databases (Postgres, MySQL) and storage systems (S3, NAS)
  • Integrate third-party systems (e.g., Skan.AI) into enterprise monitoring frameworks
  • Ensure reliability, availability, and performance of production systems
  • Collaborate with global teams (including India) to troubleshoot and resolve issues
  • Recommend and implement improvements to enhance system resilience and operational efficiency

EEO Employer

Apex Systems is an equal opportunity employer. We do not discriminate or allow discrimination on the basis of race, color, religion, creed, sex (including pregnancy, childbirth, breastfeeding, or related medical conditions), age, sexual orientation, gender identity, national origin, ancestry, citizenship, genetic information, registered domestic partner status, marital status, disability, status as a crime victim, protected veteran status, political affiliation, union membership, or any other characteristic protected by law. Apex will consider qualified applicants with criminal histories in a manner consistent with the requirements of applicable law. If you have visited our website in search of information on employment opportunities or to apply for a position, and you require an accommodation in using our website for a search or application, please contact our Employee Services Department at employeeservices@apexsystems.com or 844-463-6178.

+ Show Original Job Post
























Site Reliability Engineer (SRE) - CA
Concord, California, United States
$79 – 85 USD / hour
Engineering
About Apex Systems
Provides IT staffing, consulting, and workforce solutions, connecting organizations with technology talent and project-based services.