Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. At Goldman Sachs, SRE is responsible for the availability and reliability of the firm's most critical platform services, and ensures they meet the requirements of our internal and external users. We look for engineers who are motivated to collaborate with our businesses to build and run sustainable production systems that can evolve and adapt to changes in our fast-paced, global business environment.
As an SRE Logging Engineer, you will work with customers, product owners, and SREs to design and develop a large-scale application to process, store, and read large volumes of log events. You will run a production environment spanning AWS, GCP, and on-prem datacentres.
Basic Qualifications:
Preferred Experience: