Senior Software Engineer
Wells Fargo is seeking a Senior Software Engineer to join our Platform Services team. This role supports the operational stability of GenAI platforms used across the enterprise. You'll contribute to triaging issues, maintaining observability and support tooling, and partnering with infrastructure and cloud teams to ensure continuity of service.
In this role, you will:
- Lead moderately complex initiatives and deliverables within technical domain environments
- Contribute to large scale planning of strategies
- Design, code, test, debug, and document for projects and programs associated with technology domain, including upgrades and deployments
- Review moderately complex technical challenges that require an in-depth evaluation of technologies and procedures
- Resolve moderately complex issues and lead a team to meet existing client needs or potential new clients needs while leveraging solid understanding of the function, policies, procedures, or compliance requirements
- Collaborate and consult with peers, colleagues, and mid-level managers to resolve technical challenges and achieve goals
- Lead projects and act as an escalation point, provide guidance and direction to less experienced staff
Required Qualifications:
- 4+ years of Software Engineering experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
Desired Qualifications:
- 4+ years of experience in platform operations, SRE, or infrastructure engineering
- 2+ years of experience with observability tools (e.g., Prometheus, Grafana, Splunk)
- 2+ years of experience in incident management and diagnostics in production environments
- 2+ years of experience working with cloud infrastructure platforms, preferably GCP
- 1+ year of experience supporting internal platforms or services used by engineering or ML teams
- 1+ year of experience collaborating across geographically distributed teams
- Experience with infrastructure-as-code tools (i.e., Terraform, Ansible)
Job Expectations:
- Participate in incident triage and resolution across platform services.
- Maintain observability tooling to ensure visibility into system performance and reliability.
- Collaborate with infrastructure teams (e.g., GCP support) to resolve platform level issues.
- Conduct diagnostics and contribute to root cause analysis for platform incidents.
- Support internal Gen-AI facing platforms, including AgentSpace, ensuring operational stability and performance.
- Contribute to automation, runbooks and service documentation to improve operational efficiency
We Value Equal Opportunity
Wells Fargo is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other legally protected characteristic.
Candidates applying to job openings posted in Canada: Applications for employment are encouraged from all qualified candidates, including women, persons with disabilities, aboriginal peoples and visible minorities. Accommodation for applicants with disabilities is available upon request in connection with the recruitment process.