View All Jobs 156622

Platform Engineer – Python/fastapi/kubernetes/terraform/ai - Vice President - Software Engineering

Build a firmwide AI platform leveraging Kubernetes, Terraform, and Generative AI models
BangaloreMumbai
Expert
yesterday

Platform Engineer – Python/FastAPI/Kubernetes/Terraform/AI

We're seeking someone to join our Enterprise Technology & Services Team as Vice President. This role is for a platform engineer who will help build a firmwide AI Development Platform and drive adoption of AI capabilities throughout the enterprise. We have multiple focus areas across the platform and are looking for energetic, multi-disciplinary candidates who are eager to contribute to providing scalable, secure, enterprise-wide solutions for the firm.

Architecture & Modernization Drives development of the global firm strategy to define modern architectures and guardrails to reduce legacy debt, while partnering with app dev to accelerate the adoption of modern capabilities

Advisory is responsible for end-to-end process life cycle support for Investment Banking, Global Capital Markets and Research with a focus on efficiency, scale, and time to market. Sales Distribution develops technology for the firm's IED and FID Sales and Distribution business units, creating application synergies across the businesses various platforms

In the Technology division, we leverage innovation to build the connections and capabilities that power our Firm, enabling our clients and colleagues to redefine markets and shape the future of our communities. This is a Lead Software Engineering position at Vice President level, which is part of the job family responsible for developing and maintaining software solutions that support business needs.

Since 1935, Morgan Stanley is known as a global leader in financial services, always evolving and innovating to better serve our clients and our communities in more than 40 countries around the world.

What You'll Do In The Role:

  • Develop tooling and self-service capabilities for deploying AI solutions for the firm leveraging Kubernetes/OpenShift, Python, authentication solutions, APIs, REST framework, etc.
  • Develop Terraform modules and Cloud architecture to enable secure AI cloud service deployment and consumption at scale
  • Have a platform mindset and build common, reusable solutions to scale Generative AI use cases using pre-trained models as well as fine-tuned models. Leverage Kubernetes/OpenShift to develop modern containerized workloads
  • Integrate with capabilities such as large-scale vector stores for embeddings
  • Author best practices on the Generative AI ecosystem, when to use which tools, available models such as GPT, Llama, Hugging Face etc. and libraries such as Langchain.
  • Analyze, investigate, and implement GenAI solutions focusing on Agentic Orchestration and Agent Builder frameworks
  • Author and publish architecture decision records to capture major design decisions and product selection for building Generative AI solutions. Inclusive of app authentication, service communication, state externalization, container layering strategy and immutability.
  • Ensure AI platform is reliable, scalable, and operational; (e.g. blueprints for upgrade/release strategies (E.g. Blue/Green); logging/monitoring/metrics; automation of system management tasks)
  • Participate in all team's Agile/ Scrum ceremonies.
  • Participate in teams on call rotation in build/run team model

What You'll Bring To The Role:

The ideal candidate will have strong hands-on experience building software platforms on any combination of the following platforms - Kubernetes, Cloud (AWS, Azure, and/or Google), API based development, REST framework, data engineering, and large-scale API Gateway environments etc. Knowledge of AIML and hands-on experience implementing solutions using Generative AI are also preferable. The candidate will have great communication skills, a team-based mentality and a strong passion for using AI to increase productivity as well as help generate new ideas for product & technical improvements.

Required Skills:

  • Strong hands-on Application Development background in at least one prominent programming language, preferably Python Flask or FAST Api.
  • Broad understanding of data engineering (SQL, NoSQL, Big Data, Kafka, Redis), data governance, data privacy and security.
  • Experience in development, management, and deployment of Kubernetes workloads, preferably on OpenShift.
  • Experience with designing, developing, and managing RESTful services for large-scale enterprise solutions.
  • Experience deploying applications on Azure, AWS, and/or GCP using IaC (Terraform)
  • Hands-on experience with multiprocessing, multithreading, asynchronous I/O, performance profiling in at least one prominent programming language, preferably Python.
  • Ability to articulate technical concepts effectively to diverse audiences.
  • Excellent communication skills.
  • Demonstrated ability to work effectively and collaboratively in a global organization, across time zones, and across organizations
  • Demonstrated experience in DevOps, understanding of CI/CD (Jenkins) and GitOps.
  • Knowledge of DevOps and Agile practices.
  • At least 8 years' relevant experience would generally be expected to find the skills required for this role

Desired Skills:

  • Practitioner of unit testing, performance testing and BDD/acceptance testing.
  • Understanding of OAuth 2.0 protocol for secure authorization.
  • Proficiency with Open Telemetry tools including Grafana, Loki, Prometheus, and Cortex.
  • Good knowledge of Microservice based architecture, industry standards, for both public and private cloud.
  • Good understanding of modern Application configuration techniques.
  • Hands on experience with Cloud Application Deployment patterns like Blue/Green.
  • Good understanding of State sharing between scalable cloud components (Kafka, dynamic distributed caching).
  • Good knowledge of various DB engines (SQL, Redis, Kafka, etc) for cloud app storage.
  • Experience building AI applications, preferably Generative AI and LLM based apps.
  • Deep understanding of AI agents, Agentic Orchestration, Multi-Agent Workflow Automation, along with hands-on experience in Agent Builder frameworks such Lang Chain and Lang Graph.
  • Experience working with Generative AI development, embeddings, fine tuning of Generative AI models.
  • Understanding of ModelOps/ ML Ops/ LLM Op.
  • Understanding of SRE

What You Can Expect From Morgan Stanley:

We are committed to maintaining the first-class service and high standard of excellence that have defined Morgan Stanley for over 89 years. Our values - putting clients first, doing the right thing, leading with exceptional ideas, committing to diversity and inclusion, and giving back - aren't just beliefs, they guide the decisions we make every day to do what's best for our clients, communities and more than 80,000 employees in 1,200 offices across 42 countries. At Morgan Stanley, you'll find an opportunity to work alongside the best and the brightest, in an environment where you are supported and empowered. Our teams are relentless collaborators and creative thinkers, fueled by their diverse backgrounds and experiences. We are proud to support our employees and their families at every point along their work-life journey, offering some of the most attractive and comprehensive employee benefits and perks in the industry. There's also ample opportunity to move about the business for those who show passion and grit in their work.

+ Show Original Job Post
























Platform Engineer – Python/fastapi/kubernetes/terraform/ai - Vice President - Software Engineering
BangaloreMumbai
Engineering
About Morgan Stanley