Location: Remote-First
Level: Senior (5+ years) | Full-time
DevSavant is the engineering powerhouse behind Savant Growth's portfolio companies. We're not your typical dev shop—we pair small, surgical teams with cutting-edge GenAI tooling to solve real business problems. Our unique approach: ship experimental features in focused one-week innovation sprints, then mature the winners into robust production services that scale.
We're seeking a senior, self-directed GenAI Full-Stack Engineer who thrives at the intersection of rapid innovation and production excellence. You'll own the complete lifecycle of GenAI initiatives:
Prototype Fast – Design, build, and demo proof-of-concepts within single sprints
Pressure-Test & Iterate – Instrument, evaluate, and refine until ideas prove value
Productionize – Apply MLOps, observability, and CI/CD so successful PoCs scale
The ideal teammate moves quickly with minimal supervision, selects the right tool for each job, and communicates clearly across engineering and product stakeholders.
Architect and develop LLM-powered workflows using LangGraph, LangChain, and emerging frameworks
Design and implement production RAG systems with hybrid search, re-ranking, and semantic caching
Build multi-agent orchestration with CrewAI, AutoGen, AgentKit—selecting build-vs-buy per use case
Create real-time AI interfaces with streaming responses, progressive enhancement, and edge deployment
Connect GenAI pipelines to enterprise data via low-code orchestrators (n8n, Make, Langflow)
Implement formal prompt lifecycles with version control, A/B testing, and performance tracking
Add runtime guardrails and automated evaluation using Guardrails-AI, Ragas, Promptfoo
Optimize for cost & latency via prompt caching, selective chunking, and model routing
Instrument with enterprise observability (LangSmith, OpenTelemetry, Prometheus/Grafana)
When prototypes show traction, you'll:
Design scalable architectures handling 100K+ concurrent users
Implement comprehensive testing (unit, integration, E2E, evaluation suites)
Build CI/CD pipelines with automated quality gates and progressive rollouts
Deploy with container orchestration (Docker, Kubernetes) and auto-scaling
Monitor AI-specific metrics (latency, cost per request, quality scores)
Python (5+ years): Advanced proficiency with type hints, async/await, dataclasses
TypeScript/JavaScript (5+ years): Modern ES2022+, React 18+, Node.js 20+
Full-Stack Frameworks: Next.js 14+ (App Router), Remix, FastAPI, Express.js
LLM Orchestration: LangGraph, LangChain, LlamaIndex (2+ years production)
Multi-Agent Systems: CrewAI, AutoGen, Autogen Studio, custom agent frameworks
Vector Databases: Pinecone, Weaviate, Qdrant, pgvector (with hybrid search)
RAG Expertise: Advanced patterns (HyDE, multi-hop, contextual compression)
Model Integration: OpenAI, Anthropic, Mistral, Llama, multimodal models
Evaluation & Testing: Ragas, Promptfoo, custom eval frameworks
Cloud Platforms: AWS/GCP/Azure with AI services (Bedrock, Vertex AI, Azure OpenAI)
Containerization: Docker, Kubernetes, serverless architectures
Observability: LangSmith, DataDog, OpenTelemetry distributed tracing
MLOps: Model versioning, A/B testing, feature stores, experiment tracking
Low-Code Tools: n8n, Make, Zapier for rapid integration
Databases: PostgreSQL, Redis, DynamoDB, vector stores
Streaming: Kafka, Redis Streams, WebSockets for real-time AI
Caching: Multi-tier caching strategies, CDN optimization
Security: OAuth2, JWT, API gateways, prompt injection prevention
5+ years professional full-stack development
2+ years shipping GenAI/LLM features to production
1+ year with vector databases and production RAG systems
Proven track record of 0→1 product development
Experience with high-traffic applications (10K+ concurrent users)
Autonomous & Accountable – Set your course, surface blockers early, deliver independently
Bias for Action – Spike new libraries in the morning, demo results by afternoon
Quality Owner – Comprehensive testing mindset with evaluation-driven development
Continuous Learner – Track bleeding-edge releases, introduce relevant tools
Strong Communicator – Translate complex AI concepts to all stakeholders
Experience with fine-tuning (LoRA/QLoRA) on Mistral, Llama, or domain-specific models
Multimodal AI applications (vision + language, audio processing)
Research implementation – turning papers into production code
Open source contributions to major AI/ML projects
Published technical content (blogs, talks, papers) on GenAI topics
Experience with AI safety and alignment practices