View All Jobs 114331

Data Engineer III, RAG And Gen AI

Own our production-ready RAG data pipelines and embeddings infrastructure for Gen AI applications
Gurgaon, Haryāna, India
Senior
13 hours agoBe an early applicant
Expedia Group

Expedia Group

Operates a global online travel platform connecting consumers with flights, hotels, vacation rentals, car rentals, and travel experiences.

11 Similar Jobs at Expedia Group

Data Engineer III

Expedia Group brands power global travel for everyone, everywhere. We design cutting-edge tech to make travel smoother and more memorable, and we create groundbreaking solutions for our partners. Our diverse, vibrant, and welcoming community is essential in driving our success.

To shape the future of travel, people must come first. Guided by our Values and Leadership Agreements, we foster an open culture where everyone belongs, differences are celebrated and know that when one of us wins, we all win.

We provide a full benefits package, including exciting travel perks, generous time-off, parental leave, a flexible work model (with some pretty cool offices), and career development resources, all to fuel our employees' passion for travel and ensure a rewarding career journey. We're building a more open world. Join us.

Our Technology Team partners with teams across Expedia Group to create innovative products, services, and tools to deliver high-quality experiences for travelers, partners, and our employees. A singular technology platform powered by data and machine learning provides secure, differentiated, and personalized experiences that drive loyalty and traveler satisfaction.

As a Data Engineer III with Expedia Engineering teams you will have the opportunity to leverage your technical expertise to design solutions that enrich the Data and Intelligent service Metric Enablement platform with new features and functionality to run the business. This role goes beyond traditional data engineering and you will be designing, building, deploying, and operating data pipelines, embeddings workflows to power Agentic AI applications in production and will be expected to own architecture decisions, drive AI platform evolution, and ensure enterprise-grade reliability, governance, and scalability.

You will also have the opportunity to work alongside junior developers as a coach/mentor to them & work with Sr. Devs on various tech teams to come up with solutions.

In this role, you will:

  • Design and develop, scalable cloud-native solutions that are scalable, responsive & resilient.
  • Build scalable ingestion pipelines for structured and unstructured data (documents, logs, knowledge bases, transactional data)
  • Design semantic layers and context-building strategies for LLM consumption
  • Architect and build production-ready RAG systems (retrieval pipelines, embeddings, vector indexing, ranking strategies) and work with vector databases and retrieval systems
  • Develop embedding pipelines and manage vector databases at scale
  • Develop, test, own and deliver Sprint tasks and help drive the team forward
  • Collaborate with teams and individuals to complete your team assignment on time, with quality
  • Be a coach/mentor to junior developers on the team
  • Work across multiple layers of the stack as the problem demands.
  • Have a strong sense of ownership of all technical issues
  • Identify risks, and issues & drive them to mitigation/resolution as required in the scope of your work
  • Prototype ideas, execute and learn from them and enrich the overall team experience

Experience and Qualifications

  • 6+ years of development experience in an enterprise-level engineering environment increasing levels of technical expertise.
  • 4+ years of hands-on backend Data Engineering application development experience with an excellent understanding of products with microservice architecture.
  • Proven hands-on experience designing, building, and operating data pipelines that enable LLM-based agentic AI systems, including support for embeddings, retrieval layers, and orchestration workflows.
  • Expert-level SQL and strong Python proficiency (Java is a plus)
  • Experience with distributed processing frameworks (Spark, Databricks, Flink, etc.)
  • Experience building data pipelines in cloud-native environments (AWS/GCP/Azure)
  • Experience building scalable, fault-tolerant, observable systems
  • Good knowledge of Data Structures and Algorithm.
  • Strong understanding of data modeling and semantic layer design
  • Understanding of embeddings, chunking strategies, retrieval optimization, and re-ranking

Please note that this role is only available in the following locations: Gurgaon or Bangalore, in alignment with our flexible work model which requires employees to be in-office at least three days a week. Relocation assistance will be considered for candidates relocating to these locations for this role.

If you need assistance with any part of the application or recruiting process due to a disability, or other physical or mental health conditions, please reach out to our Recruiting Accommodations Team.

Expedia is committed to creating an inclusive work environment with a diverse workforce. All qualified applicants will receive consideration for employment without regard to race, religion, gender, sexual orientation, national origin, disability or age.

+ Show Original Job Post
























Data Engineer III, RAG And Gen AI
Gurgaon, Haryāna, India
Engineering
About Expedia Group
Operates a global online travel platform connecting consumers with flights, hotels, vacation rentals, car rentals, and travel experiences.