Job Title:  Deputy Manager | Engineering Foundry & Managed Services | Bengaluru | Engineering as a Service/ Oper

Job requisition ID ::  98752
Date:  Feb 23, 2026
Location:  Bengaluru
Designation:  Deputy Manager
Entity:  Deloitte LLP

Role: Generative AI Engineer 

 

Role Overview 

We are seeking an innovative and experienced Generative AI Engineer to join our team. In this role, you will bridge the gap between cutting-edge Generative AI technologies and seamless user experiences. You will lead the creation of intelligent, agentic applications powered by LLMs and Retrieval-Augmented Generation (RAG) pipelines while delivering intuitive and responsive frontend interfaces. 


 


Key Responsibilities 

  • Design and implement Generative AI solutions utilizing LLMs such as GPT, Claude, and LLaMA 
  • Develop and optimize RAG pipelines using vector databases (ChromaDB for local development and Databricks Mosaic ML Vector Stores) 
  • Build and maintain agentic workflows and conversational AI agents using frameworks like LangGraph and OpenAI 
  • Perform vectorization using models such as GTE-large and BGE, and optionally fine-tune them for improved accuracy 
  • Collaborate with backend and app teams to integrate Gen AI capabilities into products 

 

Required Skills & Experience 

  • Deep understanding of LLMs, prompt engineering, and transformer architectures. 
  • Hands-on experience with RAG architectures, vector search, and embedding models. 
  • Proficiency in Python and libraries such as LangChain, Transformers, OpenAI SDK, or Hugging Face. 
  • Familiarity with agentic frameworks like AutoGen, OpenAI, CrewAI, or LangGraph. 
  • Experience deploying AI models via APIs or microservices. 

 

Preferred Skills 

  • Bachelor’s or Master’s degree in Computer Science, Artificial Intelligence, or a related field 
  • Experience with cloud platforms such as AWS, GCP, or Azure, and CI/CD pipeline