Job Title: Deputy Manager | Engineering Foundry & Managed Services | Bengaluru | Engineering as a Service/ Oper

Job requisition ID: 98752

Date: Feb 23, 2026

Location: Bengaluru

Designation: Deputy Manager

Entity: Deloitte LLP

Role: Generative AI Engineer

Role Overview

We are seeking an innovative and experienced Generative AI Engineer to join our team. In this role, you will bridge the gap between cutting-edge Generative AI technologies and seamless user experiences. You will lead the creation of intelligent, agentic applications powered by LLMs and Retrieval-Augmented Generation (RAG) pipelines while delivering intuitive and responsive frontend interfaces.

Key Responsibilities

Design and implement Generative AI solutions utilizing LLMs such as GPT, Claude, and LLaMA
Develop and optimize RAG pipelines using vector databases (ChromaDB for local development and Databricks Mosaic AI Vector Search)
Build and maintain agentic workflows and conversational AI agents using frameworks and platforms such as LangGraph and OpenAI
Vectorize content using embedding models such as GTE-large and BGE, optionally fine-tuning them for improved accuracy
Collaborate with backend and application teams to integrate Generative AI capabilities into products

Required Skills & Experience

Deep understanding of LLMs, prompt engineering, and transformer architectures.
Hands-on experience with RAG architectures, vector search, and embedding models.
Proficiency in Python and libraries such as LangChain, Hugging Face Transformers, or the OpenAI SDK.
Familiarity with agentic frameworks like AutoGen, OpenAI, CrewAI, or LangGraph.
Experience deploying AI models via APIs or microservices.

Preferred Skills

Bachelor’s or Master’s degree in Computer Science, Artificial Intelligence, or a related field
Experience with cloud platforms such as AWS, GCP, or Azure, and with CI/CD pipelines