Questhiring
Website: questhiring.com
Job details:
Job Title: Director – Agentic AI, RAG & GenAI Platforms
Location: Gurgaon
Experience: 14–19 Years
About the Role
We are seeking a seasoned technology leader to drive architecture, engineering, and innovation across Agentic AI, Retrieval-Augmented Generation (RAG), LLMOps, and Enterprise AI Platforms. This role requires deep expertise in designing scalable GenAI ecosystems, intelligent multi-agent systems, low-latency AI architectures, and production-grade LLM-powered applications.
Key Responsibilities
- Architect and scale Agentic AI systems, autonomous workflows, and multi-agent orchestration frameworks.
- Design and implement advanced RAG frameworks, including Hybrid Search, GraphRAG, Agentic RAG, and knowledge-grounded AI systems.
- Lead development of enterprise-grade LLM-powered platforms using frameworks such as LangChain, Semantic Kernel, AutoGen, CrewAI, or similar.
- Define and implement LLMOps / MLOps frameworks, including model deployment, monitoring, governance, evaluation, and responsible AI controls.
- Build scalable AI infrastructure for low-latency, high-QPS systems, covering vector search, embeddings, model serving, and inference optimization.
- Drive AI platform strategy, intelligent automation, and AI cost optimization through techniques such as LLM routing, SLM-first approaches, and model cascading.
- Lead development and fine-tuning of foundation models using parameter-efficient fine-tuning (PEFT) methods such as LoRA and QLoRA, along with prompt engineering and model evaluation frameworks.
- Architect AI-ready data pipelines integrating structured/unstructured enterprise data, vector databases, knowledge graphs, and semantic retrieval systems.
- Partner with product, engineering, and business stakeholders to deliver scalable AI products driving measurable business impact.
- Lead and mentor high-performing engineering and AI teams while driving enterprise-wide AI adoption.
Required Skills & Experience
- 14–19 years of experience in software engineering, AI/ML, distributed systems, and enterprise platform architecture.
- Strong expertise in Agentic AI, RAG, GraphRAG, Multi-Agent Systems, and LLM application development.
- Hands-on experience with foundation models and model platforms such as Azure OpenAI, GPT-4, Phi-3, Llama, Hugging Face, or similar.
- Strong experience with Semantic Kernel, AutoGen, LangChain, Prompt Flow, or equivalent orchestration frameworks.
- Deep expertise in vector databases and vector search libraries (FAISS, Chroma, Azure Cosmos DB for MongoDB vCore) and semantic retrieval architectures.
- Strong background in LLMOps / MLOps, including AzureML, MLflow, Kubeflow, Docker, Kubernetes, and model governance.
- Experience building high-scale, low-latency distributed systems (1,000+ QPS and sub-4-second SLAs preferred).
- Strong programming skills in Python and Java.
- Experience in cloud-native architectures across Azure / AWS / GCP.
Preferred Background
- Prior experience with global product/technology organizations such as Microsoft, Google, Amazon, Adobe, Salesforce, or equivalent.
- Strong experience in enterprise AI transformation, AI governance, and responsible AI frameworks.
- Experience leading large-scale AI platform adoption and driving measurable business outcomes.
- Exposure to AI cost optimization, intelligent model routing, and advanced inference strategies is highly preferred.
Why Join
- Opportunity to build next-generation Agentic AI platforms at scale.
- Drive enterprise-wide innovation in GenAI and autonomous systems.
- Lead strategic AI architecture initiatives with significant business impact.
- Work on cutting-edge problems spanning agents, reasoning, RAG, and intelligent automation.