Website:
nexlance.co.in
Job details:
Role Overview
We are seeking a highly skilled Senior GenAI Engineer with deep expertise in building production-grade, stateful multi-agent AI systems on the Google Cloud ecosystem.
The ideal candidate will have hands-on experience designing and deploying agentic workflows using modern LLM orchestration frameworks and implementing scalable backend systems in Python.
This role requires strong architectural thinking, production deployment experience, and the ability to work closely with product owners and business stakeholders.
Key Responsibilities
🔹 Agentic Workflow Development
• Design and implement complex, stateful multi-agent systems using LangGraph and Google Agent Development Kit (ADK).
• Build cyclic and graph-based orchestration workflows for AI agents.
🔹 AI & LLM Integration
• Implement Model Context Protocol (MCP) for structured interaction between LLMs and enterprise systems.
• Build and optimize RAG (Retrieval-Augmented Generation) pipelines.
• Fine-tune prompts and optimize LLM latency and performance.
• Work with Google Gemini models in production environments.
🔹 Cloud & Infrastructure
• Architect and deploy AI microservices on Google Cloud Platform (GCP).
• Work extensively with:
• Vertex AI
• Model Garden
• Cloud Run
• Cloud Functions
• Manage data pipelines using BigQuery and vector databases.
🔹 Backend Development
• Develop robust, scalable RESTful APIs using Python (FastAPI/Flask).
• Strong knowledge of asynchronous programming and Pydantic.
🔹 Data & Vector Infrastructure
• Work with vector databases such as:
• Vertex AI Search & Conversation
• Pinecone
• Weaviate
• Manage embeddings lifecycle and evaluation frameworks.
🔹 DevOps & Quality
• Containerization using Docker and Kubernetes.
• Strong Git workflows (branching strategy, PR reviews).
• Participate in CI/CD automation pipelines.
• Provide design documentation, installation guides, configuration documentation, and runbooks.
🔹 Production Support
• Provide hypercare support during initial production rollout.
• Troubleshoot and resolve production incidents.
• Participate in requirement gathering sessions with product and business teams.
Required Skills & Experience
• 4+ years in software engineering with strong backend expertise.
• Advanced Python programming (async programming mandatory).
• Hands-on experience with LangGraph and LangChain.
• Strong experience deploying AI solutions on Google Cloud Platform.
• Experience implementing MCP (Model Context Protocol).
• Strong understanding of RAG architecture and evaluation.
• Experience with vector databases.
• Familiarity with Google Gemini models.
• Strong API design principles and third-party integration experience.
Preferred Qualifications
• Prior experience building production-scale multi-agent systems.
• Experience optimizing LLM latency and cost.
• Experience with enterprise AI governance and security best practices.
• Exposure to BigQuery-based data orchestration.
Click on Apply to know more.