Persistent Systems
Website:
persistent.com
Job details:
About Position:
We are looking for a highly skilled Senior AI/GenAI Engineer to lead the development of scalable, production-grade AI systems. The role involves designing and implementing advanced LLM/RAG solutions, establishing ML Ops pipelines, and building end-to-end AI workflows using Azure AI Foundry, including its Control Plane, prompt flow, and agent capabilities.
- Role: Senior AI/GenAI Engineer
- Location: Mumbai
- Experience: 6 to 10 Years
- Job Type: Full Time Employment
What You'll Do:
- Lead the design and development of scalable LLM-based and RAG (Retrieval-Augmented Generation) solutions
- Architect and implement production-grade AI/ML systems with high availability and performance
- Build and manage ML Ops pipelines for model training, deployment, monitoring, and lifecycle management
- Design and automate end-to-end AI workflows using Azure AI Foundry Control Plane, Prompt Flow, and Agent frameworks
- Develop reusable frameworks for prompt engineering, orchestration, and experimentation
- Integrate LLMs (Azure OpenAI, Open-source models) into enterprise systems and applications
- Optimize AI models and pipelines for latency, cost, and accuracy
- Implement evaluation frameworks for LLM and RAG systems, including benchmarking and monitoring
- Collaborate with cross-functional teams to translate business requirements into scalable AI solutions
- Mentor junior engineers and drive best practices in AI/GenAI development
Expertise You'll Bring:
- Strong expertise in Python and AI/ML system design
- Hands-on experience with Generative AI, LLMs, and RAG architectures
- Deep understanding of vector databases (e.g., FAISS, Pinecone, Azure Cognitive Search)
- Experience with Azure AI Foundry, including: Control Plane, Prompt Flow, Agent capabilities
- Proficiency in ML Ops tools and frameworks (Azure ML, MLflow, CI/CD pipelines)
- Experience in building and deploying microservices-based AI architectures
- Knowledge of API integration, distributed systems, and cloud-native development (Azure preferred)
- Strong understanding of model evaluation, monitoring, and observability
- Experience with LangChain, LlamaIndex, Semantic Kernel
- Exposure to fine-tuning and custom model training
- Knowledge of containerization (Docker, Kubernetes)
- Familiarity with data engineering pipelines (ETL, streaming)
- Understanding of Responsible AI and governance practices
Benefits:
- Competitive salary and benefits package
- Culture focused on talent development with quarterly growth opportunities and company-sponsored higher education and certifications
- Opportunity to work with cutting-edge technologies
- Employee engagement initiatives such as project parties, flexible work hours, and Long Service awards
- Annual health check-ups
- Insurance coverage: group term life, personal accident, and Mediclaim hospitalization for self, spouse, two children, and parents
Values-Driven, People-Centric & Inclusive Work Environment:
Persistent is dedicated to fostering diversity and inclusion in the workplace. We invite applications from all qualified individuals, including those with disabilities, and regardless of gender or gender preference. We welcome diverse candidates from all backgrounds.
- We support hybrid work and flexible hours to fit diverse lifestyles.
- Our office is accessibility-friendly, with ergonomic setups and assistive technologies to support employees with physical disabilities.
- If you are a person with disabilities and have specific requirements, please inform us during the application process or at any time during your employment
Let’s unleash your full potential at Persistent - persistent.com/careers
“Persistent is an Equal Opportunity Employer and prohibits discrimination and harassment of any kind.”
Click on Apply to know more.