Viraaj HR Solutions Private Limited
Website:
viraajhrsolutions.com
Job details:
A technology services firm operating in the Enterprise AI and Software Engineering sector, building production-grade Generative AI solutions for search, automation, and conversational interfaces. We partner with finance, retail, and SaaS customers to deliver end-to-end GenAI features—model fine-tuning, retrieval-augmented generation (RAG), and scalable inference pipelines. This is an on-site role based in India for engineers who can productionize GenAI systems and drive measurable product impact.
Role & Responsibilities
- Design, fine-tune, and validate generative language models for production use—implement training, adapters, and quantization workflows to meet latency and cost targets.
- Build end-to-end RAG pipelines: embed and index content, implement vector search, and integrate retrieval with LLM inference for contextualized responses.
- Develop and deploy inference APIs and microservices in Python; containerize with Docker and orchestrate on Kubernetes for resilient, scalable delivery.
- Integrate orchestration and agent frameworks (LangChain or equivalent), implement prompt engineering, response ranking, and automated evaluation suites.
- Collaborate with data scientists and product teams to translate experiments into production features; implement CI/CD, model governance, monitoring, and drift detection.
- Ensure security, privacy, and bias controls in model pipelines and maintain cost-efficient inference strategies for on-site deployments across cloud environments.
Skills & Qualifications Must-Have
- Python
- PyTorch
- Hugging Face Transformers
- LangChain
- FAISS
- Milvus
- Docker
- Kubernetes
Preferred
- TensorFlow
- Ray
- AWS SageMaker
Qualifications
- 3+ years of hands-on ML/AI engineering experience with at least 1+ years focused on LLMs or GenAI model productionization.
- Proven track record deploying ML services to production and ownership of CI/CD, monitoring, and cost-optimisation for inference.
- Willingness to work on-site in India and collaborate closely with cross-functional engineering and product teams.
Benefits & Culture Highlights
- Work on cutting-edge GenAI projects that move from research to production quickly.
- Collaborative, R&D-driven engineering culture with mentorship and career growth opportunities.
- Competitive compensation and performance-linked rewards; on-site engagement with focused product teams.
Skills: ml,kubernetes,docker,pytorch,python,tensorflow,gen ai
Click on Apply to know more.