AI Engineer

VAYUZ Technologies

full-time

Required skills

Website: vayuz.com
Job details:

Job Description:

We are seeking a seasoned AI Engineer to build, fine-tune, and deploy intelligent AI systems at scale. You

will work at the intersection of LLMs, machine learning, and software engineering — developing production-

ready AI features and pipelines that power our core product.

Key Responsibilities:

• Design, develop, and deploy AI/ML models and pipelines in production environments

• Implement Retrieval-Augmented Generation (RAG) architectures and agentic AI workflows

• Fine-tune and optimize LLMs for domain-specific use cases using RLHF, LoRA, QLoRA

• Build robust prompt engineering frameworks and evaluation pipelines

• Integrate LLM APIs (OpenAI, Claude, Gemini) and open-source models into product features

• Develop and maintain vector search infrastructure and embedding pipelines

• Collaborate with architects, backend engineers, and product teams on AI feature delivery

• Monitor model performance, conduct A/B testing, and iterate based on metrics

• Implement guardrails, safety layers, and hallucination-mitigation strategies

• Contribute to MLOps practices: model versioning, deployment pipelines, monitoring

KEY SKILLS & REQUIREMENTS:

• Strong expertise in Python, with deep knowledge of AI/ML libraries (PyTorch, TensorFlow,

HuggingFace Transformers)

• Hands-on experience with LLM APIs and prompt engineering techniques (CoT, few-shot, ReAct)

• Experience with RAG systems, embedding models (text-embedding-3, BGE, Cohere), and vector

stores

• Knowledge of agentic frameworks: LangChain, LlamaIndex, AutoGen, CrewAI, or Semantic Kernel

• Familiarity with fine-tuning techniques: LoRA, QLoRA, PEFT, instruction tuning

• Experience deploying models on cloud platforms (AWS SageMaker, GCP Vertex AI, Azure ML)

• Understanding of data preprocessing, feature engineering, and model evaluation metrics

• Proficiency with MLOps tools: MLflow, DVC, Weights & Biases, BentoML

• Experience with containerization and orchestration: Docker, Kubernetes

• Strong debugging and experimentation skills with Jupyter, FastAPI, Streamlit

NICE TO HAVE:

• Experience with multi-modal models (vision-language models, Whisper, DALL-E)

• Published papers or Kaggle/competition achievements

• Exposure to speech AI, computer vision, or NLP specializations

• Knowledge of responsible AI, fairness metrics, and bias mitigation.

Click on Apply to know more.

This page is fully interactive when JavaScript is enabled. Please enable JavaScript to apply or browse related roles.