AI Engineer

Darwix AI

Location: Gurugram, Haryana, India
Job type: Full-time

Required skills

LangChain
Python
AWS
Artificial Intelligence
backend
caching
cross-functional
customer engagement
data ingestion
data pipeline
deep learning
DevOps
Django
Docker
end-to-end
enterprise sales
FastAPI
Flask
frontend
GitHub
Kafka
Kubernetes
Lambda
machine learning
MySQL
NLP
Postgres
Redis
Salesforce
TensorFlow
Pytorch

About the role

Darwix AI

Website: darwix.ai
Job details:

Job Title: AI Engineers

Location: Gurugram (In-office)

Working Days: Monday to Saturday, with 1st and 3rd Saturdays off

Working Hours: 10:30 AM – 8:00 PM

Experience Required: Prior experience in AI field

About Darwix AI

Darwix AI is one of India’s fastest-growing GenAI startups, revolutionizing the future of enterprise sales and customer engagement with real-time conversational intelligence. We are building a GenAI-powered agent-assist and pitch intelligence suite that captures, analyzes, and enhances every customer interaction across voice, video, and chat in real time.

We serve leading enterprise clients across India, the UAE, and Southeast Asia and are backed by global VCs, top operators from Google, Salesforce, and McKinsey, and CXOs from the industry. This is your opportunity to join a high-caliber team solving frontier problems in real-time voice AI, multilingual transcription, retrieval-augmented generation (RAG), and fine-tuned LLMs at scale..

Role Overview

As the AI Engineer, you will drive the development, deployment, and optimization of AI systems that power Darwix AI's real-time conversation intelligence platform. This includes voice-to-text transcription, speaker diarization, GenAI summarization, prompt engineering, knowledge retrieval, and real-time nudge delivery. You will lead a team of AI engineers and work closely with product managers, software architects, and data teams to ensure technical excellence, scalable architecture, and rapid iteration cycles. This is a high-ownership, hands-on leadership role where you will code, architect, and lead simultaneously.

Key Responsibilities

AI Architecture & Model Development

Architect end-to-end AI pipelines for transcription, real-time inference, LLM integration, and vector-based retrieval.
Build, fine-tune, and deploy STT models (Whisper, Wav2Vec2.0) and diarization systems for speaker separation.
Implement GenAI pipelines using OpenAI, Gemini, LLaMA, Mistral, and other LLM APIs or open-source models.

Real-Time Voice AI System Development

Design low-latency pipelines for capturing and processing audio in real-time across multi-lingual environments.
Work on WebSocket-based bi-directional audio streaming, chunked inference, and result caching.
Develop asynchronous, event-driven architectures for voice processing and decision-making.

RAG & Knowledge Graph Pipelines

Create retrieval-augmented generation (RAG) systems that pull from structured and unstructured knowledge bases.
Build vector DB architectures (e.g., FAISS, Pinecone, Weaviate) and connect to LangChain/LlamaIndex workflows.
Own chunking, indexing, and embedding strategies (OpenAI, Cohere, Hugging Face embeddings).

Fine-Tuning & Prompt Engineering

Fine-tune LLMs and foundational models using RLHF, SFT, PEFT (e.g., LoRA) as needed.
Optimize prompts for summarization, categorization, tone analysis, objection handling, etc.
Perform few-shot and zero-shot evaluations for quality benchmarking.

Pipeline Optimization & MLOps

Ensure high availability and robustness of AI pipelines using CI/CD tools, Docker, Kubernetes, and GitHub Actions.
Work with data engineering to streamline data ingestion, labeling, augmentation, and evaluation.
Build internal tools to benchmark latency, accuracy, and relevance for production-grade AI features.

Team Leadership & Cross-Functional Collaboration

Lead, mentor, and grow a high-performing AI engineering team.
Collaborate with backend, frontend, and product teams to build scalable production systems.
Participate in architectural and design decisions across AI, backend, and data workflows.

Key Technologies & Tools

Languages & Frameworks: Python, FastAPI, Flask, LangChain, PyTorch, TensorFlow, HuggingFace Transformers
Voice & Audio: Whisper, Wav2Vec2.0, DeepSpeech, pyannote.audio, AssemblyAI, Kaldi, Mozilla TTS
Vector DBs & RAG: FAISS, Pinecone, Weaviate, ChromaDB, LlamaIndex, LangGraph
LLMs & GenAI APIs: OpenAI GPT-4/3.5, Gemini, Claude, Mistral, Meta LLaMA ⅔
DevOps & Deployment: Docker, GitHub Actions, CI/CD, Redis, Kafka, Kubernetes, AWS (EC2, Lambda, S3)
Databases: MongoDB, Postgres, MySQL, Pinecone, TimescaleDB
Monitoring & Logging: Prometheus, Grafana, Sentry, Elastic Stack (ELK)

Requirements & Qualifications

Experience

1-4 years of experience in building and deploying AI/ML systems, with at least 2+ years in NLP or voice technologies.
Proven track record of production deployment of ASR, STT, NLP, or GenAI models.
Hands-on experience building systems involving vector databases, real-time pipelines, or LLM integrations.

Educational Background

Bachelor's or Master's in Computer Science, Artificial Intelligence, Machine Learning, or a related field.
Tier 1 institute preferred

Technical Skills

Strong coding experience in Python and familiarity with FastAPI/Django.
Understanding of distributed architectures, memory management, and latency optimization.
Familiarity with transformer-based model architectures, training techniques, and data pipeline design.

Bonus Experience

Worked on multilingual speech recognition and translation.
Experience deploying AI models on edge devices or browsers.
Built or contributed to open-source ML/NLP projects.
Published papers or patents in voice, NLP, or deep learning domains.

What Success Looks Like in 6 Months

Lead the deployment of a real-time STT + diarization system for at least 1 enterprise client.
Deliver high-accuracy nudge generation pipeline using RAG and summarization models.
Build an in-house knowledge indexing + vector DB framework integrated into the product.
Mentor 2–3 AI engineers and own execution across multiple modules.
Achieve <1 sec latency on real-time voice-to-nudge pipeline from capture to recommendation.

What We Offer

Compensation: Competitive fixed salary + equity + performance-based bonuses
Impact: Ownership of key AI modules powering thousands of live enterprise conversations
Learning: Access to high-compute GPUs, API credits, research tools, and conference sponsorships
Culture: High-trust, outcome-first environment that celebrates execution and learning
Mentorship: Work directly with founders, ex-Microsoft, IIT-IIM-BITS alums, and top AI engineers
Scale: Opportunity to scale an AI product from 10 clients to 100+ globally within 12 months

Click on Apply to know more.

This page is fully interactive when JavaScript is enabled. Please enable JavaScript to apply or browse related roles.