Website:
greyhiring.work
Job details:
● Title: AI Engineer (Voice AI)
● Location: Gurugram
● Engagement: Full-time
● Experience: 4–6 years. Candidates from Tier I and Tier II colleges, including but not limited to all IITs, NITs, and BITS.
What You’ll Do
● Build and improve real-time voice agents for sales, support, and operations use cases
● Work with ASR, TTS, LLMs, and dialog orchestration frameworks to enhance voice quality and response accuracy
● Work on multiple models for VAD, turn detection, interruption handling, etc.
● Contribute to core voice infrastructure: WebRTC, streaming sockets, call routing, and scaling systems
● Reduce latency, improve interrupt handling, and make conversations more natural
● Integrate multiple providers (Deepgram, Sarvam, ElevenLabs, OpenAI) into a single modular voice pipeline
● Own agent evaluation: call audits, response quality scoring, accuracy improvements
● Work closely with product and customers to understand real call behaviours and fix friction points
What You Bring
● 4–6 years of experience as an AI/ML/LLM engineer. Experience with Voice AI is mandatory: you should have built Voice AI agents and taken them to production
● Experience building things from 0 to 1 or 1 to 10, with an ownership mindset and a willingness to iterate
● Experience working with voice or conversational systems
● Hands-on with LLM prompting, finetuning, embeddings, and RAG
● Solid understanding of STT and TTS models and how to tune them for natural speech
● Proficient in Python and Node
● Experience working with real-time audio streaming (WebRTC, WebSockets) is a strong advantage
● Ability to debug live calls and improve agent behaviour fast
● Ownership mindset and willingness to iterate directly with users
Click Apply to learn more.