Josh Talks
Website:
joshtalks.com
Job details:
AI / ML Engineer Intern
Josh Talks AI
Location: Gurgaon, India
Type: Full-time Internship (6 months)
About JoshTalks AI
At JoshTalks AI, we believe that voice will become the primary interface between humans and machines. Our mission is ambitious and focused:
- Enable machines to communicate as naturally as humans
- Build benchmarks and datasets that form the backbone of global progress in speech AI
- Drive improvements not only through models or compute, but through high-quality, diverse, real-world data
Our datasets already power some of the world’s largest and most widely used speech models (ones you’ve almost certainly interacted with, even if we can’t name them).
What You’ll Work On
This is not a conventional internship. You will contribute directly to work that influences the global speech AI ecosystem.
Benchmarking & Evaluation
- Design and run evaluations for ASR and speech-to-speech systems
- Benchmark leading speech models to identify real-world strengths, weaknesses, and failure modes
- Build evaluation frameworks that guide top AI labs on where models perform well and where they need improvement
Modeling & Fine-Tuning
- Fine-tune speech recognition models (e.g., Whisper, wav2vec2) to push Word Error Rates toward ~5%
- Experiment with multilingual, code-switched, accented, and noisy speech to reflect real-world usage
- Work with large-scale, production-grade speech datasets
Impact at Scale
- Your work will go beyond internal experiments or academic exercises
- It will directly influence how large speech models are built, tested, and improved globally
Who We’re Looking For
We are looking for motivated builders with a strong interest in voice AI.
Required Qualifications
- Final-year undergraduate students or recent graduates (B.Tech/B.E.) in CSE, EE, AI/ML, or related fields
- Strong interest in speech, audio, NLP, or multimodal AI
- Hands-on experience in one or more of the following:
- Fine-tuning speech or language models (Whisper, wav2vec2, HuBERT, SER, etc.)
- Building speech-driven systems such as assistants, classifiers, or SER pipelines
- Working with PyTorch, TensorFlow, or Hugging Face Transformers
Preferred Qualifications
- Open-source contributions, GitHub projects, Kaggle experience, or research papers
- Experience working with multilingual or low-resource speech data
Why Join Us
Meaningful Ownership
Even as an intern, you will own problems of global relevance, from reducing ASR error rates to building benchmarks that influence the next generation of speech-to-speech models.
Front-Row Seat to Speech AI
Your work will shape benchmarks and datasets used by leading AI research labs worldwide.
Deep Technical Learning
Work alongside experts tackling speech challenges across 20+ Indian languages and real-world audio conditions.
Startup Environment, Global Reach
A small, focused team working on problems that impact billions of users.
Details
- Location: Gurgaon (on-site preferred for close collaboration)
- Duration: 6
- Type: Paid, full-time internship
- Start Date: Flexible (aligned with academic calendars)
If you are passionate about making speech AI as natural as human conversation, this role offers the opportunity to work at the true frontier of the field.
To Apply:
Please email your profile to careers@joshtalks.com
Click on Apply to know more.