Flag job

Report

Speech Recognition Intern

Min Experience

0 years

Location

remote

JobType

internship

About the role

Sony is hiring for the position of Speech Recognition Intern in Mumbai, India. Candidates with Bachelor's/ Master's Degree/ Ph.D are eligible to apply for this position. The complete information, eligibility criteria, and requirements are provided below. Job Description: ompany Name Sony Position Speech Recognition Intern Qualifications Bachelor's/ Master's degree/ Ph.D Batch Recent Batches Experience Freshers Location Work From Home (Remote) Key Responsibilities: - Collaborate with the research team to design, implement, and evaluate advanced speech recognition models such as Whisper, Wav2Vec2, and Conformer. - Optimize existing speech recognition algorithms to improve accuracy, noise robustness, and reduce latency. - Stay informed about the latest advancements in speech recognition and speaker diarization. - Contribute insights and findings to enhance the team's knowledge base and research direction. Eligibility Criteria: - Currently pursuing or completed a Master's (Research) or Ph.D. in Deep Learning or Machine Learning, with practical experience applying Transformer models in audio/speech-related applications. Essential Skills: - Proficient in programming with Python and shell scripting. - Familiar with Automatic Speech Recognition (ASR) frameworks such as HuggingFace Transformers, ESPnet, SpeechBrain, Kaldi, or OpenAI Whisper. - Hands-on experience in deep learning and machine learning using libraries like PyTorch, TensorFlow, or Librosa. - Strong foundational knowledge in machine learning principles and signal processing. Preferred skills: - Understanding of deep learning techniques for speech, including CTC, encoder-decoder architectures, and attention mechanisms. - Prior experience in developing ASR systems for Indian languages and building noise-robust ASR solutions.

About the company

Sony's purpose is simple. We aim to fill the world with emotion, through the power of creativity and technology. We want to be responsible for getting hearts racing, stirring ambition, and putting a smile on the faces of our customers. That challenge, combined with our spirit of innovation, motivates us to create groundbreaking technology, entertainment, and services for people worldwide. Our history as a global brand has been built around employees that all have a passion for touching peoples'​ lives, and pride in pushing beyond the status quo to produce truly extraordinary results.

Skills

python
shell scripting
automatic speech recognition
deep learning
machine learning
pytorch
tensorflow
librosa