Innodata Inc.
Website:
innodata.com
Job details:
Role- Senior Data Scientist
Experience- 8 years+
Work mode- Remote
Role Overview
We are seeking an experienced Senior Data Scientist with deep expertise in Generative AI and Large Language Models (LLMs). The ideal candidate will have hands-on experience in building and deploying solutions using Retrieval-Augmented Generation (RAG), embeddings, and model fine-tuning techniques to solve complex business problems.
Key Responsibilities
- Design, develop, and deploy scalable AI/ML solutions using LLMs
- Build and optimize Retrieval-Augmented Generation (RAG) pipelines
- Develop and manage embedding-based retrieval systems for semantic search and knowledge augmentation
- Fine-tune large language models using domain-specific datasets
- Work with vector databases (e.g., FAISS, Pinecone, Weaviate) for efficient information retrieval
- Collaborate with cross-functional teams to translate business requirements into AI solutions
- Evaluate model performance and continuously improve accuracy, latency, and scalability
- Ensure best practices in data handling, model governance, and deployment
- Stay up to date with advancements in Generative AI and NLP
Required Skills & Qualifications
- Bachelor’s/Master’s/PhD in Computer Science, Data Science, or a related field
- 8+ years of experience in Data Science / Machine Learning
- Strong experience with Python and ML frameworks (PyTorch, TensorFlow)
- Hands-on experience with:
- RAG (Retrieval-Augmented Generation) architectures
- Embeddings (OpenAI, Sentence Transformers, etc.)
- LLM fine-tuning (LoRA, PEFT, RLHF is a plus)
- Experience with LLM orchestration frameworks (LangChain, LlamaIndex, etc.)
- Familiarity with vector databases and semantic search
- Strong understanding of NLP concepts and transformer architectures
- Experience deploying models in cloud environments (AWS, GCP, or Azure)
- Solid understanding of APIs, microservices, and MLOps practices
Click on Apply to know more.