Data Scientist – Generative AI

Min Experience

2 years

Location

India, MH, Pune

Job Type

Regular

About the job

This job is sourced from a job board.

About the role

As a Data Scientist – Generative AI, you will develop advanced solutions in Generative AI, Machine Learning, and Data Science, integrating Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), AI Agentic Systems, and other cutting-edge technologies. You will work closely with Data Engineering, DevOps, and MLOps teams to design, develop, and optimize AI solutions applied to software development challenges.

Role Description & Responsibilities:
Model Optimization: Experiment with and adapt Large Language Models (LLMs) for domain-specific tasks, ensuring performance improvements.
RAG Integration: Integrate Retrieval-Augmented Generation (RAG) techniques to combine information retrieval and generation, improving the relevance and accuracy of model responses.
Prompt Engineering: Advance prompt engineering methodologies and context augmentation so that generated results align with objectives and are optimized for specific use cases.
Data Pipelines: Implement and industrialize data preparation and ingestion pipelines for efficient context augmentation, ensuring high-quality and scalable data workflows.
Model Evaluation: Evaluate LLMs for coherence and accuracy, assess embedding models for semantic relevance, and validate reranking models for ranking quality.
Vector Databases & Embedding Management: Leverage vector databases (e.g., FAISS, ChromaDB …) to optimize data embedding and improve information retrieval for Generative AI solutions.
Scalability & Performance: Analyze the performance, scalability, latency, and efficiency of models deployed in production environments.
AI Research & Trends: Research emerging advancements in Generative AI, RAG, and AI Agentic Systems, and apply the findings to enhance AI-driven solutions.

Qualifications:
Programming Languages: Proficient in Python (including libraries such as transformers, fastapi, openai, langchain, etc.).
Generative AI & LLMs: Experience with fine-tuning models, RAG frameworks, AI agentic development, and advanced prompt engineering for AI systems.
Vector Databases: Experience working with FAISS, ChromaDB, etc. for efficient embedding and retrieval.
Machine Learning: Expertise in transformers, embeddings, and reranking for optimizing AI solutions.
Backend & AI APIs: Knowledge of FastAPI and REST API development for AI integration.
Scalability & Performance: Experience with applications or tools for load testing.
Data Engineering: Proficiency with SQL and Python for data pipeline development and management.
Education: Bachelor's / Master's degree or higher in Engineering, AI, Data Science, Computer Science, or a related field.
Experience: 2 years of experience in Data Science with significant expertise in Generative AI, Applied Machine Learning, and AI Model Optimization.

About the company

Dassault Systèmes is a catalyst for human progress. We provide businesses and people with collaborative virtual environments to imagine sustainable innovations. By creating virtual twin experiences of the real world with our 3DEXPERIENCE platform and applications, we bring value to more than 350,000 customers of all sizes, in all industries, in more than 150 countries. Join our global community of more than 23,800 passionate individuals!

Skills

python
transformers
fastapi
openai
langchain
generative ai
llms
rag
ai agentic
prompt engineering
vector databases
faiss
chromadb
machine learning
embeddings
reranking
rest api
load testing
sql
data engineering