Intern, LLMs and RAG

Location

Waltham, Massachusetts, United States

Job Type

Internship

About the job

This job is sourced from a job board.

Overview

About the role

AI/Machine Learning Research Intern (RAG & LLMs)

Location: Waltham, MA (On-site only)

Duration: 3 months minimum (can start as early as January)

Commitment: Approximately 40 hours per week

Compensation: Unpaid (Academic Credit Eligible)

Role Overview

We are seeking a highly motivated AI/ML Intern with a focus on Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG). You will be working at the frontier of generative AI, helping us explore, build and optimize systems that bridge the gap between static models and dynamic, private data.

This role is ideal for a student looking to apply theoretical knowledge to a production-grade environment and build a significant portfolio piece.

Key Responsibilities

Architect & Optimize RAG Pipelines: Improve retrieval accuracy using vector databases (e.g., Pinecone, Milvus, or Weaviate).

LLM Implementation: Experiment with and fine-tune prompts and workflows using models like GPT-4, Claude, or Llama 3.

Data Engineering: Assist in the cleaning, chunking, and embedding of proprietary datasets.

Evaluation: Implement benchmarking frameworks to measure hallucinations, faithfulness, and relevancy of model outputs.

Qualifications

Academic Standing: Graduate student (Master's/PhD) in Computer Science, Data Science, or a related technical field.

GPA: 3.5 or higher preferred.

Technical Skills: Proficiency in Python and experience with AI frameworks (e.g., LangChain, LlamaIndex, PyTorch, or TensorFlow).

Domain Knowledge: A solid understanding of transformer architectures and the mechanics of RAG.

Soft Skills: A research-oriented mindset with the ability to troubleshoot complex, non-deterministic systems.

What You Will Gain

Mentorship: Direct access to several AI chief architects and weekly 1-on-1 growth sessions.

Portfolio Impact: Significant contribution to a live AI project that you can showcase to future employers.

Flexibility: We respect your academic schedule and offer flexible working hours.

Future Opportunities: Top performers will be prioritized for future full-time, paid openings.

About the company

Ottometric provides analytics and enhanced capabilities that help the automotive supply chain understand challenges in the advanced driver-assistance system (ADAS) features delivered in modern vehicles. As vehicles add more sensors and systems, the complexity and interdependency of data fusion make validation ever more complex, time-consuming, and expensive. The Ottometric solution provides simplified data management and visualization for this deluge of validation data, and our proprietary artificial intelligence (AI) and computer vision automate validation data review. The result is significant cost reduction and higher accuracy than manual review of data using in-house tools or unscalable off-the-shelf software. With more extensive and rapid data analysis, long-tail problems can be better understood, further improving the ADAS features delivered to the market.

Skills

Python

LangChain

LlamaIndex

PyTorch

TensorFlow