Report

Python Software Engineering Intern, Accelerated LLM Data Applications - Fall 2025

Min Experience

0 years

Location

Santa Clara, California, United States

JobType

internship

Overview

About the role

NVIDIA seeks a Python Software Engineering Intern to accelerate data engineering for Large Language Models (LLMs). The intern will develop and optimize Python-based data processing frameworks for GPU-accelerated environments, contributing to RAPIDS and other GPU-accelerated libraries. Responsibilities include designing and implementing components for Retrieval Augmented Generation (RAG) pipelines, benchmarking algorithms, and collaborating with LLM & ML researchers. The ideal candidate possesses strong Python skills, familiarity with LLMs and RAG pipelines, experience with PyData and ML/DL ecosystems, and a passion for optimization and iterative development. The internship involves working with large datasets, optimizing for speed and cost, and improving system accuracy through various techniques.

About the company

Today, NVIDIA is tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent. As an NVIDIAN, you'll be immersed in a diverse, encouraging environment where everyone is inspired to do their best work. Come join the team and see how we can make a lasting impact on the world.

Skills

Python

GitHub

Docker

Algorithms

GitHub Actions

Kubernetes

Spark

pandas

PyTorch

SQL

Scikit-Learn

NumPy

Data Science

Foundation

CUDA