About the role
NVIDIA seeks a Python Software Engineering Intern to accelerate data engineering for Large Language Models (LLMs). The intern will develop and optimize Python-based data processing frameworks for GPU-accelerated environments, contributing to RAPIDS and other GPU-accelerated libraries. Responsibilities include designing and implementing components for Retrieval Augmented Generation (RAG) pipelines, benchmarking algorithms, and collaborating with LLM and ML researchers. The ideal candidate has strong Python skills, familiarity with LLMs and RAG pipelines, experience with the PyData and ML/DL ecosystems, and a passion for optimization and iterative development. The internship involves working with large datasets, optimizing for speed and cost, and improving system accuracy.
About the company
Today, NVIDIA is tapping into the unlimited potential of AI to define the next era of computing, an era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent. As an NVIDIAN, you'll be immersed in a diverse, encouraging environment where everyone is inspired to do their best work. Come join the team and see how we can make a lasting impact on the world.