Summer 2025 Research Intern

Min Experience

0 years

Location

San Francisco, Amsterdam, London

Job Type

internship

About the job

This job is sourced from a job board.

About the role

Together AI is seeking exceptional Research Interns to join our research team for Summer 2025. You will work on cutting-edge research in foundation model architectures and efficiency, contributing to our mission of advancing open and transparent AI systems.

Research Areas

As a Research Intern, you will work on one or more of the following areas:

Novel model architectures and architectural adaptations for foundation models
Inference optimization algorithms and techniques (e.g., speculative decoding, quantization, sparsity, model compression, knowledge distillation)
High-performance kernel development and optimization
Advanced post-training optimization and finetuning methods
New techniques and systems for efficient training of neural networks (e.g., distributed training, algorithmic improvements, optimization methods)
Robust and reliable evaluation of foundation model capabilities
Reasoning strategies and inference-time compute techniques

Responsibilities

Research and implement novel techniques in one or more of our focus areas
Design and conduct rigorous experiments to validate hypotheses
Document findings in scientific publications and blog posts
Integrate the research results into Together products
Communicate the plans, progress, and results of projects to the broader team

Requirements

Currently pursuing a Bachelor's, Master's, or Ph.D. degree in Computer Science, Electrical Engineering, or a related field
Strong knowledge of Machine Learning and Deep Learning fundamentals
Experience with deep learning frameworks (PyTorch, JAX, etc.)
Strong programming skills in Python
Strong programming skills in C++ (for kernel development)
Familiarity with Transformer architectures and recent developments in foundation models

Preferred Qualifications

Prior research experience with foundation models or efficient machine learning
Publications at leading ML conferences (such as NeurIPS, ICML, or ICLR)
Experience with CUDA programming (for kernel development)
Understanding of model optimization techniques and hardware acceleration approaches
Contributions to open-source machine learning projects

Internship Details

Duration: ~12 weeks (Summer 2025)
Location: San Francisco, Amsterdam, and London

About the company

Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancements such as FlashAttention, Mamba, FlexGen, Petals, Mixture of Agents, and RedPajama.

Skills

machine learning
deep learning
Python
C++
PyTorch
JAX