About the role
Together AI is seeking exceptional Research Interns to join our research team for Summer 2025. You will work on cutting-edge research in foundation model architectures and efficiency, contributing to our mission of advancing open and transparent AI systems.
Research Areas
As a Research Intern, you will work on one or more of the following areas:
Novel model architectures and architectural adaptations for foundation models
Inference optimization algorithms and techniques (e.g., speculative decoding, quantization, sparsity, model compression, knowledge distillation)
High-performance kernel development and optimization
Advanced post-training optimization and finetuning methods
New techniques and systems for efficient training of neural networks (e.g., distributed training, algorithmic improvements, optimization methods)
Robust and reliable evaluation of foundation model capabilities
Reasoning strategies and inference-time compute techniques
Responsibilities
Research and implement novel techniques in one or more of our focus areas
Design and conduct rigorous experiments to validate hypotheses
Document findings in scientific publications and blog posts
Integrate research results into Together AI products
Communicate project plans, progress, and results to the broader team
Requirements
Currently pursuing a Bachelor's, Master's, or Ph.D. degree in Computer Science, Electrical Engineering, or a related field
Strong knowledge of Machine Learning and Deep Learning fundamentals
Experience with deep learning frameworks (PyTorch, JAX, etc.)
Strong programming skills in Python
Strong programming skills in C++ (for kernel development)
Familiarity with Transformer architectures and recent developments in foundation models
Preferred Qualifications
Prior research experience with foundation models or efficient machine learning
Publications at leading ML conferences (such as NeurIPS, ICML, or ICLR)
Experience with CUDA programming (for kernel development)
Understanding of model optimization techniques and hardware acceleration approaches
Contributions to open-source machine learning projects
Internship Details
Duration: ~12 weeks (Summer 2025)
Location: San Francisco, Amsterdam, and London
About the company
Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancements such as FlashAttention, Mamba, FlexGen, Petals, Mixture of Agents, and RedPajama.