About the role
We are seeking a Machine Learning Engineer specializing in post-training optimization to refine and enhance large language models (LLMs) and AI companions. This role focuses on alignment tuning, preference optimization, reinforcement learning from human feedback (RLHF), and continual learning to create AI systems that are more conversational, emotionally aware, and adaptive to user interactions. You will work on making AI more intelligent, personalized, and capable of remembering and reasoning over long-term interactions, while keeping it aligned with human values and suited to real-world applications.
About the company
Sesame believes in a future where computers are lifelike, with the ability to see, hear, and collaborate with us in ways that feel natural and human. With this vision, we're designing a new kind of computer, focused on making voice companions part of our daily lives. Our team brings together founders from Oculus and Ubiquity6, alongside proven leaders from Meta, Google, and Apple, with deep expertise spanning hardware and software. Join us in shaping a future where computers truly come alive.