Machine Learning Infrastructure Software Engineer, Dojo

Salary

$0.132k - $0.3k

Min Experience

0 years

Location

Palo Alto, California

Job Type

full-time

About the job

About the role

As a Machine Learning Infrastructure Software Engineer on Tesla's Dojo team, you will build the infrastructure used to train neural networks on our custom-built supercomputer. In this role, you may work across software, hardware, and machine learning teams to solve challenges related to performance, scalability, and reliability. You will have the opportunity to contribute to Tesla's cutting-edge Dojo Supercomputer, which will have a significant impact on the future of autonomous driving, Optimus, and real-world AI.

What You'll Do

Collaborate with machine learning researchers and engineers to optimize training workloads on Tesla's Dojo system
Work on a variety of tasks, from improving the performance of neural network training to optimizing hardware-software interactions
Help identify and solve bottlenecks across distributed systems to ensure efficient training and faster model convergence
Contribute to the development and optimization of training software, ensuring smooth operation of the system and integration with Tesla's broader infrastructure
Support the reliability and performance of the Dojo system, including monitoring, troubleshooting, and making improvements where needed
Collaborate with cross-functional teams to ensure that training workflows run efficiently, from data management to system-level optimizations

What You'll Bring

Degree in Engineering or Computer Science, or equivalent experience and evidence of exceptional ability
Strong proficiency in C++ and/or Python programming
Experience or interest in distributed systems, parallel programming, and hardware/software optimization
A willingness to work across different technical areas, from deep learning frameworks to hardware systems
Strong communication skills and an ability to work in a fast-paced, collaborative environment
An eagerness to learn and tackle new technical challenges in AI and machine learning systems

Skills

c++
python
distributed systems
parallel programming
hardware optimization
software optimization