Flag job

Report

Say no to manually filling long application forms

Visit any careers page and a lightning button will pop up on any compatible page.
Use ChatGPT to auto-fill

Use AI to auto fill job forms

Use ChatGPT to customise your resume for every job that you apply to

Ask for Referral for any job post

DevOps Infrastructure Engineer

Salary

$0.15k - $0.275k

Min Experience

4 years

Location

San Jose

JobType

full-time

About the job

Info This job is sourced from a job board

About the role

Designing and writing software for new ASICs is hard, and requires a huge amount of software and tooling. It is even more challenging for model-specific ASICs, as it is important for them to hit the market at the right time, and thus moving fast is essential. You will drive adoption of cutting-edge tooling, to improve the speed and reliability of our toolchains. You will help us innovate to do better than the industry norm, by running massively parallel CI jobs, specifying and building our own fully-redundant SSD-only server infrastructure, and making sure these tools run automatically and reliability. You will work with an IT contracting firm to do the day-to-day maintenance and installation - while you must be knowledgeable enough about IT to work with this firm, most of your time will be spent designing new toolchains entirely The scope and title of this role can be modified for exceptional candidates. Representative projects: ● Spec out a server using a 6 GHz desktop CPU to speed up single-threaded workloads ● Decide if moving our servers to the cloud/a colo facility makes sense to improve uptime ● Set up networking infrastructure to allow Jupyter notebook users to connect to our servers, without waiting for them to be restarted. ● Parallelize our CI stack to run on dozens of different machines at once, designing a policy to avoid unnecessary CI failures if a machine goes down.

About the company

Etched is building AI chips that are hard-coded for individual model architectures. Our first product (Sohu) only supports transformers, but has an order of magnitude more throughput and lower latency than a B200. With Etched ASICs, you can build products that would be impossible with GPUs, like real-time video generation models and extremely deep chain-of-thought reasoning. We are a fully in-person team in West San Jose, and greatly value engineering skills. We do not have boundaries between engineering and research, and we expect all of our technical staff to contribute to both as needed.

Skills

linux
containerization
ci/cd
python
c++