Nucleus AI
Website:
withnucleus.ai
Job details:
Frontier research moves faster when the systems behind it make ambitious experimentation easier, not harder.
We’re hiring a Software Engineer, Research Data Infrastructure to build the data systems and tooling researchers use for large-scale experiments, evaluations, and data pipelines at Nucleus. This role is focused on enabling research teams with reliable, flexible, and high-throughput infrastructure for working with datasets, experiment outputs, and measurement pipelines.
You’ll sit close to the research loop—building systems that help teams prepare datasets, run large jobs, analyze results, and iterate with confidence. This is a highly collaborative role for someone who enjoys turning research needs into durable engineering systems.
In this role, you will
- Build and maintain data infrastructure that supports large-scale research experiments, dataset generation, evaluations, and analysis workflows.
- Create tools and services that help researchers manage inputs, outputs, metadata, lineage, and reproducibility across experiments.
- Improve the reliability and usability of pipelines for data preparation, experiment execution, and results aggregation.
- Design systems that support rapid iteration while preserving correctness, traceability, and operational visibility.
- Work closely with researchers to understand evolving workflow needs and translate them into scalable platform capabilities.
- Support high-volume experimental workloads involving multimodal data, model outputs, and evaluation artifacts.
- Develop interfaces, automation, and internal tooling that reduce friction for research teams and shorten iteration cycles.
You may be a good fit if you
- Have strong software engineering experience in data systems, backend infrastructure, platform engineering, or research tooling.
- Have worked on systems for experimentation, evaluations, large-scale pipelines, or internal data platforms.
- Are comfortable operating in environments where requirements evolve quickly and close collaboration with technical users matters.
- Write clear, maintainable code and think carefully about reliability, observability, and developer ergonomics.
- Are proficient with Python and at least one additional language commonly used in backend or infrastructure engineering.
- Enjoy supporting researchers by building tools that feel fast, robust, and thoughtfully designed.
What makes Nucleus different
Nucleus brings research and engineering into close conversation. In this role, you’ll help define the infrastructure layer that turns exploratory work into repeatable progress. The systems you build will shape how researchers test ideas, measure outcomes, and push the boundaries of what our models and agents can do.
- If you’re motivated by building data infrastructure that directly amplifies research, we’d be excited to meet you.
Click on Apply to know more.