Report

SEAL Research Scientist, Frontier Risk Evaluations

Salary

$176k - $300k

Min Experience

3 years

Location

San Francisco, CA

JobType

full-time

About the job

Info This job is sourced from a job board

Overview

About the role

As the leading data and evaluation partner for frontier AI companies, Scale plays an integral role in understanding the capabilities and safeguarding large language models (LLMs). Safety, Evaluations and Alignment Lab (SEAL) is Scale's frontier research effort dedicated to tackling the challenging research problems in evaluation, red teaming, and alignment of advanced AI systems. We are actively seeking talented researchers to join us in shaping the landscape for safety and transparency for the entire AI industry. We support collaborations across the industry and academia and the publication of our research findings. As a Research Scientist focused on Frontier Risk Evaluations, you will design and create evaluation measures, harnesses and datasets for measuring the risks posed by frontier AI systems. For example, you might do any or all of the following: Design and build harnesses to test AI agents for dangerous capabilities such as hacking or exploiting security vulnerabilities; Develop and run human-in-the-loop tests of AI capabilities to deceive, manipulate, blackmail, or otherwise engage in social engineering; Work with government agencies or other labs to collectively scope and design evaluations to measure and mitigate risks posed by advanced AI systems.

About the company

At Scale, we believe that the transition from traditional software to AI is one of the most important shifts of our time. Our mission is to make that happen faster across every industry, and our team is transforming how organizations build and deploy AI. Our products power the world's most advanced LLMs, generative models, and computer vision models. We are trusted by generative AI companies such as OpenAI, Meta, and Microsoft, government agencies like the U.S. Army and U.S. Air Force, and enterprises including GM and Accenture. We are expanding our team to accelerate the development of AI applications.

Skills

pytorch

jax

tensorflow

machine learning

generative ai