AI Safety Analyst | Full Time | Singapore or Palo Alto

TrustLab

Location: Palo Alto

Required skills

Python
data science
end-to-end
SQL

About the role

**About TrustLab**

Online misinformation, hate speech, child endangerment, and extreme violence are some of the world's most critical and complex problems. TrustLab is a fast-growing, VC-backed startup founded by ex-Google, TikTok and Reddit executives determined to use software engineering, ML, and data science to tackle these challenges and make the internet healthier and safer for everyone. If you’re interested in working with the world’s largest social media companies and online platforms, and in building technologies to mitigate these issues, you’ve come to the right place.

**About the Role**

As an AI Safety Analyst, you will engage with the full spectrum of AI safety policy issues and play an integral role in building deep expertise within the team. You will work directly on solving complex real-world trust & safety and fraud issues. Your work will be critical to the design and development of our AI safety product and service offerings. Day-to-day work may encompass anything from helping to shape strategic initiatives to technical/policy research, risk evaluations, and investigations. You will also get to work on adversarial and red-teaming opportunities to protect real users and improve AI security. This role can be performed remotely from anywhere in Singapore or Palo Alto.

**Responsibilities**

+ Develop deep subject matter expertise in the role of AI safety in cybersecurity risks
+ Develop responsible AI red-teaming methodologies to discover and exploit Responsible AI vulnerabilities end-to-end and assess the safety of systems
+ Develop a framework for testing and benchmarking the safety of AI models
+ Play a role in building and improving Gen AI fraud and risk detection capabilities
+ Monitor the policy landscape to identify relevant questions and emerging policy areas in which to build our expertise
+ Keep up to date with new and existing AI policy norms and standards, particularly those related to cybersecurity, and use these to inform our decision-making on policy areas

**Minimum qualifications**

+ Bachelor's degree or equivalent practical experience
+ 3+ years' track record in trust & safety, risk evaluations, fraud investigations, or technical/data analysis
+ Experience and familiarity with AI or a demonstrated interest in AI policy issues
+ Experience in data analysis or data science: identifying trends and drawing actionable insights
+ Deep practical familiarity with how AI technology contributes to online risks and threats
+ Experience working on topics such as AI risk assessment, model safety, and prompting
+ An active interest in staying up to date with emerging research and industry developments
+ Passion for using AI to create safe and beneficial products

**Preferred skills**

+ Experience and familiarity with AI or a demonstrated interest in AI policy issues and research
+ Strong familiarity with existing GenAI / LLM / ML standards, with prior experience exploring, testing, and evaluating language model behavior
+ Experience in benchmarking Generative AI issues and quantifying improvements
+ Experience with SQL and a programming language (e.g., Python or R)

**What we offer**

+ Competitive compensation at a rapidly growing Series A, VC-backed startup
+ Remote-first, with the ability to work from home or co-locate with our Singapore or Palo Alto teams
+ Influence new product direction from idea to commercialization
+ Help develop critical tech to solve one of the 21st century’s trickiest societal problems

About TrustLab

TrustLab provides cutting-edge software and metrics to the world's largest social media platforms, online marketplaces, and apps, enabling them to protect their users against misinformation, hate speech, identity fraud, and other harmful content. Our customers range from large enterprises with complex Trust & Safety needs to small companies building out their internal policies and teams. With a founding team that has over 40 years of collective Trust & Safety experience at companies like Google, YouTube, Reddit and TikTok, TrustLab is the trusted third-party solution for detecting and mitigating critical safety threats on the internet.
