AI Safety & Alignment
Researcher, Automated Red Teaming
San Francisco Bay Area
$295,000 - $445,000
In this role, you'll lead Automated Red Teaming, building scalable systems that discover AI model weaknesses and drive improvements.
OpenAI · Added 3 days ago
Applications are handled by the employer on an external website. AI Safety Careers does not process applications directly.
Own research and technical direction for automated red teaming across cyber, bio, and loss-of-control areas.
Partner with risk teams and stakeholders to define threat models, prioritize targets, and land mitigations.
Build automated systems to discover classifier jailbreaks, bio threats, and monitoring evasion probes.
Convert discovered attacks into training data, evaluations, and measurable robustness gains.
OpenAI is a frontier AI research and product company, with teams working on alignment, policy, and security. Our career review on working at a frontier AI company discusses concerns about doing harm in such roles, including concerns about OpenAI in particular.
Our Take On This Role: We have concerns about OpenAI's track record on safety and responsible development, and we recommend almost no roles at OpenAI. Nonetheless, it is possible that OpenAI will create AGI in the next decade, in which case safety and security work at the company could be extremely important. If you receive a job offer from OpenAI, consider contacting us for career advice.
This listing may be aggregated from a public source or submitted by a third party. If you represent this employer and would like to update or remove this listing, contact support@aisafetycareers.com.