AI Safety & Alignment
Researcher, Pretraining Safety
San Francisco Bay Area
$310,000 - $460,000
- In this role, you'll build safer AI models and enable earlier safety evaluation during training.
- Develop techniques to identify and evaluate unsafe behavior in early-stage models.