AI Safety & Alignment
Research Scientist
Remote, Global, London, UK, San Francisco Bay Area
$100,000 - $250,000
Research Scientist
Principles of Intelligence · Added today
Applications are handled by the employer on an external website. AI Safety Careers does not process applications directly.
AI Safety & Alignment
Remote, Global, London, UK, San Francisco Bay Area
$100,000 - $250,000
In this role, you'll advance mechanistic interpretability by developing data structure models and synthetic datasets to benchmark AI interpretability tools.
Develop tractable, scale-aware data structure models grounded in physics theory.
Quantify how features learned by AI systems relate to underlying data structures.
Create synthetic datasets to benchmark and improve interpretability research tools.
Conduct interdisciplinary research projects bridging physics and AI interpretability.
Principles of Intelligence aims to diversify the AI safety research portfolio by targeting a diverse set of scientific domains with the potential to address key bottlenecks in AI safety.
This listing may be aggregated from a public source or submitted by a third party. If you represent this employer and would like to update or remove this listing, contact support@aisafetycareers.com.