Research Scientist

Principles of Intelligence · Added today

Applications are handled by the employer on an external website. AI Safety Careers does not process applications directly.

Back to roles

AI Safety & Alignment

Research Scientist

Added todayPrinciples of IntelligenceRemote, Global, London, UK, San Francisco Bay AreaRemote$100K-$250K / year

Remote, Global, London, UK, San Francisco Bay Area

$100,000 - $250,000

In this role, you'll advance mechanistic interpretability by developing data structure models and synthetic datasets to benchmark AI interpretability tools.

Develop tractable, scale-aware data structure models grounded in physics theory.

Quantify how features learned by AI systems relate to underlying data structures.

Create synthetic datasets to benchmark and improve interpretability research tools.

Conduct interdisciplinary research projects bridging physics and AI interpretability.

Principles of Intelligence aims to diversify the AI safety research portfolio by targeting a diverse set of scientific domains with the potential to address key bottlenecks in AI safety.

This listing may be aggregated from a public source or submitted by a third party. If you represent this employer and would like to update or remove this listing, contact support@aisafetycareers.com.

View all jobs from Principles of Intelligence