The AI Red Team Analyst conducts red-teaming exercises to identify AI security weaknesses, evaluates AI outputs for safety and compliance, and documents vulnerabilities.
504 active roles found
The AI Red Team Analyst conducts red-teaming exercises to identify AI security weaknesses, evaluates AI outputs for safety and compliance, and documents vulnerabilities.
The Visiting Fellowship involves working on an AI safety research project at Constellation, collaborating with various researchers.
The role involves probing AI models for vulnerabilities, analyzing their behavior under attack scenarios, and investigating system misuse to recommend mitigations.
The Research Lead will develop and lead a research agenda focused on reducing catastrophic risks from advanced AI, mentoring a team and sharing findings through publications and policy briefings.
A concise digest of alignment, governance, and AI risk jobs.
By subscribing, you agree to receive the AI Safety Careers newsletter. You can unsubscribe at any time. See our Privacy Policy.
The AI Policy Fellow will advise on California's AI policy and governance, focusing on AI safety, transparency, and risk management.
The Distributed Organizing Manager will build and support a nationwide volunteer program focused on grassroots action against the risks of unchecked AI.
Develop machine learning-based prototypes and systems for AI security, focusing on red teaming and adversarial machine learning.
Develop machine learning-based prototypes and tools to address AI security challenges, focusing on red teaming and adversarial machine learning.
Coordinate the production and delivery of AI Safety Connect's conference program in Shanghai, focusing on AI safety engagement with stakeholders.
Teach technical AI safety to legal professionals during a 5-day bootcamp, focusing on curriculum development and participant mentoring.
A six-day programme focused on building hardware assurance mechanisms for AI chip operations, including technical talks and practical projects.
The Research Engineer will develop open-source verification tools and infrastructure for AI-assisted formal verification to ensure bug-free code.
A program to develop AI safety ideas into fundable organizations through expert consultation and feedback.
The Policy Analyst will develop policy analysis and frameworks for AI policy advocacy, focusing on economics, labor, education, and sectoral transformation.
The role involves leading initiatives to address catastrophic risks in AI security, focusing on research and team building for impactful projects.
Research role focused on AI governance, geopolitical dimensions, and policy implications.
The Director of Evaluations will lead a team focused on assessing AI capability and safety, including red-teaming and designing benchmarks.
The Research Fellow position at the University of California's Center for Human-Compatible Artificial Intelligence focuses on advancing beneficial AI through novel research.
This role involves executing operational functions across various domains to support Constellation's mission in AI safety.
The role involves building AI tooling and infrastructure for hackathons and fellowship programs, focusing on automations and solutions to improve event operations.