Product Policy Manager, Frontier Risk

Remote-Friendly (Travel-Required) | San Francisco, CA | Washington, DCFull-TimeManagerOther

You will be redirected to the company career page

Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

As the Product Policy Manager for Product Risk, you will play a crucial role in setting and executing the Safeguards team’s approach to assessing product launches for safety risks and working with cross functional stakeholders to drive appropriate safety mitigations. You'll work closely with a suite of cross-functional teams, within and outside of Safeguards, including, product, legal, public policy, and engineering teams to understand upcoming features, anticipate potential misuses or unintended consequences, and craft policies that balance innovation with responsibility. Your work will be essential in maintaining Anthropic's commitment to safe and beneficial AI as we continue to expand our product capabilities.

Develop and maintain risk assessment frameworks to identify and evaluate potential safety risks associated with new product features and functionality
Conduct comprehensive product safety reviews, covering technical and non technical harms, to inform product launch and safety mitigation strategies
Collaborate closely with a variety of stakeholders including product and engineering teams and the broader Safeguards team to leverage deep policy, enforcement, and engineering expertise
Analyze the potential for misuse, unintended consequences, and harmful outputs of new model and product capabilities
Leverage SME risk assessments to inform overall product safety recommendations
Design and run bespoke evaluations for products that require tailored assessment
Craft policy recommendations that strike a balance between enabling innovation and ensuring responsible AI deployment
Work with the Safeguards enforcement team to develop clear guidelines for implementing new policies related to product features
Lead agentic product policy development, incorporating internal and external feedback
Stay current on industry trends and emerging risks in AI development to proactively address potential issues
Contribute to regular reports on product policy risks and mitigations for senior leadership

Have a strong technical background while also feeling equally comfortable explaining highly technical concepts to non-technical stakeholders
Understand safety policies and safety considerations associated with a wide range of product surfaces
Are adept at prioritizing where, and how, to focus resources for safety mitigations in a high volume launch process
Have conducted risk evaluations of novel products in fast moving organizations
Demonstrated expertise collaborating with product and engineering teams to integrate safety considerations into product development
Have familiarity with AI ethics, responsible AI principles, and current debates surrounding AI safety and governance
Have the ability to think creatively about potential misuses of technology and develop innovative solutions to mitigate risks
Have shown strong project management skills with the ability to drive policy development processes from ideation to implementation
The annual compensation range for this role is listed below.
For sales roles, the range provided is the role’s On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role.

We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills.
The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.

CompanyAnthropic

LocationRemote-Friendly (Travel-Required) | San Francisco, CA | Washington, DC

TypeFull-Time

LevelManager

DomainOther

Similar roles you might like