Product Policy Manager, Frontier Risk

Remote-Friendly (Travel-Required) | San Francisco, CA | Washington, DC | Full-Time | Manager | Other

About Anthropic

  • Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

About the role

  • As the Product Policy Manager for Product Risk, you will play a crucial role in setting and executing the Safeguards team’s approach to assessing product launches for safety risks, and in working with cross-functional stakeholders to drive appropriate safety mitigations. You'll work closely with teams within and outside of Safeguards, including product, legal, public policy, and engineering, to understand upcoming features, anticipate potential misuses or unintended consequences, and craft policies that balance innovation with responsibility. Your work will be essential in maintaining Anthropic's commitment to safe and beneficial AI as we continue to expand our product capabilities.

Responsibilities

  • Develop and maintain risk assessment frameworks to identify and evaluate potential safety risks associated with new product features and functionality
  • Conduct comprehensive product safety reviews, covering technical and non-technical harms, to inform product launch and safety mitigation strategies
  • Collaborate closely with a variety of stakeholders including product and engineering teams and the broader Safeguards team to leverage deep policy, enforcement, and engineering expertise
  • Analyze the potential for misuse, unintended consequences, and harmful outputs of new model and product capabilities
  • Leverage subject-matter expert (SME) risk assessments to inform overall product safety recommendations
  • Design and run bespoke evaluations for products that require tailored assessment
  • Craft policy recommendations that strike a balance between enabling innovation and ensuring responsible AI deployment
  • Work with the Safeguards enforcement team to develop clear guidelines for implementing new policies related to product features
  • Lead agentic product policy development, incorporating internal and external feedback
  • Stay current on industry trends and emerging risks in AI development to proactively address potential issues
  • Contribute to regular reports on product policy risks and mitigations for senior leadership

You might thrive in this role if you

  • Have a strong technical background while also feeling equally comfortable explaining highly technical concepts to non-technical stakeholders
  • Understand safety policies and safety considerations associated with a wide range of product surfaces
  • Are adept at prioritizing where, and how, to focus resources for safety mitigations in a high-volume launch process
  • Have conducted risk evaluations of novel products in fast-moving organizations
  • Have demonstrated expertise collaborating with product and engineering teams to integrate safety considerations into product development
  • Have familiarity with AI ethics, responsible AI principles, and current debates surrounding AI safety and governance
  • Have the ability to think creatively about potential misuses of technology and develop innovative solutions to mitigate risks
  • Have shown strong project management skills with the ability to drive policy development processes from ideation to implementation

Compensation

  • The annual compensation range for this role is listed below.
  • For sales roles, the range provided is the role’s On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role.

How we're different

  • We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills.
  • The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.

Job Summary

Company: Anthropic
Location: Remote-Friendly (Travel-Required) | San Francisco, CA | Washington, DC
Type: Full-Time
Level: Manager
Domain: Other