Software Engineer, Encoding Libraries

San Francisco, CA | New York City, NYFull-TimeMid-levelSoftware Engineering

You will be redirected to the company career page

Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

The Encodings Infrastructure team maintains the libraries that engineers and researchers across Anthropic use to encode text and multimodal data into a form that Claude can consume. As a Software Engineer on this team, you'll own the design and maintenance of these libraries — keeping their APIs intuitive, their performance sharp, and their abstractions solid enough that most of the org never has to think about encoding at all. You’ll have the satisfaction of knowing that your work enabled Claude to learn new ways of understanding the world.
This role is unusually broad: your work will touch systems across the codebase, from pretraining to finetuning to the API, and you'll collaborate closely with both researchers and engineers to make sure new encoding ideas can move quickly from experiment to production.

Maintain and improve the encoding libraries used by engineers and researchers across Anthropic, with a focus on clean, user-friendly APIs
Design data structures and abstractions that shield most of the organization from the details of how encoded data works while enabling “power users”
Adapt the encoding libraries to support new research directions as they emerge, and make sure that we can ship these research ideas to production
Optimize encoding performance across the systems that depend on these libraries
Work across Anthropic's codebase to improve how encoded and unencoded data is handled
Maintain and improve the encoding libraries used by engineers and researchers across Anthropic, with a focus on clean, user-friendly APIs
Design data structures and abstractions that shield most of the organization from the details of how encoded data works while enabling “power users”
Adapt the encoding libraries to support new research directions as they emerge, and make sure that we can ship these research ideas to production
Optimize encoding performance across the systems that depend on these libraries
Work across Anthropic's codebase to improve how encoded and unencoded data is handled

Have 5+ years of software engineering experience, with meaningful time spent maintaining libraries, SDKs, or developer-facing APIs
Have familiarity with ML terminology and LLM architecture — you don't need to be an ML expert, but enough understanding to work effectively alongside researchers
Have experience carrying out complex refactors in large codebases
Have strong communication skills and enjoy working closely with researchers and engineers to understand what they need
Are results-oriented, with a bias towards flexibility and impact
Pick up slack, even if it goes outside your job description
Care about the societal impacts of your work
Have 5+ years of software engineering experience, with meaningful time spent maintaining libraries, SDKs, or developer-facing APIs
Have familiarity with ML terminology and LLM architecture — you don't need to be an ML expert, but enough understanding to work effectively alongside researchers
Have experience carrying out complex refactors in large codebases
Have strong communication skills and enjoy working closely with researchers and engineers to understand what they need
Are results-oriented, with a bias towards flexibility and impact
Pick up slack, even if it goes outside your job description
Care about the societal impacts of your work

Redesigning a core encoding abstraction so that downstream teams can adopt new encoding schemes without changing their code
Working with a research team to add support for a new multimodal data type in the encoding libraries
Auditing how encoded data flows through the finetuning and serving stacks and cleaning up inconsistencies
Redesigning a core encoding abstraction so that downstream teams can adopt new encoding schemes without changing their code
Working with a research team to add support for a new multimodal data type in the encoding libraries
Auditing how encoded data flows through the finetuning and serving stacks and cleaning up inconsistencies
The annual compensation range for this role is listed below.
For sales roles, the range provided is the role’s On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role.

We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills.
The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.

CompanyAnthropic

LocationSan Francisco, CA | New York City, NY

TypeFull-Time

LevelMid-level

DomainSoftware Engineering

Similar roles you might like