Benjamin Mann

Co-founder

Anthropic

⚡ Execution (2)📈 Growth & Metrics (1)🚀 Career & Leadership (1)

Methodologies(4)

by Benjamin Mann

⚡ Execution

A method for aligning AI models by training them to follow a set of natural language principles (a 'Constitution') using AI feedback (RLAIF), rather than relying solely on human contractors.

Core Principles

1.Define Principles: Establish a constitution of values (e.g., helpful, harmless, honest, human rights).
2.Generate & Critique: The model generates a response, then critiques itself based on the constitution.
3.Recursive Revision: If the response violates principles, the model rewrites it.
+1 more...

"First we figure out which ones might apply... then we ask the model itself to critique itself and rewrite its own response in light of the principle."

#constitutional#execution#process

View Deep Dive →

Responsible Scaling Policy (ASL Framework)

by Benjamin Mann

⚡ Execution

A framework analogous to Biosafety Levels (BSL) that defines specific AI Safety Levels (ASL) based on model capabilities and potential risk, mandating stricter containment measures as intelligence increases.

Core Principles

1.Define Levels: Establish capability thresholds (ASL-1 to ASL-5).
2.Assess Capability: Test models in laboratory settings for dangerous abilities (e.g., bio-weapons).
3.Implement Safeguards: Apply security and deployment restrictions corresponding to the level.
+1 more...

"ASL-3 is maybe a little bit risk of harm... ASL-4 starts to get to significant loss of human life... ASL-5 is potentially extinction level."

#responsible#scaling#policy

View Deep Dive →

The Economic Turing Test

by Benjamin Mann

📈 Growth & Metrics

A metric to define Transformative AI or AGI based on an agent's ability to autonomously perform economically valuable work indistinguishable from a human.

Core Principles

1.Define Basket of Jobs: Select a representative set of money-weighted roles.
2.Contract Agent: Hire the AI agent for a set period (e.g., 1-3 months).
3.Evaluate Performance: Determine if the agent performed the role as well as a human hire.
+1 more...

"If you contract an agent for a month... and it turns out to be a machine rather than a person, then it's passed the Economic Turing Test for that role."

#economic#turing#growth

View Deep Dive →

Resting in Motion

by Benjamin Mann

🚀 Career & Leadership

A mental framework for sustainable high-performance, recognizing that the 'default state' of humanity is activity, not leisure. It encourages maintaining composure and effectiveness amidst constant pressure.

Core Principles

1.Reject Passive Default: Accept that having problems to solve is the natural state.
2.Sustainable Pace: Treat work as a marathon, not a sprint.
3.Community Support: Surround yourself with mission-aligned peers.
+1 more...

"Some people think that the default state is rest, but actually that was never in the state of evolutionary adaptation... the busy state is the normal state."

#resting#motion#career

View Deep Dive →