His post on RL and inference compute scaling for frontier AI models is the central focus of analysis and critique.
How media typically covers Toby Ord
Toby Ord as author
The shift from pre-training to reinforcement learning for frontier models reduces information efficiency by a factor of roughly 1,000 to 1,000,000, potentially constraining future scaling progress even as it unlocks new reasoning capabilities.
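A rough back-of-the-envelope sketch of where a gap of that magnitude could come from, under assumptions that are illustrative only (the bits-per-token and episode-length figures below are not taken from the post): pre-training extracts learning signal from every token, while outcome-based RL yields at most about one bit of signal per full episode.

```python
# Illustrative arithmetic, not the post's exact numbers.
# Assumption: pre-training yields ~1 bit of signal per token.
# Assumption: outcome-based RL yields ~1 bit per whole episode
# (a single binary reward at the end).
pretrain_bits_per_token = 1.0
tokens_per_episode = 10_000   # assumed length of a reasoning trace
rl_bits_per_episode = 1.0

pretrain_bits = pretrain_bits_per_token * tokens_per_episode
ratio = pretrain_bits / rl_bits_per_episode
print(f"information-efficiency gap: ~{ratio:,.0f}x")
```

With these assumed values the gap is ~10,000x, which sits inside the 1,000 to 1,000,000x range quoted above; longer episodes or richer per-token signal push the ratio toward either end of that band.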
Author of “The Extreme Inefficiency of RL for Frontier Models”