Researcher

Brynjolfsson

Researcher at Unknown

Work cited showing most jobs consist of multiple tasks where AI can only match human performance on some

Mentions

Articles

Outlets

Associated Topics

Companies Covered

Coverage Patterns

How media typically covers Brynjolfsson

Article Types

news1

Media Angles

technical1

Narrative Framing

trend1

Associated AI Models

Claude Opus 4.11Gemini 2.5 Pro1Grok 41GPT-5 Thinking1GPT-4o1

Articles

Most recent first

Cited In

Research or work cited

AI Models Are Already as Good as Experts at Half of Tasks, New OpenAI Benchmark GDPval Suggests

FortunenewsneutralOct 7, 2025

OpenAI's GDPval benchmark shows Claude Opus 4.1 achieves expert-level performance on 47.6% of real-world professional tasks across 44 different professions, while performance varies significantly across models with GPT-4o performing worst at only 10%.

“Work cited showing most jobs consist of multiple tasks where AI can only match human performance on some”

Foundation ModelsAI BenchmarkingAI ResearchLLMs