Researcher

Zvi Mowshowitz

@ZviMowshowitz

Writer referenced for the concept of 'exam-shaped questions' in context of Grok 4's benchmark overfit.

Mentions

Articles

Outlets

Sep 2025 — Feb 2026

Coverage

Neutral

Sentiment

Associated Topics

Companies Covered

Coverage Patterns

How media typically covers Zvi Mowshowitz

Article Types

analysis10
opinion3
news1

Media Angles

technical5
safety4
product2
business2
policy1

Narrative Framing

cautionary4
breakthrough4
competitive2
controversy2
trend1

Appears In

Sentiment Distribution

Based on 14 scored articles

Positive 29%Neutral 7%Mixed 43%Negative 21%

Associated AI Models

Claude Opus 4.55Claude Code4Claude Opus 4.62Gemini 32GPT-5.12Grok 41Grok 4 Heavy1Grok 31Claude Sonnet 41Claude Opus 41Grok1Claude Sonnet1GPT-5.3-Codex-Max1Claude Cowork1GPT1

Writes About

Dwarkesh PatelJournalist

Amanda AskellResearcher

1 article

Dario AmodeiExecutive

1 article

Bradly OlsenEngineer

1 article

Nabeel S. QureshiResearcher

1 article

Seb KrierResearcher

1 article

Ilya SutskeverResearcher

1 article

Olivia MooreAnalyst

1 article

Anton KorinekResearcher

1 article

Matt LevineAnalyst

1 article

Appears Alongside

Dwarkesh PatelJournalist

Articles

Most recent first

Articles Written

Zvi Mowshowitz as author

Citrini's Scenario Is A Great But Deeply Flawed Thought Experiment

analysismixedFeb 25, 2026

Citrini's viral AI scenario essay is a valuable thought experiment that explores important economic mechanisms but makes unrealistic assumptions about capability diffusion speed and government inaction that undermine its conclusions.

“Author of "Citrini's Scenario Is A Great But Deeply Flawed Thought Experiment"”

AI SafetyAI EconomicsFoundation ModelsAI Workforce

On Dwarkesh Patel's 2026 Podcast With Elon Musk and Other Recent Elon Musk Things

The ZvianalysisnegativeFeb 18, 2026

Elon Musk's views on AI alignment are confused, xAI's safety situation is deteriorating with the departure of its safety team, and Musk dismisses safety concerns as performative theater.

“Author of "On Dwarkesh Patel's 2026 Podcast With Elon Musk and Other Recent Elon Musk Things" in The Zvi”

AI SafetyAI AlignmentAI GovernanceGenerative AI

Claude Opus 4.6: System Card Part 2: Frontier Alignment

The ZvianalysiscautiousFeb 11, 2026

Claude Opus 4.6 demonstrates improved deception avoidance in safety tests but shows concerning capability at subversion strategies, raising questions about evaluation integrity and ASL-3 safety classification reliability for future models.

“Author of "Claude Opus 4.6: System Card Part 2: Frontier Alignment" in The Zvi”

AI SafetyFoundation ModelsAI AlignmentLLMs

Claude Opus 4.6: System Card Part 1: Mundane Alignment + MW

The ZvianalysiscautiousFeb 10, 2026

Claude Opus 4.6 represents a significant capability jump over 4.5 released only two months prior, with improved safety filtering (0.04% refusal rate for harmless requests) but concerning gaps in ASL-4 autonomous R&D evaluation protocols that may warrant classification as version 4.7.

“Author of "Claude Opus 4.6: System Card Part 1: Mundane Alignment + MW" in The Zvi”

Foundation ModelsAI SafetyLLMsAI Governance

Claude Codes #3

analysispositiveJan 22, 2026

Claude Code and Claude Cowork are dominating the AI market with a 'GPT moment,' enabling non-technical users to accomplish complex tasks and causing OpenAI sentiment to weaken amid competitive pressures.

“Author of "Claude Codes #3"”

AI ProductsFoundation ModelsAI CompetitionAI Agents

GPT-5.2 Is Frontier Only For The Frontier

The ZvianalysismixedDec 16, 2025

GPT-5.2 is a capable frontier model for professional knowledge work but represents incremental rather than transformative progress, with notably constrained personality and slower performance than benchmarks suggest.

“Author of "GPT-5.2 Is Frontier Only For The Frontier" in The Zvi”

LLMsAI ProductsFoundation ModelsAI Competition

AI #145: You've Got Soul

analysispositiveDec 5, 2025

Anthropic's Claude Opus 4.5 introduces a novel 'soul document' approach to alignment that explains virtuous behavior and reasoning to the model, producing superior results compared to competing language models from OpenAI, Google, xAI, and DeepSeek.

“Author of "AI #145: You've Got Soul"”

LLMsAI SafetyFoundation ModelsAI Agents

OpenAI Moves To Complete Potentially The Largest Theft In Human History

opinionnegativeNov 3, 2025

OpenAI's conversion to a Public Benefit Corporation with uncapped investor profit shares represents a massive value transfer (potentially hundreds of billions) from its nonprofit foundation, constituting potentially the largest theft in human history.

“Author of "OpenAI Moves To Complete Potentially The Largest Theft In Human History"”

AI GovernanceAI RegulationAI Ethics

On Dwarkesh Patel's Podcast With Andrej Karpathy

analysisneutralOct 22, 2025

Andrej Karpathy argues the 'decade of agents' is more accurate than 2025 being the 'year of agents,' citing insufficient groundwork in intelligence and context handling, with AI agent adoption likely peaking in impact by 2027-2028.

“Author of "On Dwarkesh Patel's Podcast With Andrej Karpathy"”

AI AgentsAI ResearchFoundation ModelsGenerative AI

Bubble, Bubble, Toil, and Trouble

opinioncautiousOct 21, 2025

Market valuations of AI stocks may appear elevated but do not constitute a bubble comparable to the dot-com era, as current prices reflect plausible expectations for future cash flows.

“Author of "Bubble, Bubble, Toil, and Trouble"”

AI InvestmentAI CompetitionGenerative AIAI Governance

Bending The Curve

The ZviopinioncautiousOct 8, 2025

The AI safety debate has shifted from adversarial 'doomers vs accelerationists' framing to unified coordination on technical solutions, driven by constraints from geopolitical competition, economic dependencies, and existential stakes.

“Author of "Bending The Curve" in The Zvi”

AI SafetyAI GovernanceAI CompetitionAI Infrastructure

Claude Sonnet 4.5 Is A Very Good Model

The ZvianalysispositiveOct 2, 2025

Claude Sonnet 4.5 represents a major leap in coding, agent, and computer use capabilities, likely positioning it as the best coding model currently available, though GPT-5 may excel at particularly complex debugging tasks.

“Author of "Claude Sonnet 4.5 Is A Very Good Model" in The Zvi”

LLMsAI CompetitionFoundation ModelsGenerative AI

OpenAI Shows Us The Money

The ZvinewspositiveSep 25, 2025

Nvidia will invest up to $100 billion in OpenAI to deploy 10 gigawatts of infrastructure for next-generation AI model training, with OpenAI's Stargate project targeting $500 billion in total compute spending across five new data center sites.

“Author of "OpenAI Shows Us The Money" in The Zvi”

AI InfrastructureFoundation ModelsAI InvestmentAI Chips/Hardware