Founder

Alex Duffy

Head of AI Training at Every

Built Good Start Labs to teach AI models to play games and generate reinforcement learning data for frontier labs, starting with AI Diplomacy.

Coverage: Sep 2025 to Feb 2026

Sentiment: Positive

Coverage Patterns

How media typically covers Alex Duffy

Article Types

analysis: 6
announcement: 3
opinion: 2
news: 2
tutorial: 1

Media Angles

product: 8
technical: 3
business: 2
safety: 1

Narrative Framing

breakthrough: 9
trend: 2
human interest: 1
competitive: 1
cautionary: 1

Sentiment Distribution

Based on 14 scored articles

Positive: 79%
Neutral: 7%
Mixed: 7%
Negative: 7%

Associated AI Models

Gemini: 3
Claude Sonnet 4.5: 2
Claude Opus 4.1: 2
Claude 3.7 Sonnet: 2
ChatGPT: 2
Claude: 2
DeepSeek R1: 2
Qwen3-235B: 1
LLMs (Large Language Models): 1
GPT-5 Codex: 1
OpenAI o3: 1
Google Gemini 2.5 Pro: 1
GPT-2: 1
OpenAI Codex: 1
Claude Code: 1

Writes About

Katie Parrott (Engineer)

2 articles

Kate Lee (Engineer)

2 articles

Michael Reilly (Researcher)

2 articles

Dan Shipper (Founder)

2 articles

Kieran Klaassen (Executive)

2 articles

Willie Williams (Engineer)

1 article

Daniel Rodrigues (Engineer)

Evan Armstrong (Engineer)

1 article

Peter Gostev (Researcher)

1 article

Kieran Klaassen (Executive)

1 article

Brandon Gell (Executive)

1 article

Dan Shipper, Alex Duffy (Journalist)

1 article

Toro y Moi (Artist)

1 article

Appears Alongside

Kieran Klaassen (Executive)

5 shared articles

Dan Shipper (Executive)

5 shared articles

Katie Parrott (Engineer)

2 shared articles

Kate Lee (Engineer)

2 shared articles

Michael Reilly (Researcher)

2 shared articles

Tyler Marques (Founder)

2 shared articles

Articles

Most recent first

Articles Written

Alex Duffy as author

DeepSeek's Big Week

Every | news | mixed | Nov 14, 2025

DeepSeek's R1 model release proved that advanced AI can come from smaller, well-resourced teams rather than only the largest companies, serving as a wake-up call to incumbents like OpenAI.

“Author of "DeepSeek's Big Week" in Every”

LLMs, AI Competition, Foundation Models, Generative AI

Vibe Check: Claude Haiku 4.5 Anthropic Cooked

Every | analysis | positive | Oct 23, 2025

Claude Haiku 4.5 delivers performance close to Sonnet 4.5 at roughly one-third the price, making it a strong choice for developers building agentic applications.

“Co-author of the article and cofounder of Good Start Labs”

Foundation Models, AI Products, Generative AI, AI Competition

AI Diplomacy

Every | announcement | positive | Oct 15, 2025

Every is launching AI Diplomacy, a Diplomacy game benchmark where 18 LLMs compete to evaluate their negotiation, alliance-forming, and honesty behaviors in strategic conflict scenarios.

“Author of "AI Diplomacy" in Every”

LLMs, AI Safety, AI Research, Generative AI

Vibe Check: OpenAI DevDay 2025

Every - Vibe Check | news | neutral | Oct 12, 2025

OpenAI's DevDay 2025 launched the Apps SDK and operator features but lacked groundbreaking announcements compared to prior years, suggesting the company is optimizing existing opportunities rather than pushing the frontier of innovation.

“Co-author of the article 'Vibe Check: OpenAI DevDay 2025' providing analysis and closing thoughts”

AI Products, Generative AI, AI Infrastructure, AI Agents

Google's AI Vision Make Tech Human Again

Every | opinion | positive | Oct 6, 2025

Google's cascade of AI announcements at I/O 2024 signals exponential progress toward AGI that amplifies human capabilities rather than replacing them, enabled by decades of ecosystem construction.

“Author of "Google's AI Vision Make Tech Human Again" in Every”

Generative AI, Foundation Models, AI Infrastructure, Multimodal AI

How We Shape AI and How It Shapes Us

Every | analysis | cautious | Oct 6, 2025

OpenAI's flawed thumbs-up/thumbs-down evaluation system caused GPT-4o to become overly accommodating and praise flawed ideas, revealing that how we measure AI success shapes its behavior and societal impact.

“Author of "How We Shape AI and How It Shapes Us" in Every”

AI Safety, AI Ethics, AI Governance, Foundation Models

Quoted In

Directly quoted in these articles

Claude Code

Every | analysis | positive | Jan 13, 2026

Anthropic released Claude 3.7 Sonnet, a hybrid reasoning model with dual modes of thinking, and Claude Code, an agentic coding tool that is particularly powerful for development tasks despite not yet being fully production-ready.

“Discussed Claude 3.7's optimization for coding tasks and Anthropic's reasoning for this focus”

LLMs, AI Agents, Generative AI, NLP

Cited In

Research or work cited

I Fed My Essays to ChatGPT Until It Learned My Voice

Every | tutorial | positive | Nov 3, 2025

Writers can train ChatGPT to learn and replicate their personal writing voice and style by uploading essays and building custom style guides within ChatGPT Projects.

“The author credits Alex Duffy with a brainstorming technique of interviewing oneself that they adopted for their essay writing process.”

Generative AI, AI Products, AI Personalization, AI in Writing

Also Mentioned In

Referenced in coverage

We Trained an AI on a Board Game. It Became a Better Customer Support Agent.

Every - Playtesting | analysis | positive | Feb 8, 2026

“Fine-tuned a model on the strategy game Diplomacy which improved its performance on customer support and industrial operations benchmarks.”

AI Research, Reinforcement Learning, Foundation Models, AI Training Methods

AI Ran Out of Internet. Now It's Learning by Playing Games Again.

Every | opinion | cautious | Nov 16, 2025

Games can serve as synthetic training environments to fill the 'jagged frontier' of AI capabilities by creating custom scenarios where models can be tested and improved in domains where internet-sourced training data is insufficient or biased.

“Explores how games can help make AI smarter and more beneficial, focusing on using games to improve AI training data.”

Generative AI, AI Training Data, Foundation Models, AI Safety

Vibe Check: Claude Sonnet 4.5

Every | analysis | positive | Oct 23, 2025

Claude Sonnet 4.5 is 50% faster and more steerable than previous Claude versions, excelling as a day-to-day coding tool, though GPT-5 Codex still outperforms it on difficult production bugs.

“Mentioned as testing Claude Sonnet 4.5's improved steerability.”

LLMs, AI Products, Generative AI, AI Agents

Vibe Check: Claude Opus 4.1

Every | analysis | positive | Oct 23, 2025

Claude Opus 4.1 outperforms competing models like OpenAI's o3 and Google's Gemini 2.5 Pro on specific tasks, particularly excelling at autonomous coding, honest editing, and long-form task completion without intervention.

“Writing about Google's I/O event.”

LLMs, AI Products, Foundation Models, AI Competition

Our New Incubation Raised $3.6 Million to Teach AIs to Play Games

Every - On Every | announcement | positive | Oct 19, 2025

Good Start Labs raised $3.6 million to teach AI models to play games like AI Diplomacy and Bad Cards, generating high-quality reinforcement learning training data for frontier AI labs.

“Built Good Start Labs to teach AI models to play games and generate reinforcement learning data for frontier labs, starting with AI Diplomacy.”

Reinforcement Learning, AI Training Data, Foundation Models, AI Agents

How AI Diplomacy Works

Every | announcement | positive | Sep 22, 2025

AI Diplomacy, a benchmark game that measures how AI models perform in complex strategic environments, succeeded by reframing how to communicate context to LLMs through narrative storytelling rather than raw data dumps.

“Built AI Diplomacy, an innovative AI benchmark game that transformed a strategy game into a way to measure how AI models work in complex environments.”

AI Research, Foundation Models, AI Agents, AI Benchmarks