← Back to archive
GPT-5.2 makes its first physics discovery

Image via Agglomerations, Substack

Monday, February 16, 2026

GPT-5.2 makes its first physics discovery

We're seeing some wild milestone moments: OpenAI's GPT-5.2 just published original theoretical physics research (yikes, we're past the "helpful assistant" phase), while a new benchmark called AIRS-Bench is stress-testing whether AI research agents are ready for the real lab. Meanwhile, Google's flagging distillation attacks from state-backed actors trying to replicate Gemini, and Alibaba's making a bold move pivoting developers toward robotics with RynnBrain instead of just app-building. So here's the question: if AI agents are publishing physics papers and building robots, what's the human researcher's job anymore?

Top Stories

1
AIRS-Bench: A Suite of Tasks for Frontier AI Research Science Agents

arXiv

AIRS-Bench introduces 20 research-focused tasks to evaluate LLM agents across the full scientific research lifecycle, with preliminary results showing agents exceed human performance on only 4 tasks while failing on 16 others, signaling significant opportunities for advancing autonomous scientific research.

agentsllmbenchmarkresearch-automation
2
OpenAI Credits GPT-5.2 With First Original Theoretical Physics Contribution And Formal Proof

GPT-5.2 discovered and proved a previously unknown formula for gluon amplitudes in theoretical physics, marking a significant AI contribution to frontier scientific research. The achievement demonstrates how LLMs coupled with human expertise can generate original theoretical insights and formal proofs at the research frontier.

llmopenaiai-sciencetheoretical-physics
3
OpenAI Backs OpenClaw's Open-Source Future While Accelerating Personal Agent Research Internally

An AI researcher joins OpenAI to advance agent research while committing to keep their popular open-source project OpenClaw independent as a foundation-backed initiative. This signals OpenAI's strategy of supporting open-source ecosystems while developing proprietary agent capabilities internally.

agentsopenaiopen-sourceai-research
4
Google Flags Intellectual Property Theft Attempt Using Distillation To Replicate Gemini Outputs

Google detected widespread abuse of Gemini by threat actors through model extraction attacks, AI-augmented phishing campaigns, and malware development, prompting aggressive mitigation and a push for industry-wide AI security standards.

llmgeminisecuritythreat-intelligence
5
Alibaba Wants Developers Building Robots, Not Just Apps

Alibaba's open-source robotics AI signals a critical inflection point where physical AI deployment accelerates faster than governance frameworks can scale, creating opportunities and risks in the US-China competition.

roboticsphysical-aiopen-sourcealibaba

Keep Reading

Enjoyed this issue?

Get daily AI intel delivered to your inbox. No fluff, just the stories that matter.