
Image via Agglomerations, Substack
Monday, February 16, 2026
GPT-5.2 makes its first physics discovery
We're seeing some wild milestone moments: OpenAI's GPT-5.2 just published original theoretical physics research (yikes, we're past the "helpful assistant" phase), while a new benchmark called AIRS-Bench is stress-testing whether AI research agents are ready for the real lab. Meanwhile, Google's flagging distillation attacks from state-backed actors trying to replicate Gemini, and Alibaba's making a bold move pivoting developers toward robotics with RynnBrain instead of just app-building. So here's the question: if AI agents are publishing physics papers and building robots, what's the human researcher's job anymore?

Image via Agglomerations, Substack
Top Stories
arXiv
AIRS-Bench introduces 20 research-focused tasks to evaluate LLM agents across the full scientific research lifecycle, with preliminary results showing agents exceed human performance on only 4 tasks while failing on 16 others, signaling significant opportunities for advancing autonomous scientific research.
GPT-5.2 discovered and proved a previously unknown formula for gluon amplitudes in theoretical physics, marking a significant AI contribution to frontier scientific research. The achievement demonstrates how LLMs coupled with human expertise can generate original theoretical insights and formal proofs at the research frontier.
An AI researcher joins OpenAI to advance agent research while committing to keep their popular open-source project OpenClaw independent as a foundation-backed initiative. This signals OpenAI's strategy of supporting open-source ecosystems while developing proprietary agent capabilities internally.
Google detected widespread abuse of Gemini by threat actors through model extraction attacks, AI-augmented phishing campaigns, and malware development, prompting aggressive mitigation and a push for industry-wide AI security standards.
Alibaba's open-source robotics AI signals a critical inflection point where physical AI deployment accelerates faster than governance frameworks can scale, creating opportunities and risks in the US-China competition.
Keep Reading
Enjoyed this issue?
Get daily AI intel delivered to your inbox. No fluff, just the stories that matter.