GPT-5.2 makes its first physics discovery

We're seeing some wild milestone moments: OpenAI's GPT-5.2 just published original theoretical physics research (yikes, we're past the "helpful assistant" phase), while a new benchmark called AIRS-Bench is stress-testing whether AI research agents are ready for the real lab. Meanwhile, Google's flagging distillation attacks from state-backed actors trying to replicate Gemini, and Alibaba's making a bold move pivoting developers toward robotics with RynnBrain instead of just app-building. So here's the question: if AI agents are publishing physics papers and building robots, what's the human researcher's job anymore?

Image via Agglomerations, Substack

arXiv

AIRS-Bench introduces 20 research-focused tasks to evaluate LLM agents across the full scientific research lifecycle, with preliminary results showing agents exceed human performance on only 4 tasks while failing on 16 others, signaling significant opportunities for advancing autonomous scientific research.

agentsllmbenchmarkresearch-automation

OpenAI Credits GPT-5.2 With First Original Theoretical Physics Contribution And Formal Proof

GPT-5.2 discovered and proved a previously unknown formula for gluon amplitudes in theoretical physics, marking a significant AI contribution to frontier scientific research. The achievement demonstrates how LLMs coupled with human expertise can generate original theoretical insights and formal proofs at the research frontier.

llmopenaiai-sciencetheoretical-physics

OpenAI Backs OpenClaw's Open-Source Future While Accelerating Personal Agent Research Internally

An AI researcher joins OpenAI to advance agent research while committing to keep their popular open-source project OpenClaw independent as a foundation-backed initiative. This signals OpenAI's strategy of supporting open-source ecosystems while developing proprietary agent capabilities internally.

agentsopenaiopen-sourceai-research

Google Flags Intellectual Property Theft Attempt Using Distillation To Replicate Gemini Outputs

Google detected widespread abuse of Gemini by threat actors through model extraction attacks, AI-augmented phishing campaigns, and malware development, prompting aggressive mitigation and a push for industry-wide AI security standards.

llmgeminisecuritythreat-intelligence

Alibaba Wants Developers Building Robots, Not Just Apps

Alibaba's open-source robotics AI signals a critical inflection point where physical AI deployment accelerates faster than governance frameworks can scale, creating opportunities and risks in the US-China competition.

roboticsphysical-aiopen-sourcealibaba

Keep Reading

•

ByteDance Introduces Seed 2.0 Agentic AI Models And Positions Seedance 2.0 As A Studio-Level Generative Video SystemByteDance

•

GitHub Enables Copilot Coding Agent To Access External Tools Directly Using MCP Server IntegrationGitHub

•

AI and the Economics of the Human TouchAgglomerations, Substack

Enjoyed this issue?

Get daily AI intel delivered to your inbox. No fluff, just the stories that matter.

Top Stories

Keep Reading

Enjoyed this issue?