
Image via Google Blog
Thursday, October 2, 2025
Claude Sonnet 4.5 is now the best coding model
The model wars are heating up with Claude Sonnet 4.5 staking its claim as the world's best coding model while also nailing agent improvements (wild), and Google's throwing down with Tunix, a JAX-native library that's making LLM post-training way more accessible. Meanwhile, Thinking Machines launched the Tinker API for democratizing fine-tuning at scale, and there's some internal drama at OpenAI over whether Sora should be chasing consumer growth or staying true to the mission (yikes). On the reliability front, Alephic broke down how Claude Code uses the filesystem and Unix philosophy to build agents you can actually trust. If you had to bet your next project on one model right now, would you pick Claude or wait for what's next?

Top Stories
Thinking Machines Lab
Tinker democratizes language model fine-tuning by offering researchers a managed, cost-effective API that handles infrastructure complexity while leaving them in control of training methods and data. This enables broader participation in cutting-edge model customization without requiring in-house distributed training expertise.
Alephic
Claude Code's breakthrough success comes from combining filesystem access with Unix command integration, enabling stateful AI agents that overcome context window and memory limitations. This approach demonstrates a powerful template for building reliable agentic systems around existing model capabilities rather than constraining them.
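The pattern Alephic describes, persisting agent state to disk and delegating work to small Unix tools, can be sketched in a few lines. This is an illustrative toy under our own assumptions, not Claude Code's actual implementation; the scratch directory, `remember`/`recall` helpers, and `run` wrapper are all invented for the example.

```python
import subprocess
from pathlib import Path

# Hypothetical scratch directory: the agent writes state here so that
# long-running work survives a context-window reset.
SCRATCH = Path("agent_scratch")
SCRATCH.mkdir(exist_ok=True)

def remember(key: str, value: str) -> None:
    """Write a fact to disk instead of holding it in the model's context."""
    (SCRATCH / f"{key}.txt").write_text(value)

def recall(key: str) -> str:
    """Read a previously stored fact back from the filesystem."""
    return (SCRATCH / f"{key}.txt").read_text()

def run(cmd: list[str]) -> str:
    """Delegate to an existing Unix tool rather than re-implementing it."""
    return subprocess.run(cmd, capture_output=True, text=True).stdout

# The agent offloads both memory (filesystem) and computation (Unix tools):
remember("todo", "audit the login module")
word_count = run(["wc", "-w", str(SCRATCH / "todo.txt")]).split()[0]
print(recall("todo"), word_count)
```

The point of the template is composition: the model never needs unbounded context or bespoke tooling, because durable state lives on disk and battle-tested utilities like `wc` do the mechanical work.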
The Zvi
Claude Sonnet 4.5 represents a significant leap in coding and agentic capabilities with state-of-the-art benchmark results and an improved system prompt, positioning Anthropic competitively against OpenAI's GPT-5 despite higher per-token costs for non-subscription users.
TechCrunch
OpenAI's launch of Sora, an AI video social media app, has created internal friction between researchers worried about mission drift and leadership defending the consumer business as necessary funding for AGI research. The move raises questions about whether profit incentives will ultimately override the company's nonprofit charter.
Google Developers Blog
Google released Tunix, a JAX-native library that simplifies LLM post-training with modular APIs for fine-tuning, preference optimization, reinforcement learning, and distillation, enabling faster iteration on model alignment for production deployment.
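Of the post-training methods Tunix bundles, distillation is the easiest to show in miniature. The snippet below is a generic, framework-free sketch of a distillation loss (soft cross-entropy between teacher and student logits), not Tunix's actual API; its real interfaces are JAX-native and documented on the Google Developers Blog.

```python
import math

def softmax(logits: list[float], temperature: float = 1.0) -> list[float]:
    """Convert raw logits to probabilities at a given temperature."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(teacher_logits: list[float],
                      student_logits: list[float],
                      temperature: float = 2.0) -> float:
    """Cross-entropy of the student against the teacher's softened targets.

    Minimizing this pushes the student's output distribution toward the
    teacher's; a library like Tunix manages the surrounding machinery
    (batching, sharding, optimizer state) at scale.
    """
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    return -sum(pi * math.log(qi) for pi, qi in zip(p, q))

# A student that matches the teacher exactly incurs the minimum possible
# loss (the entropy of the teacher's distribution); a mismatched student
# pays an extra KL-divergence penalty on top of that.
teacher = [3.0, 1.0, 0.2]
matched = distillation_loss(teacher, [3.0, 1.0, 0.2])
mismatched = distillation_loss(teacher, [0.2, 1.0, 3.0])
print(matched < mismatched)
```

The same shape (define a loss over model outputs, minimize it) underlies the other post-training methods the blurb lists, with preference optimization and RL swapping in different objectives.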
Keep Reading
Industry Voices
Enjoyed this issue?
Get daily AI intel delivered to your inbox. No fluff, just the stories that matter.