ByteDance just matched GPT-4o without the huge bill

Wednesday, September 10, 2025

ByteDance just pulled off something wild with its backward reasoning method, REER: reasoning performance competitive with GPT-4o on open-ended tasks, without the expensive distillation or reinforcement learning bill (yikes for the rest of us). Meanwhile, Google's making moves on two fronts: cutting Veo 3 prices in half while adding 1080p and vertical video support, and releasing EmbeddingGemma, an embedding model for offline, privacy-first applications that actually runs on your phone. Anthropic's launching the MCP Registry, an open-source hub for discovering MCP servers, which is the kind of boring-but-necessary infrastructure that actually matters. Oh, and there's a whole debate about gross margins in AI that's pushing application builders to look way beyond token costs. Here's the real question: are you still optimizing for token efficiency, or have you moved on to the harder stuff?

Top Stories

1. The Gross Margin Debate in AI

AI gross margins diverge significantly across the stack, with application-layer companies facing the widest dispersion; success requires moving beyond token pricing, deepening workflows, and iterating pricing models to balance growth and profitability.

business-models, ai-economics, pricing-strategy, saas
2. ByteDance's Backward Reasoning

ByteDance's REER method achieves competitive reasoning performance in open-ended tasks by reverse-engineering thinking processes from good outputs, offering a scalable alternative to expensive distillation and reinforcement learning approaches.

llm, reasoning, open-source, bytedance
3. Google releases EmbeddingGemma, an open-source multilingual embedding model optimized for offline and mobile use

Google's EmbeddingGemma is a lightweight, open-source embedding model optimized for offline, on-device AI that enables private semantic search and RAG applications without cloud dependency. This addresses enterprise demand for privacy-preserving AI capabilities while democratizing access to high-quality multilingual embeddings.

google, open-source, embeddings, rag
4. Introducing the MCP Registry

The MCP Registry provides a unified, open-source hub for discovering MCP servers, enabling easier integration while allowing enterprises to build custom sub-registries on top of it. This standardization accelerates ecosystem adoption and interoperability across AI applications.

mcp, anthropic, open-source, developer-tools
5. Google slashes Veo 3 prices by 50%, adds 1080p and vertical video support

Google cuts Veo 3 pricing in half while adding vertical video and 1080p support, making its video generation model more accessible for mobile-first and social media applications. The move signals that Google is pushing to grow developer adoption and compete in the fast-growing generative video market.

video-generation, google, gemini-api, pricing
