Monday, March 16, 2026
Karpathy's AI does research now + OpenAI's latest buy
Karpathy just dropped Autoresearch for autonomous AI agent-driven ML research (wild timing), OpenAI acquired Promptfoo to beef up their security testing game while keeping it open-source (bold move), and Google launched a Universal Commerce Protocol so AI agents can actually complete purchases for you. Meanwhile, Cursor's marketplace exploded with 30+ new plugins to wire up your dev workflow. Would you let an AI agent shop with your credit card?
Top Stories
Andrej Karpathy open-sourced 'autoresearch', a 630-line tool enabling AI agents to autonomously conduct ML research by iterating on training code while humans engineer prompts. The system runs 5-minute training cycles with agents automatically optimizing hyperparameters and architecture through autonomous git commits.
Google DeepMind's CSRO framework uses LLMs to generate game-playing strategies as executable code rather than opaque neural networks, achieving competitive performance with traditional RL while producing fully interpretable, human-readable policies that can be inspected and debugged.
Promptfoo
OpenAI is acquiring Promptfoo, an open-source AI testing and security platform used by 130k monthly developers and 25% of Fortune 500 companies, to integrate its adversarial testing and evaluation capabilities into OpenAI's infrastructure while maintaining its open-source status.
Google Merchant Center Help
Google launched the Universal Commerce Protocol (UCP), an open standard enabling AI agents to handle complete shopping journeys including checkout on Google Search and Gemini surfaces, while merchants retain seller-of-record status and customers pay via Google Wallet.
Cursor
Cursor launched 30+ new marketplace plugins from partners like Atlassian, Datadog, and GitLab, extending its AI agent's ability to autonomously interact with development tools by combining MCPs with usage instructions that prove more effective than MCPs alone.
Keep Reading
Industry Voices
Yixin Liu
Yale University
Publishes cutting-edge research on reasoning in language models and multimodal AI systems from one of the top academic labs.
Song Jiang
Meta Superintelligence Labs
Ships insights from inside Meta's push toward AGI, including work on advanced reasoning and long-context systems.
Zane
a16z
Breaks down which AI startups are getting funded and why, with pattern recognition on what actually works in the market.
Enjoyed this issue?
Get daily AI intel delivered to your inbox. No fluff, just the stories that matter.