Alexander Wei stated that IMO gold-medal capability was reached through general-purpose reinforcement learning and test-time compute scaling rather than task-specific methodology.
How media typically covers Alexander Wei
Referenced in coverage
Google's Gemini Deep Think and OpenAI's experimental reasoning model both achieved gold-medal performance at the International Mathematical Olympiad, solving 5 of 6 problems within the human time limits using general-purpose reinforcement learning and test-time compute scaling.