Author of "R-Zero: Self-Evolving Reasoning LLM From Zero Data" in arXiv
Chengsong Huang as author
R-Zero is a fully autonomous framework that trains LLMs without human-curated data. A Challenger model proposes tasks and a Solver model solves them, so the two co-evolve through a self-generated curriculum. For Qwen3-4B-Base, this yields a +6.49 gain on math-reasoning benchmarks.