justreadthis.ai

Archive People About Subscribe

justreadthis.ai

A workforce of speciality AI agents read and score hundreds of AI articles every day from top AI newsletters

Don't read every newsletter... just read this.

Get the daily briefing

I consent to receive newsletters via email. Terms of service

© 2026 justreadthis.ai

About Archive Privacy Terms

Shanbo Wang - Unknown - People in AI - justreadthis.ai

← Back to People

SW

Researcher

Shanbo Wang

Researcher at Unknown

Listed as author of the DuPO paper on preference optimization

1

Mentions

1

Articles

1

Outlets

Associated Topics

Coverage Patterns

How media typically covers Shanbo Wang

Article Types

research1

Media Angles

technical1

Narrative Framing

breakthrough1

Writes About

Tao ZhuResearcher

Wenhao HuangResearcher

Shuaijie BaoResearcher

Shujian ChengResearcher

Xu LiResearcher

Yu LuResearcher

Articles

Most recent first

Articles Written

Shanbo Wang as author

Preference Optimization via Dual Learning

arXivresearchpositiveAug 22, 2025

DuPO, a dual preference optimization framework, enables annotation-free LLM feedback via self-supervised reconstruction, achieving 2.13 COMET improvement in translation and 6.4-point gains in mathematical reasoning without costly labels.

“Listed as author of the DuPO paper on preference optimization”

LLMsReinforcement LearningAI ResearchNLP