Author of "Paying Less Generalization Tax: A Cross-Domain Generalization Study of RL Traini" on arXiv
How media typically covers Yixin Nie
Yixin Nie as author
Cross-domain generalization for RL-trained LLM agents is primarily driven by state information richness and planning complexity rather than domain realism, and can be improved through strategic state randomization and step-by-step reasoning during training.
“Author of "Paying Less Generalization Tax: A Cross-Domain Generalization Study of RL Traini" on arXiv”