Researcher at Harvard
Co-author of the research paper on reinforcement learning and LLM reasoning
How media typically covers Aayush Karan
Aayush Karan as author
Power sampling from base models matches or exceeds RL-posttraining performance on reasoning tasks (MATH500, HumanEval, GPQA Diamond) without modifying model weights.
“Co-author of the article 'Reasoning with Sampling'”