
Co-authored the How2Everything framework for mining web procedures to evaluate LLMs.
How media typically covers Mohit Iyyer
Referenced in coverage
How2Everything, a new framework that mines 351K how-to procedures from the web, enables scalable evaluation and improvement of LLM procedural generation. Its LLM judge achieves 80.5% agreement with human annotators and, used with reinforcement learning, yields measurable performance gains.