Researcher

Rohan Taori

Researcher at Unknown

Co-author of 'Alpaca: A Strong, Replicable Instruction-Following Model' (2021), cited for work on distillation for instruction-following models

Mentions

Articles

Outlets

Associated Topics

Coverage Patterns

How media typically covers Rohan Taori

Article Types

research1

Media Angles

technical1

Narrative Framing

trend1

Articles

Most recent first

Cited In

Research or work cited

On-Policy Distillation

Thinking Machines LabresearchneutralOct 28, 2025

On-policy distillation training approaches enable smaller specialized models to match or exceed larger generalist models' performance in focused domains through efficient knowledge transfer from teacher models.

“Co-author of 'Alpaca: A Strong, Replicable Instruction-Following Model' (2021), cited for work on distillation for instruction-following models”

LLMsFoundation ModelsAI TrainingReinforcement Learning