Researcher (affiliation not listed)
Co-author of 'Alpaca: A Strong, Replicable Instruction-Following Model' (2023), cited for work on distillation for instruction-following models
How media typically covers Rohan Taori
Research or work cited
On-policy distillation can let smaller, specialized models match or exceed the performance of larger generalist models in focused domains by transferring knowledge efficiently from a teacher model.