Author of "Reference-to-Video Generation" in arXiv
How this journalist typically writes
Shikun Liu as author
Saber, a zero-shot framework trained only on video-text pairs, generates reference-to-video content while preserving subject identity without requiring explicit R2V training datasets, outperforming models trained on costly triplet data.
“Author of "Reference-to-Video Generation" in arXiv”