
Project lead for Depth Anything 3, advancing visual geometry prediction models.
How media typically covers Bingyi Kang
Referenced in coverage
ByteDance released Depth Anything 3, a unified transformer-based model that predicts spatially consistent geometry from arbitrary visual inputs for monocular depth estimation, multi-view depth estimation, pose estimation, and 3D Gaussian generation using a single depth-ray representation.
“Project lead for Depth Anything 3, advancing visual geometry prediction models.”