Yuanbo Yang

I am currently working as a research intern with Prof. Jun Gao. Before that, I received my Master's degree from Zhejiang University, where I worked with Prof. Yiyi Liao. I have also been fortunate to collaborate with Prof. Andreas Geiger. I obtained my B.Eng. degree in Electronics Engineering from Hangzhou Dianzi University in 2022.

My current research focuses on understanding and improving vision foundation models. Previously, I worked on 3D generative models. I’d really appreciate the chance to exchange ideas with anyone who shares similar interests! Feel free to reach out via email or X.

Email / CV / X / Github

News

Jul. 2025: One paper accepted to NeurIPS 2025.

Aug. 2025: UrbanGen accepted to TPAMI 2025.

Jul. 2024: Prometheus🔥 and ChronoDepth🕰️ accepted to CVPR 2025.

Jul. 2024: One paper accepted to ECCV 2024.

May 2024: One paper accepted to SIGGRAPH 2024.

Jul. 2023: One paper accepted to ICCV 2023.

Research

Prometheus🔥: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation
Yuanbo Yang*, Jiahao Shao*, Xinyang Li, Yujun Shen, Andreas Geiger, Yiyi Liao†
CVPR, 2025
project page / paper / arXiv / code

Prometheus introduces a novel method for feed-forward scene-level 3D generation in seconds. Its key idea is to harness the power of pre-trained 2D priors to enable generalizable and efficient 3D synthesis – hence its name, Prometheus🔥.

UrbanGen🚘: Urban Generation with Compositional and Controllable Neural Fields
Yuanbo Yang, Yujun Shen, Yue Wang, Andreas Geiger, Yiyi Liao†
TPAMI, 2025
project page / paper / code

UrbanGen proposes a solution for the challenging task of generating 3D urban radiance fields with photorealistic rendering, accurate geometry, high controllability, and diverse city styles.

ChronoDepth🕰️: Learning Temporally Consistent Video Depth from Video Diffusion Priors
Jiahao Shao*, Yuanbo Yang*, Hongyu Zhou, Youmin Zhang, Yujun Shen, Vitor Guizilini, Yue Wang, Matteo Poggi, Yiyi Liao†
CVPR, 2025
project page / arXiv / code

ChronoDepth addresses the challenge of streamed video depth estimation with a video diffusion model.

TeFF🐘: Learning 3D-Aware GANs from Unposed Images with Template Feature Field (Oral)
Xinya Chen, Hanlei Guo, Yanrui Bin, Shangzhan Zhang, Yuanbo Yang, Yue Wang, Yujun Shen, Yiyi Liao†
ECCV, 2024
project page / arXiv / code

TeFF targets learning 3D-aware GANs from unposed images, for which we propose to perform on-the-fly pose estimation of training images with a learned template feature field (TeFF).

MaPa🖼️: Text-driven Photorealistic Material Painting for 3D Shapes
Shangzhan Zhang, Sida Peng, Tao Xu, Yuanbo Yang, Tianrun Chen, Nan Xue, Yujun Shen, Hujun Bao, Ruizhen Hu, Xiaowei Zhou
SIGGRAPH, 2024
project page / paper

MaPa introduces a method to create segment-wise procedural material graphs as the appearance representation, which supports high-quality rendering and provides significant flexibility in editing.

UrbanGIRAFFE🚗: Representing Urban Scenes as Compositional Generative Neural Feature Fields
Yuanbo Yang, Yifei Yang, Hanlei Guo, Rong Xiong, Yue Wang, Yiyi Liao†
ICCV, 2023
project page / arXiv / code

UrbanGIRAFFE leverages a coarse 3D panoptic prior to guide a 3D-aware generative model, enabling photorealistic 3D-aware image synthesis with diverse controllability, including large camera movement, stuff editing, and object manipulation.

Experience

Ant Group
2023 - Present
Research Intern

Zhejiang University
Sep. 2022 - Jun. 2025
Master's student

Hangzhou Dianzi University
Sep. 2018 - Jun. 2022
Bachelor's Degree

Last updated Dec. 2024
Thanks to Dr. Jon Barron for sharing the source code of his personal page.