Yuanbo Yang

I am a Master's student at Zhejiang University, working with Prof. Yiyi Liao. I am also lucky to have collaboration with Prof. Andreas Geiger. Previously I obtained my B.Eng. degree in Electronics Engineering from Hangzhou Dianzi University in 2022.

My research interest lies in 3D Computer Vision and Generative Models, including:

  • Structured and Controllable 3D/Video Generative Model
  • Unleash the Power of “Free Lunch” (Synthetic Data, Pretrained Prior, etc.) to Enhance Generative Models
  • Learning Structured Generative Representation from In the Wild Data
  • Email  /  CV  /  X  /  Github

profile photo

News
  • Jul. 2024: Prometheus🔥 and ChronoDepth🕰️ accepted to CVPR 2025.

  • Jul. 2024: One paper accepted to ECCV 2024.

  • May. 2024: One paper accepted to SIGGRAPH 2024.

  • Jul. 2023: One paper accepted to ICCV 2023.

Research
Prometheus🔥: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation
Yuanbo Yang*, Jiahao Shao*, Xinyang Li, Yujun Shen, Andreas Geiger, Yiyi Liao†
CVPR, 2025
project page / Paper / arXiv / code

Prometheus introduces a novel method for feed-forward scene-level 3D generation in seconds. Its key idea is to harness the power of pre-trained 2D priors to enable generalizable and efficient 3D synthesis – hence its name, Prometheus🔥.

UrbanGen🚘: Urban Generation with Compositional and Controllable Neural Fields
Yuanbo Yang, Yujun Shen, Yue Wang, Andreas Geiger, Yiyi Liao†
arXiv, 2024
project page / Paper / code

UrbanGen proposes a solution for the challenging task of generating 3D urban radiance fields with photorealistic rendering, accurate geometry, high controllability, and diverse city styles.

ChronoDepth🕰️: Learning Temporally Consistent Video Depth from Video Diffusion Priors
Jiahao Shao*, Yuanbo Yang*, Hongyu Zhou, Youmin Zhang, Yujun Shen, Vitor Guizilini, Yue Wang, Matteo Poggi, Yiyi Liao†
CVPR, 2025
project page / arXiv / code

ChronoDepth addresses the challenge of streamed video depth estimation with video diffusion model.

Animated GIF
Animated GIF
TeFF🐘: Learning 3D-Aware GANs from Unposed Images with Template Feature Field (Oral)
Xinya Chen, Hanlei Guo, Yanrui Bin, Shangzhan Zhang, Yuanbo Yang, Yue Wang, Yujun Shen, Yiyi Liao†
In Proc. of the European Conf. on Computer Vision (ECCV), 2024
project page / arXiv / code

TeFF targets learning 3D-aware GANs from unposed images, for which we propose to perform on-the-fly pose estimation of training images with a learned template feature field (TeFF).

MaPa Image
MaPa Image
MaPa🖼️: Text-driven Photorealistic Material Painting for 3D Shapes
Shangzhan Zhang, Sida Peng, Tao Xu, Yuanbo Yang, Tianrun Chen, Nan Xue, Yujun Shen, Hujun Bao, Ruizhen Hu, Xiaowei Zhou
ACM Special Interest Group on Computer Graphics (SIGGRAPH), 2024.
project page / paper

MaPa introduces a method to create segment-wise procedural material graphs as the appearance representation, which supports high-quality rendering and provides significant flexibility in editing.

Animated GIF
Animated GIF
UrbanGIRAFFE🚗: Representing Urban Scenes as Compositional Generative Neural Feature Fields
Yuanbo Yang, Yifei Yang, Hanlei Guo, Rong Xiong, Yue Wang, Yiyi Liao†
In Proc. of the IEEE International Conf. on Computer Vision (ICCV), 2023
project page / arXiv / code

UrbanGIRAFFE leverages a coarse 3D panoptic prior to guide a 3D-aware generative model, enabling photorealistic 3D-aware image synthesis with diverse controllability, including large camera movement, stuff editing, and object manipulation.

Experiences

Ant Group
2023 - Present
Research Intern

Zhejiang University,
Sep. 2022 - June. 2025
Master's student

Hangzhou Dianzi University
Sep. 2018 - June. 2022
Bachelor Degree

Last updated Dec. 2024
Thanks Dr. Jon Barron for sharing the source code of his personal page.