I am currently a Senior Research Scientist at ByteDance/TikTok, working on video synthesis and generation.
Previously, I was a Postdoctoral Fellow supervised by Professor Hao (Richard) Zhang in the GrUVi Lab at Simon Fraser University (SFU), Canada.
During my postdoc, I worked on 3D shape reconstruction and content creation.
I received my PhD degree from Peking University, where I worked on Computer Vision and Computer Graphics.
During my PhD, I focused on applying deep generative models to analyze and synthesize 2D geometric data
such as glyphs, fonts, and layouts. I was supervised by Prof. Zhouhui Lian and Prof. Jianguo Xiao.
Representative papers are highlighted.
CVPR 2026
A video editing framework that encodes editing instructions with a vision-language model and leverages GRPO to enhance editing performance.
Yufan Deng, Yuanyang Yin, Xun Guo, Yizhi Wang, Jacob Zhiyuan Fang, Shenghai Yuan, Yiding Yang, Angtian Wang, Bo Liu, Haibin Huang, Chongyang Ma
ICLR 2026
A novel conditioning approach with subject disentanglement for identity-consistent video generation from multiple reference images.
ICLR 2025
A novel representation of 3D shapes using oriented and anisotropic local grids.
CVPR 2024
Slice3D predicts multi-slice images to reveal occluded parts without changing the camera, then lifts the slices into a 3D model.
ECCV 2024
A novel approach for shape abstraction via sweep surfaces, using superellipses for profiles and B-spline curves for axes.
ACM Transactions on Graphics (SIGGRAPH Journal-Track) 2024
A single-image mesh texturing approach that employs diffusion models with judicious conditioning to seamlessly transfer textures.
CVPR 2023
A category-agnostic shape encoding for learning neural field representations amid significant shape variations.
ICCV 2023
Automatically generates artistic typography by stylizing letter fonts to convey the semantics of an input word.
arXiv 2023
A bi-level feature representation for an image collection, with a per-image latent space above a multi-scale feature grid space.
CVPR 2023
Employing Transformers and a relaxed representation for higher-quality vector font generation, including Chinese glyphs.
CVPR 2022
A content-aware layout generation network that synthesizes aesthetic layouts from element images and their content.
ACM Transactions on Graphics (SIGGRAPH Asia 2021 Technical Paper) 2021
Synthesizing vector fonts via dual-modality learning and differentiable rasterization instead of rule-based vectorization.
ACM Transactions on Graphics (SIGGRAPH 2020 Technical Paper) 2020
Synthesizing fonts according to user-specified attributes such as italic, serif, cursive, and angularity.
ACM Multimedia 2020
A font-independent feature representation method to improve scene text recognition robustness under font variance.
MMM 2018
Recognizing font styles of texts in natural images with deep learning and transfer learning.