Publications
The year indicates the time when the work is mostly finished.
* indicates equal contribution.
2025
-
Hunyuan3D Studio: End-to-End AI Pipeline for Game-Ready 3D Asset Generation
Preprint, arXiv preprint arXiv:2509.12815.
-

-
X-Part: High Fidelity and Structure Coherent Shape Decomposition
Preprint, arXiv preprint arXiv:2509.08643.[Code]
-
Hunyuan3D 2.5: Towards High-Fidelity 3D Assets Generation with Ultimate Details
Preprint, arXiv preprint arXiv:2506.16504.[Code]
-
Hunyuan3D 2.1: From Images to High-Fidelity 3D Assets with Production-Ready PBR Material
Preprint, arXiv preprint arXiv:2506.15442.[Code]
-
FlashVDM: Unleashing Vecset Diffusion Model for Fast Shape Generation
ICCV 2025 Highlight, International Conference on Computer Vision.[Code]
-
Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation
Preprint, arXiv preprint arXiv:2501.12202.[Website] [Code]
2024
-
MuLan: Adapting Multilingual Diffusion Models for Hundreds of Languages with Negligible Cost
ICML 2025, International Conference on Machine Learning.[Code]
-
Flexitex: Enhancing Texture Generation with Visual Guidance
AAAI 2025, AAAI Conference on Artificial Intelligence.[Website]
-
Scaling Mesh Generation via Compressive Tokenization
CVPR 2025, The IEEE/CVF Conference on Computer Vision and Pattern Recognition.[Website] [Code]
-
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks
NeurIPS 2024, Conference on Neural Information Processing Systems.[Code]
2023
-
ControlLLM: Augment Language Models with Tools by Searching on Graphs
ECCV 2024, European Conference on Computer Vision.[Code]
-
Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models
Preprint, arXiv preprint arXiv:2310.07653.[Website] [Code]
-
∇-Prox: Differentiable Proximal Algorithm Modeling for Large-Scale Optimization
SIGGRAPH 2023, ACM Transactions on Graphics.[Website] [Code] [Colab]
-
InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Preprint, arXiv preprint arXiv:2305.05662.[Website] [Code]
-
Denoising Diffusion Semantic Segmentation with Mask Prior Modeling
Preprint, arXiv preprint arXiv:2306.01721.[Code]
2022
-
Hybrid Spectral Denoising Transformer with Guided Attention
ICCV 2023, International Conference on Computer Vision.[Poster] [Code]
-
Mixed Attention Network for Hyperspectral Image Denoising
Preprint, arXiv preprint arXiv:2301.11525.[Code]
-
Hyperspectral Image Super Resolution with Real Unaligned RGB Guidance
TNNLS, IEEE Transactions on Neural Networks and Learning Systems.[Website] [Code]
2021
-
Deep plug-and-play prior for hyperspectral image restoration
Neurocomputing, Elsevier Neurocomputing.[Code]