Publications
The year indicates the time when the work is mostly finished.
* indicates equal contribution.
2025
- Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation
Preprint, arXiv preprint arXiv:2501.12202.[Website] [Code]
2024
- MuLan: Adapting Multilingual Diffusion Models for Hundreds of Languages with Negligible Cost
Preprint, arXiv preprint arXiv:2412.01271.[Code]
- Flexitex: Enhancing Texture Generation with Visual Guidance
AAAI 2025, arXiv preprint arXiv:2409.12431.[Website]
- Scaling Mesh Generation via Compressive Tokenization
Preprint, arXiv preprint arXiv:2411.07025.[Website] [Code]
- VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks
NeurIPS 2024, arXiv preprint arXiv:2406.08394.[Code]
2023
- ControlLLM: Augment Language Models with Tools by Searching on Graphs
ECCV 2024, arXiv preprint arXiv:2310.17796.[Code]
- Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models
Preprint, arXiv preprint arXiv:2310.07653.[Website] [Code]
- ∇-Prox: Differentiable Proximal Algorithm Modeling for Large-Scale Optimization
SIGGRAPH 2023, ACM Transactions on Graphics.[Website] [Code] [Colab]
- InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Preprint, arXiv preprint arXiv:2305.05662.[Website] [Code]
- Denoising Diffusion Semantic Segmentation with Mask Prior Modeling
Preprint, arXiv preprint arXiv:2306.01721.[Code]
2022
- Hybrid Spectral Denoising Transformer with Guided Attention
ICCV 2023, International Conference on Computer Vision.[Poster] [Code]
- Mixed Attention Network for Hyperspectral Image Denoising
Preprint, arXiv preprint arXiv:2301.11525.[Code]
- Hyperspectral Image Super Resolution with Real Unaligned RGB Guidance
TNNLS, IEEE Transactions on Neural Networks and Learning Systems.[Website] [Code]
2021
- Deep plug-and-play prior for hyperspectral image restoration
Neurocomputing, Elsevier Neurocomputing.[Code]