Publications

The year indicates the time when the work is mostly finished.
* indicates equal contribution.

2025

  1. Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation
    Preprint, arXiv preprint arXiv:2501.12202. [Website] [Code]
    Zibo Zhao*, Zeqiang Lai*, Qingxiang Lin*, Yunfei Zhao*, Haolin Liu, Shuhui Yang*, Yifei Feng*, Mingxin Yang*, and 63 more authors

2024

  1. MuLan: Adapting Multilingual Diffusion Models for Hundreds of Languages with Negligible Cost
    Preprint, arXiv preprint arXiv:2412.01271. [Code]
    Sen Xing*, Muyan* Zhong, Zeqiang Lai*, Liangchen Li, Jiawen Liu, Yaohui Wang, Jifeng Dai, and Wenhai Wang
  2. Flexitex: Enhancing Texture Generation with Visual Guidance
    AAAI 2025, arXiv preprint arXiv:2409.12431. [Website]
    DaDong Jiang, Xianghui Yang, Zibo Zhao, Sheng Zhang, Jiaao Yu, Zeqiang Lai, Shaoxiong Yang, Chunchao Guo, and 2 more authors
  3. Scaling Mesh Generation via Compressive Tokenization
    Preprint, arXiv preprint arXiv:2411.07025. [Website] [Code]
    Haohan Weng, Zibo Zhao, Biwen Lei, Xianghui Yang, Jian Liu, Zeqiang Lai, Zhuo Chen, Yuhong Liu, and 3 more authors
  4. VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks
    NeurIPS 2024, arXiv preprint arXiv:2406.08394. [Code]
    Jiannan Wu*, Muyan Zhong*, Sen Xing*, Zeqiang Lai*Zhaoyang Liu*Wenhai Wang*, Zhe Chen, Xizhou Zhu, and 3 more authors

2023

  1. ControlLLM: Augment Language Models with Tools by Searching on Graphs
    ECCV 2024, arXiv preprint arXiv:2310.17796. [Code]
    Zhaoyang Liu*Zeqiang Lai*, Gao Zhangwei, Erfei Cui, Zhiheng Li, Xizhou Zhu, Lewei Lu, Qifeng Chen, and 3 more authors
  2. Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models
    Preprint, arXiv preprint arXiv:2310.07653. [Website] [Code]
    Zeqiang Lai, Xizhou Zhu, Jifeng DaiYu Qiao, and Wenhai Wang
  3. InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
    Preprint, arXiv preprint arXiv:2305.05662. [Website] [Code]
    Zhaoyang Liu*Yinan He*Wenhai Wang*, Weiyun Wang*, Yi Wang*, Shoufa Chen*, Qinglong Zhang*, Zeqiang Lai*, and 12 more authors
  4. Denoising Diffusion Semantic Segmentation with Mask Prior Modeling
    Preprint, arXiv preprint arXiv:2306.01721. [Code]
    Zeqiang Lai*Yuchen Duan*Jifeng Dai, Ziheng Li, Ying FuHongsheng LiYu Qiao, and Wenhai Wang

2022

  1. Hybrid Spectral Denoising Transformer with Guided Attention
    ICCV 2023, International Conference on Computer Vision. [Poster] [Code]
    Zeqiang Lai, Chenggang Yan, and Ying Fu
  2. Mixed Attention Network for Hyperspectral Image Denoising
    Preprint, arXiv preprint arXiv:2301.11525. [Code]
    Zeqiang Lai, and Ying Fu
  3. Hyperspectral Image Super Resolution with Real Unaligned RGB Guidance
    TNNLS, IEEE Transactions on Neural Networks and Learning Systems. [Website] [Code]
    Zeqiang LaiYing Fu, and Jun Zhang

2021

  1. Deep plug-and-play prior for hyperspectral image restoration
    Neurocomputing, Elsevier Neurocomputing. [Code]
    Zeqiang LaiKaixuan Wei, and Ying Fu