Publications
The year indicates the time when the work is mostly finished.
2024
- VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks
NeurIPS 2024, arXiv preprint arXiv:2406.08394.[Code]
2023
- ControlLLM: Augment Language Models with Tools by Searching on Graphs
ECCV 2024, arXiv preprint arXiv:2310.17796.[Code]
- Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models
Preprint, arXiv preprint arXiv:2310.07653.[Website] [Code]
- ∇-Prox: Differentiable Proximal Algorithm Modeling for Large-Scale Optimization
SIGGRAPH 2023, ACM Transactions on Graphics.[Website] [Code] [Colab]
- InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Preprint, arXiv preprint arXiv:2305.05662.[Website] [Code]
- Denoising Diffusion Semantic Segmentation with Mask Prior Modeling
Preprint, arXiv preprint arXiv:2306.01721.[Code]
2022
- Hybrid Spectral Denoising Transformer with Guided Attention
ICCV 2023, International Conference on Computer Vision.[Poster] [Code]
- Mixed Attention Network for Hyperspectral Image Denoising
Preprint, arXiv preprint arXiv:2301.11525.[Code]
- Hyperspectral Image Super Resolution with Real Unaligned RGB Guidance
TNNLS, IEEE Transactions on Neural Networks and Learning Systems.[Website] [Code]
2021
- Deep plug-and-play prior for hyperspectral image restoration
Neurocomputing, Elsevier Neurocomputing.[Code]