-
Unleashing Text-to-Image Diffusion Models for Visual PerceptionWenliang Zhao1, Yongming Rao1, Zuyan Liu1, Benlin Liu, Jie Zhou, Jiwen Lu2023-03-032023-03-03
-
Collage DiffusionVishnu Sarukkai, Linden Li, Arden Ma, Christopher Ré, Kayvon FatahalianarXiv 2023. Paper  2023-03-012023-03-01
-
Towards Enhanced Controllability of Diffusion ModelsWonwoong Cho, Hareesh Ravi, Midhun Harikumar, Vinh Khuc, Krishna Kumar Singh, Jingwan Lu, David I. Inouye, Ajinkya KalearXiv 2023. Paper  2023-02-282023-02-28
-
Directed Diffusion: Direct Control of Object Placement through Attention GuidanceWan-Duo Kurt Ma, J.P. Lewis, W. Bastiaan Kleijn, Thomas LeungarXiv 2023. Paper  2023-02-252023-02-25
-
Modulating Pretrained Diffusion Models for Multimodal Image SynthesisCusuh Ham, James Hays, Jingwan Lu, Krishna Kumar Singh, Zhifei Zhang, Tobias HinzarXiv 2023. Paper  2023-02-242023-02-24
-
Controlled and Conditional Text to Image Generation with Diffusion PriorPranav Aggarwal, Hareesh Ravi, Naveen Marri, Sachin Kelkar, Fengbin Chen, Vinh Khuc, Midhun Harikumar, Ritiz Tambi, Sudharshan Reddy Kakumanu, Purvak Lapsiya, Alvin Ghouas, Sarah Saber, Malavika Ramprasad, Baldo Faieta, Ajinkya KalearXiv 2023. Paper  2023-02-232023-02-23
-
Region-Aware Diffusion for Zero-shot Text-driven Image EditingNisha Huang, Fan Tang, Weiming Dong, Tong-Yee Lee, Changsheng Xu2023-02-232023-02-23
-
Reduce, Reuse, Recycle: Compositional Generation with Energy-Based Diffusion Models and MCMCYilun Du, Conor Durkan, Robin Strudel, Joshua B. Tenenbaum, Sander Dieleman, Rob Fergus, Jascha Sohl-Dickstein, Arnaud Doucet, Will Grathwohl2023-02-222023-02-22
-
Learning 3D Photography Videos via Self-supervised Diffusion on Single ImagesXiaodong Wang1, Chenfei Wu1, Shengming Yin, Minheng Ni, Jianfeng Wang, Linjie Li, Zhengyuan Yang, Fan Yang, Lijuan Wang, Zicheng Liu, Yuejian Fang, Nan DuanarXiv 2023. Paper  2023-02-212023-02-21
-
Boundary Guided Mixing Trajectory for Semantic Control with Diffusion ModelsYe Zhu, Yu Wu, Zhiwei Deng, Olga Russakovsky, Yan YanarXiv 2023. Paper  2023-02-162023-02-16
-
MultiDiffusion: Fusing Diffusion Paths for Controlled Image GenerationOmer Bar-Tal1, Lior Yariv1, Yaron Lipman, Tali Dekel2023-02-162023-02-16
-
T2I-Adapter: Learning Adapters to Dig out More Controllable Ability for Text-to-Image Diffusion ModelsChong Mou, Xintao Wang, Liangbin Xie, Jian Zhang, Zhongang Qi, Ying Shan, Xiaohu Qie2023-02-162023-02-16
-
Text-driven Visual Synthesis with Latent Diffusion PriorTing-Hsuan Liao, Songwei Ge, Yiran Xu, Yao-Chih Lee, Badour AlBahar, Jia-Bin Huang2023-02-162023-02-16
-
Exploring the Representation Manifolds of Stable Diffusion Through the Lens of Intrinsic DimensionHenry Kvinge, Davis Brown, Charles GodfreyarXiv 2023. Paper  2023-02-162023-02-16
-
PRedItOR: Text Guided Image Editing with Diffusion PrioHareesh Ravi, Sachin Kelkar, Midhun Harikumar, Ajinkya KalearXiv 2023. Paper  2023-02-152023-02-15
-
Dataset Interfaces: Diagnosing Model Failures Using Controllable Counterfactual GenerationJoshua Vendrow1, Saachi Jain1, Logan Engstrom, Aleksander Madry2023-02-152023-02-15
-
Universal Guidance for Diffusion ModelsArpit Bansal1, Hong-Min Chu1, Avi Schwarzschild, Soumyadip Sengupta, Micah Goldblum, Jonas Geiping, Tom Goldstein2023-02-142023-02-14
-
Text-Guided Scene Sketch-to-Photo SynthesisAprilPyone MaungMaung, Makoto Shing, Kentaro Mitsui, Kei Sawada, Fumio OkuraarXiv 2023. Paper  2023-02-142023-02-14
-
Analyzing Multimodal Objectives Through the Lens of Generative Diffusion GuidanceChaerin Kong, Nojun KwakarXiv 2023. Paper  2023-02-102023-02-10
-
Adding Conditional Control to Text-to-Image Diffusion ModelsLvmin Zhang, Maneesh Agrawala2023-02-102023-02-10
-
Is This Loss Informative? Speeding Up Textual Inversion with Deterministic Objective EvaluationAnton Voronov1, Mikhail Khoroshikh1, Artem Babenko, Max RyabininarXiv 2023. Paper  2023-02-092023-02-09
-
Zero-shot Generation of Coherent Storybook from Plain Text Story using Diffusion ModelsHyeonho Jeong, Gihyun Kwon, Jong Chul YearXiv 2023. Paper  2023-02-082023-02-08
-
GLAZE: Protecting Artists from Style Mimicry by Text-to-Image ModelsShawn Shan, Jenna Cryan, Emily Wenger, Haitao Zheng, Rana Hanocka, Ben Y. ZhaoarXiv 2023. Paper  2023-02-082023-02-08
-
Q-Diffusion: Quantizing Diffusion ModelsXiuyu Li, Long Lian, Yijiang Liu, Huanrui Yang, Zhen Dong, Daniel Kang, Shanghang Zhang, Kurt KeutzerarXiv 2023. Paper  2023-02-082023-02-08
-
Hard Prompts Made Easy: Gradient-Based Discrete Optimization for Prompt Tuning and DiscoveryYuxin Wen1, Neel Jain1, John Kirchenbauer, Micah Goldblum, Jonas Geiping, Tom Goldstein2023-02-072023-02-07
-
Fair Diffusion: Instructing Text-to-Image Generation Models on FairnessFelix Friedrich, Patrick Schramowski, Manuel Brack, Lukas Struppek, Dominik Hintersdorf, Sasha Luccioni, Kristian KerstingarXiv 2023. Paper  2023-02-072023-02-07
-
Structure and Content-Guided Video Synthesis with Diffusion ModelsPatrick Esser, Johnathan Chiu, Parmida Atighehchian, Jonathan Granskog, Anastasis Germanidis2023-02-062023-02-06
-
Zero-shot Image-to-Image TranslationGaurav Parmar, Krishna Kumar Singh, Richard Zhang, Yijun Li, Jingwan Lu, Jun-Yan ZhuarXiv 2023. Paper  2023-02-062023-02-06
-
Eliminating Prior Bias for Semantic Image Editing via Dual-Cycle DiffusionZuopeng Yang, Tianshu Chu, Xin Lin, Erdun Gao, Daqing Liu, Jie Yang, Chaoyue WangarXiv 2023. Paper  2023-02-052023-02-05
-
ReDi: Efficient Learning-Free Diffusion Inference via Trajectory RetrievalKexun Zhang, Xianjun Yang, William Yang Wang, Lei LiarXiv 2023. Paper  2023-02-052023-02-05
-
Mixture of Diffusers for scene composition and high resolution image generationÁlvaro Barbero JiménezarXiv 2023. Paper  2023-02-052023-02-05
-
Semantic-Guided Image Augmentation with Pre-trained ModelsBohan Li, Xinghao Wang, Xiao Xu, Yutai Hou, Yunlong Feng, Feng Wang, Wanxiang ChearXiv 2023. Paper  2023-02-042023-02-04
-
TEXTure: Text-Guided Texturing of 3D ShapesElad Richardson1, Gal Metzer1, Yuval Alaluf, Raja Giryes, Daniel Cohen-Or2023-02-032023-02-03
-
Dreamix: Video Diffusion Models are General Video EditorsEyal Molad1, Eliahu Horwitz1, Dani Valevski1, Alex Rav Acha, Yossi Matias, Yael Pritch, Yaniv Leviathan, Yedid Hoshen2023-02-022023-02-02
-
Trash to Treasure: Using text-to-image models to inform the design of physical artefactsAmy Smith1, Hope Schroeder1, Ziv Epstein, Michael Cook, Simon Colton, Andrew LippmanarXiv 2023. Paper  2023-02-012023-02-01
-
Zero3D: Semantic-Driven Multi-Category 3D Shape GenerationBo Han, Yitong Liu, Yixuan ShenarXiv 2023. Paper  2023-01-312023-01-31
-
Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion ModelsHila Chefer1, Yuval Alaluf1, Yael Vinker, Lior Wolf, Daniel Cohen-Or2023-01-312023-01-31
-
GALIP: Generative Adversarial CLIPs for Text-to-Image SynthesisMing Tao, Bing-Kun Bao, Hao Tang, Changsheng Xu2023-01-302023-01-30
-
PromptMix: Text-to-image diffusion models enhance the performance of lightweight networksArian Bakhtiarnia, Qi Zhang, Alexandros Iosifidis2023-01-302023-01-30
-
Shape-aware Text-driven Layered Video EditingYao-Chih Lee, Ji-Ze Genevieve Jang, Yi-Ting Chen, Elizabeth Qiu, Jia-Bin Huang2023-01-302023-01-30
-
Towards Equitable Representation in Text-to-Image Synthesis Models with the Cross-Cultural Understanding Benchmark (CCUB) DatasetZhixuan Liu, Youeun Shin, Beverley-Claire Okogwu, Youngsik Yun, Lia Coleman, Peter Schaldenbrand, Jihie Kim, Jean OharXiv 2023. Paper  2023-01-282023-01-28
-
SEGA: Instructing Diffusion using Semantic DimensionsManuel Brack, Felix Friedrich, Dominik Hintersdorf, Lukas Struppek, Patrick Schramowski, Kristian KerstingarXiv 2023. Paper  2023-01-282023-01-28
-
Text-To-4D Dynamic Scene GenerationUriel Singer1, Shelly Sheynin1, Adam Polyak1, Oron Ashual, Iurii Makarov, Filippos Kokkinos, Naman Goyal, Andrea Vedaldi, Devi Parikh, Justin Johnson, Yaniv TaigmanarXiv 2023. Paper  2023-01-262023-01-26
-
Guiding Text-to-Image Diffusion Model Towards Grounded GenerationZiyi Li, Qinye Zhou, Xiaoyun Zhang, Ya Zhang, Yanfeng Wang, Weidi Xie2023-01-122023-01-12
-
Visual Story Generation Based on Emotion and KeywordsYuetian Chen, Ruohua Li, Bowen Shi, Peiru Liu, Mei SiAAAI 2022. Paper  2023-01-072023-01-07
-
Muse: Text-To-Image Generation via Masked Generative TransformersHuiwen Chang1, Han Zhang1, Jarred Barber, AJ Maschinot, Jose Lezama, Lu Jiang, Ming-Hsuan Yang, Kevin Murphy, William T. Freeman, Michael Rubinstein, Yuanzhen Li, Dilip Krishnan2023-01-022023-01-02
-
Exploring Vision Transformers as Diffusion LearnersHe Cao, Jianan Wang, Tianhe Ren, Xianbiao Qi, Yihao Chen, Yuan Yao, Lei ZhangarXiv 2022. Paper  2022-12-282022-12-28
-
Dream3D: Zero-Shot Text-to-3D Synthesis Using 3D Shape Prior and Text-to-Image Diffusion ModelsJiale Xu, Xintao Wang, Weihao Cheng, Yan-Pei Cao, Ying Shan, Xiaohu Qie, Shenghua Gao2022-12-282022-12-28
-
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video GenerationJay Zhangjie Wu, Yixiao Ge, Xintao Wang, Weixian Lei, Yuchao Gu, Wynne Hsu, Ying Shan, Xiaohu Qie, Mike Zheng Shou2022-12-222022-12-22
-
Optimizing Prompts for Text-to-Image GenerationYaru Hao1, Zewen Chi1, Li Dong, Furu Wei2022-12-192022-12-19
-
Uncovering the Disentanglement Capability in Text-to-Image Diffusion ModelsQiucheng Wu, Yujian Liu, Handong Zhao, Ajinkya Kale, Trung Bui, Tong Yu, Zhe Lin, Yang Zhang, Shiyu Chang2022-12-162022-12-16
-
TeTIm-Eval: a novel curated evaluation data set for comparing text-to-image modelsFederico A. Galatolo, Mario G. C. A. Cimino, Edoardo CogottiarXiv 2022. Paper  2022-12-152022-12-15
-
Imagen Editor and EditBench: Advancing and Evaluating Text-Guided Image InpaintingSu Wang1, Chitwan Saharia1, Ceslee Montgomery1, Jordi Pont-Tuset, Shai Noy, Stefano Pellegrini, Yasumasa Onoe, Sarah Laszlo, David J. Fleet, Radu Soricut, Jason Baldridge, Mohammad Norouzi, Peter Anderson, William ChanarXiv 2022. Paper  2022-12-132022-12-13
-
The Stable Artist: Steering Semantics in Diffusion Latent SpaceManuel Brack, Patrick Schramowski, Felix Friedrich, Dominik Hintersdorf, Kristian KerstingarXiv 2022. Paper  2022-12-122022-12-12
-
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image SynthesisWeixi Feng, Xuehai He, Tsu-Jui Fu, Varun Jampani, Arjun Akula, Pradyumna Narayana, Sugato Basu, Xin Eric Wang, William Yang Wang2022-12-092022-12-09
-
SmartBrush: Text and Shape Guided Object Inpainting with Diffusion ModelShaoan Xie, Zhifei Zhang, Zhe Lin, Tobias Hinz, Kun ZhangarXiv 2022. Paper  2022-12-092022-12-09
-
Executing your Commands via Motion Diffusion in Latent SpaceXin Chen, Biao Jiang, Wen Liu, Zilong Huang, Bin Fu, Tao Chen, Jingyi Yu, Gang Yu2022-12-082022-12-08
-
Diffusion Guided Domain Adaptation of Image GeneratorsKunpeng Song, Ligong Han, Bingchen Liu, Dimitris Metaxas, Ahmed Elgammal2022-12-082022-12-08
-
Multi-Concept Customization of Text-to-Image DiffusionNupur Kumari, Bingliang Zhang, Richard Zhang, Eli Shechtman, Jun-Yan Zhu2022-12-082022-12-08
-
SINE: SINgle Image Editing with Text-to-Image Diffusion ModelsZhixing Zhang, Ligong Han, Arnab Ghosh, Dimitris Metaxas, Jian Ren2022-12-082022-12-08
-
SDFusion: Multimodal 3D Shape Completion, Reconstruction, and GenerationYen-Chi Cheng, Hsin-Ying Lee, Sergey Tulyakov, Alexander Schwing, Liangyan Gui2022-12-082022-12-08
-
MoFusion: A Framework for Denoising-Diffusion-based Motion SynthesisRishabh Dabral, Muhammad Hamza Mughal, Vladislav Golyanik, Christian Theobalt2022-12-082022-12-08
-
Judge, Localize, and Edit: Ensuring Visual Commonsense Morality for Text-to-Image GenerationSeongbeom Park, Suhong Moon, Jinkyu KimarXiv 2022. Paper  2022-12-072022-12-07
-
Magic: Multi Art Genre Intelligent Choreography Dataset and Network for 3D Dance GenerationRonghui Li, Junfan Zhao, Yachao Zhang, Mingyang Su, Zeping Ren, Han Zhang, Xiu LiarXiv 2022. Paper  2022-12-072022-12-07
-
Diffusion Video Autoencoders: Toward Temporally Consistent Face Video Editing via Disentangled Video EncodingGyeongman Kim, Hajin Shim, Hyunsu Kim, Yunjey Choi, Junho Kim, Eunho YangarXiv 2022. Paper  2022-12-062022-12-06
-
M-VADER: A Model for Diffusion with Multimodal ContextSamuel Weinbach1, Marco Bellagente1, Constantin Eichenberg, Andrew Dai, Robert Baldock, Souradeep Nanda, Björn Deiseroth, Koen Oostermeijer, Hannah Teufel, Andres Felipe Cruz-SalinasarXiv 2022. Paper  2022-12-062022-12-06
-
ADIR: Adaptive Diffusion for Image ReconstructionShady Abu-Hussein, Tom Tirer, Raja Giryes2022-12-062022-12-06
-
Diffusion-SDF: Text-to-Shape via Voxelized DiffusionMuheng Li, Yueqi Duan, Jie Zhou, Jiwen LuarXiv 2022. Paper  2022-12-062022-12-06
-
NeRDi: Single-View NeRF Synthesis with Language-Guided Diffusion as General Image PriorsCongyue Deng, Chiyu "Max'' Jiang, Charles R. Qi, Xinchen Yan, Yin Zhou, Leonidas Guibas, Dragomir AnguelovarXiv 2022. Paper  2022-12-062022-12-06
-
Shape-Guided Diffusion with Inside-Outside AttentionDong Huk Park1, Grace Luo1, Clayton Toste, Samaneh Azadi, Xihui Liu, Maka Karalashvili, Anna Rohrbach, Trevor Darrell2022-12-012022-12-01
-
Unite and Conquer: Cross Dataset Multimodal Synthesis using Diffusion ModelsNithin Gopalakrishnan Nair, Wele Gedara Chaminda Bandara, Vishal M. Patel2022-12-012022-12-01
-
DATID-3D: Diversity-Preserved Domain Adaptation Using Text-to-Image Diffusion for 3D Generative ModelGwanghyun Kim, Se Young Chun2022-11-292022-11-29
-
SinDDM: A Single Image Denoising Diffusion ModelVladimir Kulikov, Shahar Yadin, Matan Kleiner, Tomer Michaeli2022-11-292022-11-29
-
Unified Discrete Diffusion for Simultaneous Vision-Language GenerationMinghui Hu, Chuanxia Zheng, Heliang Zheng, Tat-Jen Cham, Chaoyue Wang, Zuopeng Yang, Dacheng Tao, Ponnuthurai N. SuganthanarXiv 2022. Paper  2022-11-272022-11-27
-
SpaText: Spatio-Textual Representation for Controllable Image GenerationOmri Avrahami, Thomas Hayes, Oran Gafni, Sonal Gupta, Yaniv Taigman, Devi Parikh, Dani Lischinski, Ohad Fried, Xi Yin2022-11-252022-11-25
-
3DDesigner: Towards Photorealistic 3D Object Generation and Editing with Text-guided Diffusion ModelsGang Li, Heliang Zheng, Chaoyue Wang, Chang Li, Changwen Zheng, Dacheng TaoarXiv 2022. Paper  2022-11-252022-11-25
-
Shifted Diffusion for Text-to-image GenerationYufan Zhou, Bingchen Liu, Yizhe Zhu, Xiao Yang, Changyou Chen, Jinhui XuarXiv 2022. Paper  2022-11-242022-11-24
-
Sketch-Guided Text-to-Image Diffusion ModelsAndrey Voynov, Kfir Aberman, Daniel Cohen-Or2022-11-242022-11-24
-
SinDiffusion: Learning a Diffusion Model from a Single Natural ImageWeilun Wang, Jianmin Bao, Wengang Zhou, Dongdong Chen, Dong Chen, Lu Yuan, Houqiang Li2022-11-222022-11-22
-
Human Evaluation of Text-to-Image Models on a Multi-Task BenchmarkVitali Petsiuk, Alexander E. Siemenn, Saisamrit Surbehera, Zad Chin, Keith Tyser, Gregory Hunter, Arvind Raghavan, Yann Hicke, Bryan A. Plummer, Ori Kerret, Tonio Buonassisi, Kate Saenko, Armando Solar-Lezama, Iddo DroriNeurIPS 2022. Paper  2022-11-222022-11-22
-
Plug-and-Play Diffusion Features for Text-Driven Image-to-Image TranslationNarek Tumanyan1, Michal Geyer1, Shai Bagon, Tali DekelarXiv 2022. Paper  2022-11-222022-11-22
-
EDICT: Exact Diffusion Inversion via Coupled TransformationsBram Wallace, Akash Gokul, Nikhil NaikarXiv 2022. Paper  2022-11-222022-11-22
-
VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion ModelsAjay Jain1, Amber Xie1, Pieter Abbeel2022-11-212022-11-21
-
Investigating Prompt Engineering in Diffusion ModelsSam Witteveen, Martin AndrewsarXiv 2022. Paper  2022-11-212022-11-21
-
SinFusion: Training Diffusion Models on a Single Image or VideoYaniv Nikankin, Niv Haim, Michal IraniarXiv 2022. Paper  2022-11-212022-11-21
-
DiffStyler: Controllable Dual Diffusion for Text-Driven Image StylizationNisha Huang, Yuxin Zhang, Fan Tang, Chongyang Ma, Haibin Huang, Yong Zhang, Weiming Dong, Changsheng XuarXiv 2022. Paper  2022-11-192022-11-19
-
Magic3D: High-Resolution Text-to-3D Content CreationChen-Hsuan Lin1, Jun Gao1, Luming Tang1, Towaki Takikawa1, Xiaohui Zeng1, Xun Huang, Karsten Kreis, Sanja Fidler, Ming-Yu Liu, Tsung-Yi Lin2022-11-182022-11-18
-
Invariant Learning via Diffusion Dreamed Distribution ShiftsPriyatham Kattakinda, Alexander Levine, Soheil FeiziarXiv 2022. Paper  2022-11-182022-11-18
-
InstructPix2Pix: Learning to Follow Image Editing InstructionsTim Brooks, Aleksander Holynski, Alexei A. EfrosarXiv 2022. Paper  2022-11-172022-11-17
-
Null-text Inversion for Editing Real Images using Guided Diffusion ModelRon Mokady, Amir Hertz, Kfir Aberman, Yael Pritch, Daniel Cohen-OrarXiv 2022. Paper  2022-11-172022-11-17
-
Direct Inversion: Optimization-Free Text-Driven Real Image Editing with Diffusion ModelsAdham Elarabawy, Harish Kamath, Samuel DentonarXiv 2022. Paper  2022-11-152022-11-15
-
Versatile Diffusion: Text, Images and Variations All in One Diffusion ModelXingqian Xu, Zhangyang Wang, Eric Zhang, Kai Wang, Humphrey Shi2022-11-152022-11-15
-
Arbitrary Style Guidance for Enhanced Diffusion-Based Text-to-Image GenerationZhihong Pan, Xin Zhou, Hao TianarXiv 2022. Paper  2022-11-142022-11-14
-
Safe Latent Diffusion: Mitigating Inappropriate Degeneration in Diffusion ModelsPatrick Schramowski, Manuel Brack, Björn Deiseroth, Kristian Kersting2022-11-092022-11-09
-
Rickrolling the Artist: Injecting Invisible Backdoors into Text-Guided Image Generation ModelsLukas Struppek, Dominik Hintersdorf, Kristian Kersting2022-11-042022-11-04
-
eDiffi: Text-to-Image Diffusion Models with an Ensemble of Expert DenoisersYogesh Balaji, Seungjun Nah, Xun Huang, Arash Vahdat, Jiaming Song, Karsten Kreis, Miika Aittala, Timo Aila, Samuli Laine, Bryan Catanzaro, Tero Karras, Ming-Yu Liu2022-11-022022-11-02
-
UPainting: Unified Text-to-Image Diffusion Generation with Cross-modal GuidanceWei Li, Xue Xu, Xinyan Xiao, Jiachen Liu, Hu Yang, Guohao Li, Zhanpeng Wang, Zhifan Feng, Qiaoqiao She, Yajuan Lyu, Hua WuarXiv 2022. Paper  2022-10-282022-10-28
-
MagicMix: Semantic Mixing with Diffusion ModelsJun Hao Liew, Hanshu Yan, Daquan Zhou, Jiashi Feng2022-10-282022-10-28
-
ERNIE-ViLG 2.0: Improving Text-to-Image Diffusion Model with Knowledge-Enhanced Mixture-of-Denoising-ExpertsZhida Feng1, Zhenyu Zhang1, Xintong Yu1, Yewei Fang, Lanxin Li, Xuyi Chen, Yuxiang Lu, Jiaxiang Liu, Weichong Yin, Shikun Feng, Yu Sun, Hao Tian, Hua Wu, Haifeng WangarXiv 2022. Paper  2022-10-272022-10-27
-
How well can Text-to-Image Generative Models understand Ethical Natural Language Interventions?Hritik Bansal1, Da Yin1, Masoud Monajatipoor, Kai-Wei Chang2022-10-272022-10-27
-
DiffusionDB: A Large-scale Prompt Gallery Dataset for Text-to-Image Generative ModelsZijie J. Wang, Evan Montoya, David Munechika, Haoyang Yang, Benjamin Hoover, Duen Horng Chau2022-10-262022-10-26
-
Lafite2: Few-shot Text-to-Image GenerationYufan Zhou, Chunyuan Li, Changyou Chen, Jianfeng Gao, Jinhui XuarXiv 2022. Paper  2022-10-252022-10-25
-
High-Resolution Image Editing via Multi-Stage Blended DiffusionJohannes Ackermann, Minjun Li2022-10-242022-10-24
-
A Visual Tour Of Current Challenges In Multimodal Language ModelsShashank Sonkar, Naiming Liu, Richard G. BaraniukarXiv 2022. Paper  2022-10-222022-10-22
-
Conditional Diffusion with Less Explicit Guidance via Model Predictive ControlMax W. Shen, Ehsan Hajiramezanali, Gabriele Scalia, Alex Tseng, Nathaniel Diamant, Tommaso Biancalani, Andreas LoukasarXiv 2022. Paper  2022-10-212022-10-21
-
Diffusion Models already have a Semantic Latent SpaceMingi Kwon, Jaeseok Jeong, Youngjung Uh2022-10-202022-10-20
-
DiffEdit: Diffusion-based semantic image editing with mask guidanceGuillaume Couairon, Jakob Verbeek, Holger Schwenk, Matthieu CordarXiv 2022. Paper  2022-10-202022-10-20
-
Swinv2-Imagen: Hierarchical Vision Transformer Diffusion Models for Text-to-Image GenerationRuijun Li, Weihua Li, Yi Yang, Hanyu Wei, Jianhua Jiang, Quan BaiarXiv 2022. Paper  2022-10-182022-10-18
-
UniTune: Text-Driven Image Editing by Fine Tuning an Image Generation Model on a Single ImageDani Valevski, Matan Kalman, Yossi Matias, Yaniv LeviathanarXiv 2022. Paper  2022-10-182022-10-18
-
Imagic: Text-Based Real Image Editing with Diffusion ModelsBahjat Kawar1, Shiran Zada1, Oran Lang, Omer Tov, Huiwen Chang, Tali Dekel, Inbar Mosseri, Michal IraniarXiv 2022. Paper  2022-10-172022-10-17
-
Leveraging Off-the-shelf Diffusion Model for Multi-attribute Fashion Image ManipulationChaerin Kong, DongHyeon Jeon, Ohjoon Kwon, Nojun KwakarXiv 2022. Paper  2022-10-122022-10-12
-
Unifying Diffusion Models' Latent Space, with Applications to CycleDiffusion and GuidanceChen Henry Wu, Fernando De la Torre2022-10-112022-10-11
-
clip2latent: Text driven sampling of a pre-trained StyleGAN using denoising diffusion and CLIPJustin N. M. Pinkney, Chuan Li2022-10-052022-10-05
-
LDEdit: Towards Generalized Text Guided Image Manipulation via Latent Diffusion ModelsParamanand Chandramouli, Kanchana Vaishnavi GandikotaarXiv 2022. Paper  2022-10-052022-10-05
-
DALL-E-Bot: Introducing Web-Scale Diffusion Models to RoboticsIvan Kapelyukh, Vitalis Vosylius, Edward JohnsarXiv 2022. Paper  2022-10-052022-10-05
-
Imagen Video: High Definition Video Generation with Diffusion ModelsJonathan Ho1, William Chan1, Chitwan Saharia1, Jay Whang1, Ruiqi Gao, Alexey Gritsenko, Diederik P. Kingma, Ben Poole, Mohammad Norouzi, David J. Fleet, Tim SalimansarXiv 2022. Paper  2022-10-052022-10-05
-
Membership Inference Attacks Against Text-to-image Generation ModelsYixin Wu, Ning Yu, Zheng Li, Michael Backes, Yang ZhangarXiv 2022. Paper  2022-10-032022-10-03
-
2022-09-29
-
Re-Imagen: Retrieval-Augmented Text-to-Image GeneratorWenhu Chen, Hexiang Hu, Chitwan Saharia, William W. CohenarXiv 2022. Paper  2022-09-292022-09-29
-
DreamFusion: Text-to-3D using 2D DiffusionBen Poole, Ajay Jain, Jonathan T. Barron, Ben Mildenhall2022-09-292022-09-29
-
Make-A-Video: Text-to-Video Generation without Text-Video DataUriel Singer, Adam Polyak, Thomas Hayes, Xi Yin, Jie An, Songyang Zhang, Qiyuan Hu, Harry Yang, Oron Ashual, Oran Gafni, Devi Parikh, Sonal Gupta, Yaniv TaigmanarXiv 2022. Paper  2022-09-292022-09-29
-
Draw Your Art Dream: Diverse Digital Art Synthesis with Multimodal Guided DiffusionNisha Huang, Fan Tang, Weiming Dong, Changsheng Xu2022-09-272022-09-27
-
Personalizing Text-to-Image Generation via Aesthetic GradientsVictor Gallego2022-09-252022-09-25
-
Best Prompts for Text-to-Image Models and How to Find ThemNikita Pavlichenko, Dmitry UstalovarXiv 2022. Paper  2022-09-232022-09-23
-
The Biased Artist: Exploiting Cultural Biases via Homoglyphs in Text-Guided Image Generation ModelsLukas Struppek, Dominik Hintersdorf, Kristian Kersting2022-09-192022-09-19
-
Generative Visual Prompt: Unifying Distributional Control of Pre-Trained Generative ModelsChen Henry Wu, Saman Motamed, Shaunak Srivastava, Fernando De la Torre2022-09-142022-09-14
-
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven GenerationNataniel Ruiz, Yuanzhen Li, Varun Jampani, Yael Pritch, Michael Rubinstein, Kfir Aberman2022-08-252022-08-25
-
Text-Guided Synthesis of Artistic Images with Retrieval-Augmented Diffusion ModelsRobin Rombach1, Andreas Blattmann1, Björn Ommer2022-07-262022-07-26
-
Discrete Contrastive Diffusion for Cross-Modal and Conditional GenerationYe Zhu, Yu Wu, Kyle Olszewski, Jian Ren, Sergey Tulyakov, Yan Yan2022-06-152022-06-15
-
Blended Latent DiffusionOmri Avrahami, Ohad Fried, Dani Lischinski2022-06-062022-06-06
-
Compositional Visual Generation with Composable Diffusion ModelsNan Liu1, Shuang Li1, Yilun Du1, Antonio Torralba, Joshua B. Tenenbaum2022-06-032022-06-03
-
DiVAE: Photorealistic Images Synthesis with Denoising Diffusion DecoderJie Shi1, Chenfei Wu1, Jian Liang, Xiang Liu, Nan DuanarXiv 2022. Paper  2022-06-012022-06-01
-
Text2Human: Text-Driven Controllable Human Image GenerationYuming Jiang, Shuai Yang, Haonan Qiu, Wayne Wu, Chen Change Loy, Ziwei Liu2022-05-312022-05-31
-
Improved Vector Quantized Diffusion ModelsZhicong Tang, Shuyang Gu, Jianmin Bao, Dong Chen, Fang Wen2022-05-312022-05-31
-
Photorealistic Text-to-Image Diffusion Models with Deep Language UnderstandingChitwan Saharia1, William Chan1, Saurabh Saxena, Lala Li, Jay Whang, Emily Denton, Seyed Kamyar Seyed Ghasemipour, Burcu Karagol Ayan, S. Sara Mahdavi, Rapha Gontijo Lopes, Tim Salimans, Jonathan Ho, David J Fleet, Mohammad Norouzi2022-05-232022-05-23
-
Retrieval-Augmented Diffusion ModelsAndreas Blattmann1, Robin Rombach1, Kaan Oktay, Björn Ommer2022-04-252022-04-25
-
Hierarchical Text-Conditional Image Generation with CLIP LatentsAditya Ramesh, Prafulla Dhariwal, Alex Nichol, Casey Chu, Mark Chen2022-04-132022-04-13
-
KNN-Diffusion: Image Generation via Large-Scale RetrievalOron Ashual, Shelly Sheynin, Adam Polyak, Uriel Singer, Oran Gafni, Eliya Nachmani, Yaniv TaigmanarXiv 2022. Paper  2022-04-062022-04-06
-
High-Resolution Image Synthesis with Latent Diffusion ModelsRobin Rombach1, Andreas Blattmann1, Dominik Lorenz, Patrick Esser, Björn Ommer2021-12-202021-12-20
-
Tackling the Generative Learning Trilemma with Denoising Diffusion GANsZhisheng Xiao, Karsten Kreis, Arash Vahdat2021-12-152021-12-15
-
More Control for Free! Image Synthesis with Semantic Diffusion GuidanceXihui Liu, Dong Huk Park, Samaneh Azadi, Gong Zhang, Arman Chopikyan, Yuxiao Hu, Humphrey Shi, Anna Rohrbach, Trevor Darrell2021-12-102021-12-10
-
Blended Diffusion for Text-driven Editing of Natural ImagesOmri Avrahami, Dani Lischinski, Ohad Fried2021-11-292021-11-29
-
Vector Quantized Diffusion Model for Text-to-Image SynthesisShuyang Gu, Dong Chen, Jianmin Bao, Fang Wen, Bo Zhang, Dongdong Chen, Lu Yuan, Baining Guo2021-11-292021-11-29
-
DiffusionCLIP: Text-guided Image Manipulation Using Diffusion ModelsGwanghyun Kim, Jong Chul YeCVPR 2022. Paper  2021-10-062021-10-06
Counts - 144   Back to
top