-
UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed DataHeeseung Kim, Sungwon Kim, Jiheum Yeom, Sungroh YoonarXiv 2023. Paper  2023-06-282023-06-28
-
Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesisShivam Mehta, Siyang Wang, Simon Alexanderson, Jonas Beskow, Éva Székely, Gustav Eje HenterarXiv 2023. Paper  2023-06-152023-06-15
-
HiddenSinger: High-Quality Singing Voice Synthesis via Neural Audio Codec and Latent Diffusion ModelsJi-Sang Hwang, Sang-Hoon Lee, Seong-Whan LeearXiv 2023. Paper  2023-06-122023-06-12
-
Boosting Fast and High-Quality Speech Synthesis with Linear DiffusionHaogeng Liu, Tao Wang, Jie Cao, Ran He, Jianhua TaoarXiv 2023. Paper  2023-06-092023-06-09
-
EmoMix: Emotion Mixing via Diffusion Models for Emotional Speech SynthesisHaobin Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing XiaoInterSpeech 2023. Paper  2023-06-012023-06-01
-
Efficient Neural Music GenerationMax W. Y. Lam, Qiao Tian, Tang Li, Zongyu Yin, Siyuan Feng, Ming Tu, Yuliang Ji, Rui Xia, Mingbo Ma, Xuchen Song, Jitong Chen, Yuping Wang, Yuxuan Wang2023-05-252023-05-25
-
2023-03-15
-
DiffuseRoll: Multi-track multi-category music generation based on diffusion modelHongfei WangarXiv 2023. Paper  2023-03-142023-03-14
-
Multi-Source Diffusion Models for Simultaneous Music Generation and SeparationGiorgio Mariani, Irene Tallini, Emilian Postolache, Michele Mancusi, Luca Cosmo, Emanuele Rodolà2023-02-042023-02-04
-
MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video GenerationLudan Ruan, Yiyang Ma, Huan Yang, Huiguo He, Bei Liu, Jianlong Fu, Nicholas Jing Yuan, Qin Jin, Baining Guo2022-12-192022-12-19
-
SDMuse: Stochastic Differential Music Editing and Generation via Hybrid RepresentationChen Zhang, Yi Ren, Kejun Zhang, Shuicheng Yan2022-11-012022-11-01
-
Full-band General Audio Synthesis with Score-based DiffusionSantiago Pascual, Gautam Bhattacharya, Chunghsin Yeh, Jordi Pons, Joan SerràarXiv 2022. Paper  2022-10-262022-10-26
-
Hierarchical Diffusion Models for Singing Voice Neural VocoderNaoya Takahashi, Mayank Kumar, Singh, Yuki MitsufujiarXiv 2022. Paper  2022-10-142022-10-14
-
Mandarin Singing Voice Synthesis with Denoising Diffusion Probabilistic Wasserstein GANYin-Ping Cho, Yu Tsao, Hsin-Min Wang, Yi-Wen Liu2022-09-212022-09-21
-
DDSP-based Singing Vocoders: A New Subtractive-based Synthesizer and A Comprehensive EvaluationDa-Yi Wu, Wen-Yi Hsiao, Fu-Rong Yang, Oscar Friedman, Warren Jackson, Scott Bruzenak, Yi-Wen Liu, Yi-Hsuan Yang2022-08-092022-08-09
-
ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-SpeechRongjie Huang, Zhou Zhao, Huadai Liu, Jinglin Liu, Chenye Cui, Yi Ren2022-07-132022-07-13
-
CARD: Classification and Regression Diffusion ModelsXizewen Han, Huangjie Zheng, Mingyuan ZhouNeurIPS 2022. Paper  2022-06-152022-06-15
-
Adversarial Audio Synthesis with Complex-valued Polynomial NetworksYongtao Wu, Grigorios G Chrysos, Volkan CevherICML workshop 2022. Paper  2022-06-142022-06-14
-
Multi-instrument Music Synthesis with Spectrogram DiffusionCurtis Hawthorne, Ian Simon, Adam Roberts, Neil Zeghidour, Josh Gardner, Ethan Manilow, Jesse EngelISMIR 2022. Paper  2022-06-112022-06-11
-
BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio SynthesisYichong Leng, Zehua Chen, Junliang Guo, Haohe Liu, Jiawei Chen, Xu Tan, Danilo Mandic, Lei He, Xiang-Yang Li, Tao Qin, Sheng Zhao, Tie-Yan Liu2022-05-302022-05-30
-
FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech SynthesisRongjie Huang, Max W. Y. Lam, Jun Wang, Dan Su, Dong Yu, Yi Ren, Zhou Zhao2022-04-212022-04-21
-
SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral ShapingYuma Koizumi, Heiga Zen, Kohei Yatabe, Nanxin Chen, Michiel BacchianiInterspeech 2022. Paper  2022-03-312022-03-31
-
BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech SynthesisMax W. Y. Lam, Jun Wang, Dan Su, Dong Yu2022-03-252022-03-25
-
ItôWave: Itô Stochastic Differential Equation Is All You Need For Wave GenerationShoule Wu, Ziqiang Shi2022-01-292022-01-29
-
Itô-Taylor Sampling Scheme for Denoising Diffusion Probabilistic Models using Ideal DerivativesHideyuki Tachibana, Mocho Go, Muneyoshi Inahara, Yotaro Katayama, Yotaro WatanabearXiv 2021. Paper  2021-12-262021-12-26
-
Denoising Diffusion Gamma ModelsEliya Nachmani, Robin San Roman, Lior WolfarXiv 2021. Paper  2021-10-102021-10-10
-
Variational Diffusion ModelsDiederik P. Kingma, Tim Salimans, Ben Poole, Jonathan Ho2021-07-012021-07-01
-
CRASH: Raw Audio Score-based Generative Modeling for Controllable High-resolution Drum Sound SynthesisSimon Rouard, Gaëtan Hadjeres2021-06-142021-06-14
-
PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Driven Adaptive PriorSang-gil Lee, Heeseung Kim, Chaehun Shin, Xu Tan, Chang Liu, Qi Meng, Tao Qin, Wei Chen, Sungroh Yoon, Tie-Yan Liu2021-06-112021-06-11
-
ItôTTS and ItôWave: Linear Stochastic Differential Equation Is All You Need For Audio GenerationShoule Wu, Ziqiang Shi2021-05-172021-05-17
-
DiffSinger: Singing Voice Synthesis via Shallow Diffusion MechanismJinglin Liu, Chengxi Li, Yi Ren, Feiyang Chen, Peng Liu, Zhou Zhao2021-05-062021-05-06
-
Symbolic Music Generation with Diffusion ModelsGautam Mittal, Jesse Engel, Curtis Hawthorne, Ian Simon2021-03-302021-03-30
-
DiffWave: A Versatile Diffusion Model for Audio SynthesisZhifeng Kong, Wei Ping, Jiaji Huang, Kexin Zhao, Bryan Catanzaro2020-09-212020-09-21
-
WaveGrad: Estimating Gradients for Waveform GenerationNanxin Chen, Yu Zhang, Heiga Zen, Ron J. Weiss, Mohammad Norouzi, William Cha2020-09-022020-09-02
Counts - 34   Back to
top