- Universal Speech Enhancement with Score-based Diffusion. Joan Serrà, Santiago Pascual, Jordi Pons, R. Oguz Araz, Davide Scaini. arXiv 2022. 2022-06-07
- Guided-TTS 2: A Diffusion Model for High-quality Adaptive Text-to-Speech with Untranscribed Data. Sungwon Kim, Heeseung Kim, Sungroh Yoon. arXiv 2022. 2022-05-30
- BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio Synthesis. Yichong Leng, Zehua Chen, Junliang Guo, Haohe Liu, Jiawei Chen, Xu Tan, Danilo Mandic, Lei He, Xiang-Yang Li, Tao Qin, Sheng Zhao, Tie-Yan Liu. arXiv 2022. 2022-05-30
- FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis. Rongjie Huang, Max W. Y. Lam, Jun Wang, Dan Su, Dong Yu, Yi Ren, Zhou Zhao. 2022-04-21
- SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping. Yuma Koizumi, Heiga Zen, Kohei Yatabe, Nanxin Chen, Michiel Bacchiani. arXiv 2022. 2022-03-31
- BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis. Max W. Y. Lam, Jun Wang, Dan Su, Dong Yu. 2022-03-25
- Conditional Diffusion Probabilistic Model for Speech Enhancement. Yen-Ju Lu, Zhong-Qiu Wang, Shinji Watanabe, Alexander Richard, Cheng Yu, Yu Tsao. IEEE 2022. 2022-02-10
- InferGrad: Improving Diffusion Models for Vocoder by Considering Inference in Training. Zehua Chen, Xu Tan, Ke Wang, Shifeng Pan, Danilo Mandic, Lei He, Sheng Zhao. arXiv 2022. 2022-02-08
- ItôWave: Itô Stochastic Differential Equation Is All You Need For Wave Generation. Shoule Wu, Ziqiang Shi. 2022-01-29
- DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs. Songxiang Liu, Dan Su, Dong Yu. arXiv 2022. 2022-01-28
- Itô-Taylor Sampling Scheme for Denoising Diffusion Probabilistic Models using Ideal Derivatives. Hideyuki Tachibana, Mocho Go, Muneyoshi Inahara, Yotaro Katayama, Yotaro Watanabe. arXiv 2021. 2021-12-26
- Guided-TTS: Text-to-Speech with Untranscribed Speech. Heeseung Kim, Sungwon Kim, Sungroh Yoon. arXiv 2021. 2021-11-30
- Denoising Diffusion Gamma Models. Eliya Nachmani, Robin San Roman, Lior Wolf. arXiv 2021. 2021-10-10
- EdiTTS: Score-based Editing for Controllable Text-to-Speech. Jaesung Tae, Hyeongju Kim, Taesu Kim. arXiv 2021. 2021-10-06
- A Study on Speech Enhancement Based on Diffusion Probabilistic Model. Yen-Ju Lu, Yu Tsao, Shinji Watanabe. arXiv 2021. 2021-07-25
- Variational Diffusion Models. Diederik P. Kingma, Tim Salimans, Ben Poole, Jonathan Ho. 2021-07-01
- WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis. Nanxin Chen, Yu Zhang, Heiga Zen, Ron J. Weiss, Mohammad Norouzi, Najim Dehak, William Chan. 2021-06-17
- CRASH: Raw Audio Score-based Generative Modeling for Controllable High-resolution Drum Sound Synthesis. Simon Rouard, Gaëtan Hadjeres. 2021-06-14
- PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Driven Adaptive Prior. Sang-gil Lee, Heeseung Kim, Chaehun Shin, Xu Tan, Chang Liu, Qi Meng, Tao Qin, Wei Chen, Sungroh Yoon, Tie-Yan Liu. 2021-06-11
- DiffSVC: A Diffusion Probabilistic Model for Singing Voice Conversion. Songxiang Liu, Yuewen Cao, Dan Su, Helen Meng. 2021-05-28
- Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech. Vadim Popov, Ivan Vovk, Vladimir Gogoryan, Tasnima Sadekova, Mikhail Kudinov. 2021-05-13
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism. Jinglin Liu, Chengxi Li, Yi Ren, Feiyang Chen, Peng Liu, Zhou Zhao. 2021-05-06
- Restoring degraded speech via a modified diffusion model. Jianwei Zhang, Suren Jayasuriya, Visar Berisha. Interspeech 2021. 2021-04-22
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling. Junhyeok Lee, Seungu Han. 2021-04-06
- Diff-TTS: A Denoising Diffusion Model for Text-to-Speech. Myeonghun Jeong, Hyeongju Kim, Sung Jun Cheon, Byoung Jin Choi, Nam Soo Kim. Interspeech 2021. 2021-04-03
- Symbolic Music Generation with Diffusion Models. Gautam Mittal, Jesse Engel, Curtis Hawthorne, Ian Simon. 2021-03-30
- DiffWave: A Versatile Diffusion Model for Audio Synthesis. Zhifeng Kong, Wei Ping, Jiaji Huang, Kexin Zhao, Bryan Catanzaro. 2020-09-21
- WaveGrad: Estimating Gradients for Waveform Generation. Nanxin Chen, Yu Zhang, Heiga Zen, Ron J. Weiss, Mohammad Norouzi, William Chan. 2020-09-02