1. Universal Speech Enhancement with Score-based Diffusion
    Joan Serrà, Santiago Pascual, Jordi Pons, R. Oguz Araz, Davide Scaini
    arXiv 2022. Paper  
    2022-06-07
  2. Guided-TTS 2: A Diffusion Model for High-quality Adaptive Text-to-Speech with Untranscribed Data
    Sungwon Kim, Heeseung Kim, Sungroh Yoon
    arXiv 2022. Paper  
    2022-05-30
  3. BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio Synthesis
    Yichong Leng, Zehua Chen, Junliang Guo, Haohe Liu, Jiawei Chen, Xu Tan, Danilo Mandic, Lei He, Xiang-Yang Li, Tao Qin, Sheng Zhao, Tie-Yan Liu
    arXiv 2022. Paper  
    2022-05-30
  4. FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis
    Rongjie Huang, Max W. Y. Lam, Jun Wang, Dan Su, Dong Yu, Yi Ren, Zhou Zhao
    arXiv 2022. Paper   Project  
    2022-04-21
  5. SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping
    Yuma Koizumi, Heiga Zen, Kohei Yatabe, Nanxin Chen, Michiel Bacchiani
    arXiv 2022. Paper  
    2022-03-31
  6. BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis
    Max W. Y. Lam, Jun Wang, Dan Su, Dong Yu
    ICLR 2022. Paper   Github  
    2022-03-25
  7. Conditional Diffusion Probabilistic Model for Speech Enhancement
    Yen-Ju Lu, Zhong-Qiu Wang, Shinji Watanabe, Alexander Richard, Cheng Yu, Yu Tsao
    IEEE 2022. Paper  
    2022-02-10
  8. InferGrad: Improving Diffusion Models for Vocoder by Considering Inference in Training
    Zehua Chen, Xu Tan, Ke Wang, Shifeng Pan, Danilo Mandic, Lei He, Sheng Zhao
    arXiv 2022. Paper  
    2022-02-08
  9. ItôWave: Itô Stochastic Differential Equation Is All You Need For Wave Generation
    Shoule Wu, Ziqiang Shi
    arXiv 2022. Paper   Project  
    2022-01-29
  10. DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
    Songxiang Liu, Dan Su, Dong Yu
    arXiv 2022. Paper  
    2022-01-28
  11. Itô-Taylor Sampling Scheme for Denoising Diffusion Probabilistic Models using Ideal Derivatives
    Hideyuki Tachibana, Mocho Go, Muneyoshi Inahara, Yotaro Katayama, Yotaro Watanabe
    arXiv 2021. Paper  
    2021-12-26
  12. Guided-TTS: Text-to-Speech with Untranscribed Speech
    Heeseung Kim, Sungwon Kim, Sungroh Yoon
    arXiv 2021. Paper  
    2021-11-30
  13. Denoising Diffusion Gamma Models
    Eliya Nachmani, Robin San Roman, Lior Wolf
    arXiv 2021. Paper  
    2021-10-10
  14. EdiTTS: Score-based Editing for Controllable Text-to-Speech
    Jaesung Tae, Hyeongju Kim, Taesu Kim
    arXiv 2021. Paper  
    2021-10-06
  15. A Study on Speech Enhancement Based on Diffusion Probabilistic Model
    Yen-Ju Lu, Yu Tsao, Shinji Watanabe
    arXiv 2021. Paper  
    2021-07-25
  16. Variational Diffusion Models
    Diederik P. Kingma, Tim Salimans, Ben Poole, Jonathan Ho
    arXiv 2021. Paper   Github  
    2021-07-01
  17. WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
    Nanxin Chen, Yu Zhang, Heiga Zen, Ron J. Weiss, Mohammad Norouzi, Najim Dehak, William Chan
    arXiv 2021. Paper   Project   Github   Github2  
    2021-06-17
  18. CRASH: Raw Audio Score-based Generative Modeling for Controllable High-resolution Drum Sound Synthesis
    Simon Rouard, Gaëtan Hadjeres
    arXiv 2021. Paper   Project  
    2021-06-14
  19. PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Driven Adaptive Prior
    Sang-gil Lee, Heeseung Kim, Chaehun Shin, Xu Tan, Chang Liu, Qi Meng, Tao Qin, Wei Chen, Sungroh Yoon, Tie-Yan Liu
    arXiv 2021. Paper   Project  
    2021-06-11
  20. DiffSVC: A Diffusion Probabilistic Model for Singing Voice Conversion
    Songxiang Liu, Yuewen Cao, Dan Su, Helen Meng
    arXiv 2021. Paper   Github  
    2021-05-28
  21. Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech
    Vadim Popov, Ivan Vovk, Vladimir Gogoryan, Tasnima Sadekova, Mikhail Kudinov
    ICML 2021. Paper   Project   Github  
    2021-05-13
  22. DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
    Jinglin Liu, Chengxi Li, Yi Ren, Feiyang Chen, Peng Liu, Zhou Zhao
    arXiv 2021. Paper   Project   Github  
    2021-05-06
  24. Restoring degraded speech via a modified diffusion model
    Jianwei Zhang, Suren Jayasuriya, Visar Berisha
    Interspeech 2021. Paper  
    2021-04-22
  25. NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling
    Junhyeok Lee, Seungu Han
    Interspeech 2021. Paper   Project   Github  
    2021-04-06
  26. Diff-TTS: A Denoising Diffusion Model for Text-to-Speech
    Myeonghun Jeong, Hyeongju Kim, Sung Jun Cheon, Byoung Jin Choi, Nam Soo Kim
    Interspeech 2021. Paper  
    2021-04-03
  27. Symbolic Music Generation with Diffusion Models
    Gautam Mittal, Jesse Engel, Curtis Hawthorne, Ian Simon
    arXiv 2021. Paper   Code  
    2021-03-30
  28. DiffWave: A Versatile Diffusion Model for Audio Synthesis
    Zhifeng Kong, Wei Ping, Jiaji Huang, Kexin Zhao, Bryan Catanzaro
    ICLR 2021. Paper   Github  
    2020-09-21
  29. WaveGrad: Estimating Gradients for Waveform Generation
    Nanxin Chen, Yu Zhang, Heiga Zen, Ron J. Weiss, Mohammad Norouzi, William Chan
    ICLR 2021. Paper   Project   Github  
    2020-09-02