  1. Zero-Shot Blind Audio Bandwidth Extension
    Eloi Moliner, Filip Elvander, Vesa Välimäki
    arXiv 2023. Paper  
    2023-06-02
  2. Diverse and Expressive Speech Prosody Prediction with Denoising Diffusion Probabilistic Model
    Xiang Li, Songxiang Liu, Max W. Y. Lam, Zhiyong Wu, Chao Weng, Helen Meng
    Interspeech 2023. Paper  
    2023-05-26
  3. Diffusion-Based Audio Inpainting
    Eloi Moliner, Vesa Välimäki
    arXiv 2023. Paper  
    2023-05-24
  4. FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion Models
    Ziyue Jiang, Qian Yang, Jialong Zuo, Zhenhui Ye, Rongjie Huang, Yi Ren, Zhou Zhao
    arXiv 2023. Paper   Github  
    2023-05-23
  5. A Preliminary Study on Augmenting Speech Emotion Recognition using a Diffusion Model
    Ibrahim Malik, Siddique Latif, Raja Jurdak, Björn Schuller
    arXiv 2023. Paper  
    2023-05-19
  6. AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models
    Yuancheng Wang, Zeqian Ju, Xu Tan, Lei He, Zhizheng Wu, Jiang Bian, Sheng Zhao
    arXiv 2023. Paper   Project  
    2023-04-03
  7. Data Augmentation for Environmental Sound Classification Using Diffusion Probabilistic Model with Top-k Selection Discriminator
    Yunhao Chen, Yunjie Zhu, Zihui Yan, Jianlu Shen, Zhen Ren, Yifan Huang
    arXiv 2023. Paper   Github  
    2023-03-27
  8. Enhancing Unsupervised Speech Recognition with Diffusion GANs
    Xianchao Wu
    ICASSP 2023. Paper  
    2023-03-23
  9. Defending against Adversarial Audio via Diffusion Model
    Shutong Wu, Jiongxiao Wang, Wei Ping, Weili Nie, Chaowei Xiao
    ICLR 2023. Paper   Github  
    2023-03-02
  10. TransFusion: Transcribing Speech with Multinomial Diffusion
    Matthew Baas, Kevin Eloff, Herman Kamper
    SACAIR 2022. Paper   Github  
    2022-10-14
Counts - 10