Research intern talk: Unified speech enhancement approach for speech degradations & noise suppression

Speech enhancement approaches generally focus on removing additive noise and reverberation that adversely affects the overall speech quality and intelligibility. Another group of signal degradations like clipping, bandwidth limitations, and codec degradation can occur due to poor recording hardware, network transmission, and other pre-processing. These degradations largely impact on intelligibility and speech quality. In this work, we deploy a convolutional recurrent network to remove these speech degradations in conjunction with the noise suppression task and propose cascade and end-to-end approaches. We compare both complex mask and direct spectrum estimation approaches for this task using a small real-time capable DNN. Overall, we propose a cascaded processing approach, addressing the distortion types differently, and enabling a task-tailored modular processing.

日期：: 2022年8月18日

- Khandokar Md. Nayem
  
  PhD student
  
  Indiana University, Bloomington
研究领域
- Audio and Acoustics
研究院
- Microsoft Research Lab - Redmond
组
- Audio and Acoustics Research Group

接下来观看

Final intern talk: Distilling Self-Supervised-Learning-Based Speech Quality Assessment into Compact
July 18, 2024
Speakers:

Benjamin Stahl,

Hannes Gamper
MSR Talk: Unsupervised Speech Reverberation Control with Diffusion Implicit Bridges
May 14, 2024
Speakers:

Hannes Gamper

Research intern talk: Unified speech enhancement approach for speech degradations & noise suppression

Speakers

Khandokar Md. Nayem

相关链接

研究领域

研究院

组

接下来观看

Final intern talk: Distilling Self-Supervised-Learning-Based Speech Quality Assessment into Compact

MSR Talk: Unsupervised Speech Reverberation Control with Diffusion Implicit Bridges