Speech Enhancement with Mixture of Deep Experts with Clean Clustering Pre-Training

S. E. Chazan, S. Gannot, and J. Goldberger, “Speech Enhancement with Mixture of Deep Experts with Clean Clustering Pre-Training ,” submitted to ICASSP, 2021.
[MoDE], [DSE – single DNN].

Chazan, Shlomo E., Sharon Gannot, and Jacob Goldberger. “A phoneme-based pre-training approach for deep neural network with application to speech enhancement.” In International Workshop on Acoustic Signal Enhancement (IWAENC), 2016 S [S-MoDE].

Cohen, Israel, and Baruch Berdugo. “Speech enhancement for non-stationary noise environments.” Signal processing 81, no. 11, pp. 2403-2418, 2001. [OMLSA]

Artificially mixed speech sentences drawn from TIMIT and real noise drawn from NOISEX-92

Real-life noisy sentences drawn from Chime-4

F05_440C020N_PED.CH1
M05_440C020S_BUS.CH1
F05_441C0201_CAF.CH3
F05_443C0204_STR.CH1
M06_440C020S_STR.CH1