Yaron Laufer and Sharon Gannot, “Scoring-Based ML Estimation and CRBs for Reverberation, Speech and Noise PSDs in a Spatially Homogeneous Noise-Field,” submitted to IEEE Transactions on Audio, Speech and Language Processing, July 2019.
In our experimental study, we consider two different real-life acoustic scenarios, recorded at the acoustic lab of the Engineering Faculty at Bar-Ilan University. Both scenarios consist of measured RIRs and recorded noises:
In both cases, the noisy and reverberant signals were constructed by convolving clean speech utterances (from the TIMIT database) with the corresponding RIR, and then adding the noise with several reverberant signal-to noise ratio (RSNR) levels.
Based on the proposed non-blocking-based and blocking-based ML estimators of the various PSDs, a multichannel Wiener filter (MCWF) is constructed, aiming to enhance the reverberant and noisy speech. Two versions for computing the MCWF are examined: i) The direct implementation, denoted henceforth as Dir; and ii) the decision-directed implementation, which will be referred to as DD.
Rev [ms] =400.
Rev [ms] =610.
[1] Y. Laufer and S. Gannot, “A Bayesian Hierarchical Model for Speech Enhancement with Time-Varying Audio Channel,” IEEE Trans. Audio, Speech and Language Processing, vol. 27, no. 1, pp. 225–239, Jan. 2019.
[2] E. Hadad, F. Heese, P. Vary, and S. Gannot, “Multichannel audio database in various acoustic environments,” in International Workshop on Acoustic Signal Enhancement (IWAENC), 2014, pp. 313–317.