Smartglasses processing

Short demonstration

1 desired speaker (user) + 3 undesired speakers (fixed locations); input SNR = -5 dB.

Signal at input (0:16), Signal at output (0:16)

Long demonstration

In the article, we describe experiments in which different signals were combined and processed with several algorithms. The desired signal was broadcast from a Head and Torso Simulator (HATS) and received by the 8-channel glasses-mounted array. The undesired signals were broadcast and received by the array in the same way. The recordings were then combined into various scenarios, which were used to test and compare the algorithms.
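As a rough illustration of how a desired recording and an undesired recording can be combined at a prescribed input SNR, the sketch below scales the noise so that the desired-to-noise power ratio matches a target value and then adds the two. The function names `mean_power` and `mix_at_snr`, the single-channel toy signals, and the choice to measure power over the whole signal are assumptions made for this example; the article may define the SNR differently (for instance, on a particular reference channel).

```python
import math

def mean_power(signal):
    """Mean squared amplitude of a sequence of samples."""
    return sum(x * x for x in signal) / len(signal)

def mix_at_snr(desired, noise, snr_db):
    """Return (mixture, gain).

    The noise is scaled by `gain` so that the ratio of desired power to
    scaled-noise power equals `snr_db` (in dB), and is then added to the
    desired signal sample by sample.
    """
    if len(desired) != len(noise):
        raise ValueError("signals must have equal length")
    target_ratio = 10.0 ** (snr_db / 10.0)
    gain = math.sqrt(mean_power(desired) / (mean_power(noise) * target_ratio))
    mixture = [d + gain * n for d, n in zip(desired, noise)]
    return mixture, gain

# Toy single-channel signals standing in for the recorded ones (hypothetical).
desired = [math.sin(0.05 * t) for t in range(1000)]
noise = [math.sin(0.31 * t + 1.0) for t in range(1000)]
mixture, gain = mix_at_snr(desired, noise, snr_db=-5.0)
```

For a multichannel array, the same gain would presumably be applied to every channel of the noise recording so that the spatial characteristics of the interference are preserved.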

Desired Signal

Desired signal (the average of the two omnidirectional sensors): 0:16

Noise Signal

Two noise scenarios were created:

(a) three stationary speech sources: a recording of three stationary undesired speech signals

(b) a moving speech source: a recording of a single undesired speech signal

Both are stereo recordings, with each channel corresponding to one of the omnidirectional sensors.
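A stereo recording of this kind, with one channel per omnidirectional sensor, can be reduced to a monopole average like the one used for the desired signal by averaging the two channels sample by sample. The sketch below is only illustrative: the function name `monopole_average` and the representation of the recording as a list of (left, right) sample pairs are assumptions made for this example.

```python
def monopole_average(stereo_frames):
    """Average the two omnidirectional channels sample by sample.

    `stereo_frames` is a sequence of (left, right) sample pairs, one pair
    per time instant; the result is a single-channel list of samples.
    """
    return [0.5 * (left + right) for left, right in stereo_frames]

# Hypothetical three-sample stereo recording.
frames = [(0.25, 0.75), (-1.0, 1.0), (0.5, 0.5)]
mono = monopole_average(frames)
```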

Noise Signal

Stationary noise sources (0:16), Moving noise source (0:16)

Results

Stationary scenario:

Input Signal

SNR = -10 dB (0:16), SNR = -5 dB (0:16), SNR = 0 dB (0:16), SNR = 5 dB (0:16)

Fixed-MVDR

SNR = -10 dB (0:16), SNR = -5 dB (0:16), SNR = 0 dB (0:16), SNR = 5 dB (0:16)

Fixed-MPDR

SNR = -10 dB (0:16), SNR = -5 dB (0:16), SNR = 0 dB (0:16), SNR = 5 dB (0:16)

Adaptive MPDR

SNR = -10 dB (0:16), SNR = -5 dB (0:16), SNR = 0 dB (0:16), SNR = 5 dB (0:16)

Oracle Adaptation

SNR = -10 dB (0:16), SNR = -5 dB (0:16), SNR = 0 dB (0:16), SNR = 5 dB (0:16)

Unprocessed Monopole Average

SNR = -10 dB (0:16), SNR = -5 dB (0:16), SNR = 0 dB (0:16), SNR = 5 dB (0:16)

Proposed Algorithm (without post-processing)

SNR = -10 dB (0:16), SNR = -5 dB (0:16), SNR = 0 dB (0:16), SNR = 5 dB (0:16)

Moving interference scenario:

Input Signal

SNR = -10 dB (0:16), SNR = -5 dB (0:16), SNR = 0 dB (0:16), SNR = 5 dB (0:16)

Fixed-MVDR

SNR = -10 dB (0:16), SNR = -5 dB (0:16), SNR = 0 dB (0:16), SNR = 5 dB (0:16)

Fixed-MPDR

SNR = -10 dB (0:16), SNR = -5 dB (0:16), SNR = 0 dB (0:16), SNR = 5 dB (0:16)

Adaptive MPDR

SNR = -10 dB (0:16), SNR = -5 dB (0:16), SNR = 0 dB (0:16), SNR = 5 dB (0:16)

Oracle Adaptation

SNR = -10 dB (0:16), SNR = -5 dB (0:16), SNR = 0 dB (0:16), SNR = 5 dB (0:16)

Unprocessed Monopole Average

SNR = -10 dB (0:16), SNR = -5 dB (0:16), SNR = 0 dB (0:16), SNR = 5 dB (0:16)

Proposed Algorithm (without post-processing)

SNR = -10 dB (0:16), SNR = -5 dB (0:16), SNR = 0 dB (0:16), SNR = 5 dB (0:16)

Post-processing results:

The following table demonstrates the effect of post-processing. The results pertain to the scenario with three static interferers. Two parameter sets are used for the post-processing stage: the first (termed “post1”) is more conservative, and the second (“post2”) is more aggressive.

Proposed Algorithm (without post-processing)

SNR = -10 dB (0:16), SNR = -5 dB (0:16), SNR = 0 dB (0:16), SNR = 5 dB (0:16)

Post 1

SNR = -10 dB (0:16), SNR = -5 dB (0:16), SNR = 0 dB (0:16), SNR = 5 dB (0:16)

Post 2

SNR = -10 dB (0:16), SNR = -5 dB (0:16), SNR = 0 dB (0:16), SNR = 5 dB (0:16)

Reference:

Dovid Y. Levin, Emanuël A.P. Habets, Sharon Gannot, Near-field signal acquisition for smartglasses using two acoustic vector-sensors, Speech Communication, Volume 83, October 2016, Pages 42-53.
[ArXiv document (open access)]; [Journal (possibly pay-walled)]