Objective evaluations of two-stage binaural speech enhancement with Wiener filter for speech enhancement and sound localization
Resumé
For high-quality speech communication, we previously proposed a two-stage binaural speech enhancement with Wiener lter (TS-BASE/WF) approach inspired by the equalization-cancellation (EC) theory, to suppress interfering signals and preserve impression of acoustic scene. In the proposed TS-BASE/ WF, the interfering signal is rst estimated by equalizing and cancelling the target signal through two equalizers and a time-variant Wiener filter is then applied to enhance the target signal given the noisy mixture signals. In this paper, we pay main attention to the comprehensive experimental evaluations on its speech-enhancement performance and its ability in preserving binaural bene ts in a variety of acoustic conditions. Experimental results show that the TS-BASE/WF approach is able to suppress non-stationary multiple interfering signals and enhance the target signal which is expected to improve the quality of speech communication, and succeeds in preserving the binaural cues which is expected to give birth to the perceptual impression of the auditory scene, in all tested spatial scenarios.
Referencer
Culling, J. F., and Summer eld, Q. (1995). “Perceptual segregation of concurrent speech sounds: absence of across-frequency grouping by common interaural delay,” J. Acoust. Soc. Am. 98, 785-797.
Dorbecker M., and Ernst S. (1996). “Combination of two-channel spectral subtraction and adaptive Wiener post- ltering for noise reduction and dereverberation,” Proc. EUSIPCO, pp. 995-998.
Durlach, N. I. (1963). “Equalization and cancellation theory of binaural masking level differences,” J. Acoust. Soc. Am. 35, 1206-1218.
Jeffress, L. A. (1948). “A place theory of sound localization,” J. Comparative and Physiological Psychology 41, 35-39.
Klasen T. J., Van den Boqaert, T., Moonen, M., Wouters, J. (2007). “Binaural noise reduction algorithms for hearing aids that preserve interaural time delay cues processing,” IEEE Trans. on Signal Processing, 55, 1579-1585.
Kollmeier B., Peissig J., and Hohmann V. (1993). “Binaural noise-reduction hearing aid scheme with real-time processing in the frequency domain,” Scand. Audiol. Suppl. 38, 28-38.
Li, J., Sakamoto, S., Hongo, S., Akagi M., Suzuki Y. (2009). “Two-stage binaural speech enhancement with Wiener lter based on equalization-cancellation model,” Proc. IEEE Workshop on Application of Signal Processing to Audio and Acoustics (New Paltz, NY, USA), pp. 133-136.
Lotter T., Sauert B., and Vary P. (2005). “A stereo input-output superdirective beamformer for dual channel noise reduction,” Proc., Eurospeech, pp. 2285- 2288.
Nakashima H., Chisaki Y., Usagawa T., and Ebata M. (2003). “Frequency domain binaural model based on interaural phase and level differences,” Acoust. Sci. and Tech. 24, 172-178.
Scalart P., and Filho J. V. (1996) “Speech enhancement based on a priori signal to noise estimation,” Proc. ICASSP, vol. 2, pp. 629-632.
Yderligere filer
Publiceret
Citation/Eksport
Nummer
Sektion
Licens
Authors who publish with this journal agree to the following terms:
a. Authors retain copyright* and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
b. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
c. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).
*From the 2017 issue onward. The Danavox Jubilee Foundation owns the copyright of all articles published in the 1969-2015 issues. However, authors are still allowed to share the work with an acknowledgement of the work's authorship and initial publication in this journal.