[1] K. Lund, A. Ahrens, and T. Dau, “A method for evaluating audio-visual scene analysis in multi-talker environments”, ISAAR, vol. 7, pp. 357–364, Apr. 2020.