[1]
Lund K., Ahrens A., and Dau T., “A method for evaluating audio-visual scene analysis in multi-talker environments”, ISAAR, vol. 7, pp. 357-364, Apr. 2020.