Potential of self-conducted speech audiometry with smart speakers
Keywords:speech audiometry, smart home, ASR, matrix sentence test, OLSA, smart speaker, unsupervised measurement
Speech audiometry in noise based on matrix sentence tests is an important diagnostic tool to assess the speech reception threshold (SRT) of a subject, i.e., the signal-to-noise ratio corresponding to 50% intelligibility. Although the matrix test format allows for self-conducted measurements by applying a visual, closed response format, these tests are mostly performed in open response format with an experimenter entering the correct/incorrect responses (expert-conducted). Using automatic speech recognition (ASR) enables self- conducted measurements without the need of visual presentation of the response alternatives. A combination of these self-conducted measurement procedures with signal presentation via smart speakers could be used to assess individual speech intelligibility in an individual listening environ- ment. Therefore, this paper compares self-conducted SRT measurements using smart speakers with expert-conducted lab measurements. With smart speakers, the experimenter has no control over the absolute presentation level, mode of presentation (headphones vs. loudspeaker), potential errors from the automated response logging, and room acoustics. We present the differences between measurements in the lab and with a smart speaker for normal- hearing, mildly hearing-impaired and moderate hearing-impaired subjects in low, medium, and high reverberation.
Arlinger, S. (2003) “Negative consequences of uncorrected hearing loss - a review.” Int. J. Audiol., 42(2), S17–S20. doi: 10.3109/14992020309074639
Boumpa, E., Gkogkidis, A., Charalampou, I., Ntaliani, A., Kakarountas, A., and Kokkinos, V. (2019). “An Acoustic-Based Smart Home System for People Suffering from Dementia”. Technol., 7(1), 29. doi: 10.3390/technologies7010029
Brand, T., and Kollmeier, B. (2002). “Efficient adaptive procedures for threshold and concurrent slope estimates for psychophysics and speech intelligibility tests”. J. Acoust. Soc. Am., 111(6), 2801–2810. doi: 10.1121/1.1479152
Kollmeier, B., Warzybok, A., Hochmuth, S., Zokoll, M. A., Uslar, V., Brand, T., and Wagener, K. C. (2015). “The multilingual matrix test: Principles, applications, and comparison across languages: A review”. Int. J. Audiol., 54(sup2), 3–16. doi: 10.3109/14992027.2015.1020971
Mathers, C., Smith, A., and Concha, M. (2001). “Global burden of hearing loss in the year 2000.” Global Burden of Disease 2000, 18(4), 1–30.
Nuesse, T., Wiercinski, B., Brand, T. and Holube, I. (2019). “Measuring Speech Recognition With a Matrix Test Using Synthetic Speech.” Trends Hear. doi: 10.1177/2331216519862982
Ooster, J., Huber, R., Kollmeier, B., and Meyer, B. T. (2018). “Evaluation of an automated speech-controlled listening test with spontaneous and read responses.” Speech Commun., 98, 85–94. doi: 10.1016/j.specom.2018.01.005
Ooster, J., Porysek Moreta, P. N., Bach, J.-H., Holube, I. and Meyer, B.T. (2019). ”Computer, test my hearing: Accurate speech audiometry with smart speakers” Proceedings of the Interspeech 2019, Graz, Austria. doi 10.21437/Interspeech.2019-2118
Vora, J., Tanwar, S., Tyagi, S., Kumar, N., and Rodrigues, J. J. P. C. (2017). “Home- based exercise system for patients using IoT enabled smart speaker”. In 2017 IEEE 19th International Conference on e-Health Networking, Applications and Services, Healthcom. doi: 10.1109/HealthCom.2017.8210826
Wagener, K. C., Ku ̈hne, V., and Kollmeier, B. (1999). “Entwicklung und Evaluation eines Satztests fu ̈r die deutsche Sprache I-III: Design, Optimierung und Evalua- tion des Oldenburger Satztests” (Development and evaluation of a German speech intelligibility test. Part I-III: Design, optimization and evaluation of the Oldenburg sentence test). Z Audiol., 38(1-3), 4-15, 44-56, 86-95.
Wagener, K. C., and Brand, T. (2005). “Sentence intelligibility in noise for listeners with normal hearing and hearing impairment: Influence of measure- ment procedure and masking parameters”. Int. J. Audiol., 44(3), 144–156. doi: 10.1080/14992020500057517
Wagener K., Hochmuth S., Ahrlich M., Zokoll M. and Kollmeier B. (2014), “Der weibliche Oldenburger Satztest”. 17. DGA Jahrestagung, Oldenburg, Germany.
How to Cite
Authors who publish with this journal agree to the following terms:
a. Authors retain copyright* and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
b. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
c. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).
*From the 2017 issue onward. The Danavox Jubilee Foundation owns the copyright of all articles published in the 1969-2015 issues. However, authors are still allowed to share the work with an acknowledgement of the work's authorship and initial publication in this journal.