UCL Discovery Stage
UCL home » Library Services » Electronic resources » UCL Discovery Stage

Prediction of sleepiness ratings from voice by man and machine

Huckvale, M; Beke, A; Ikushima, M; (2020) Prediction of sleepiness ratings from voice by man and machine. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. (pp. pp. 4571-4575). ISCA: Virtual event. Green open access

[thumbnail of 1601.pdf]
Preview
Text
1601.pdf - Published Version

Download (495kB) | Preview

Abstract

This paper looks in more detail at the Interspeech 2019 computational paralinguistics challenge on the prediction of sleepiness ratings from speech. In this challenge, teams were asked to train a regression model to predict sleepiness from samples of the Düsseldorf Sleepy Language Corpus (DSLC). This challenge was notable because the performance of all entrants was uniformly poor, with even the winning system only achieving a correlation of r=0.37. We look at whether the task itself is achievable, and whether the corpus is suited to training a machine learning system for the task. We perform a listening experiment using samples from the corpus and show that a group of human listeners can achieve a correlation of r=0.7 on this task, although this is mainly by classifying the recordings into one of three sleepiness groups. We show that the corpus, because of its construction, confounds variation with sleepiness and variation with speaker identity, and this was the reason that machine learning systems failed to perform well. We conclude that sleepiness rating prediction from voice is not an impossible task, but that good performance requires more information about sleepy speech and its variability across listeners than is available in the DSLC corpus

Type: Proceedings paper
Title: Prediction of sleepiness ratings from voice by man and machine
Event: Interspeech.2020
Open access status: An open access version is available from UCL Discovery
DOI: 10.21437/Interspeech.2020-1601
Publisher version: http://dx.doi.org/10.21437/Interspeech.2020-1601
Language: English
Additional information: This version is the version of record. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Sleepiness, Voice, Machine Learning, Paralinguistics
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Div of Psychology and Lang Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Div of Psychology and Lang Sciences > Speech, Hearing and Phonetic Sciences
URI: https://discovery-pp.ucl.ac.uk/id/eprint/10118777
Downloads since deposit
10,108Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item