Liu, Zhuoya;
Huckvale, Mark;
McGlashan, Julian;
(2022)
Automated Voice Pathology Discrimination from Continuous Speech Benefits from Analysis by Phonetic Context.
In:
Proceedings of Interspeech 2022.
(pp. pp. 2158-2162).
ISCA: Incheon, Korea.
Preview |
Text
Huckvale_Automated Voice Pathology Discrimination from Continuous Speech.pdf Download (689kB) | Preview |
Abstract
In contrast to previous studies that look only at discriminating pathological voice from the normal voice, in this study we focus on the discrimination between cases of spasmodic dysphonia (SD) and vocal fold palsy (VP) using automated analysis of speech recordings. The hypothesis is that discrimination will be enhanced by studying continuous speech, since the different pathologies are likely to have different effects in different phonetic contexts. We collected audio recordings of isolated vowels and of a read passage from 60 patients diagnosed with SD (N=38) or VP (N=22). Baseline classifiers on features extracted from the recordings taken as a whole gave a cross-validated unweighted average recall of up to 75% for discriminating the two pathologies. We used an automated method to divide the read passage into phone-labelled regions and built classifiers for each phone. Results show that the discriminability of the pathologies varied with phonetic context as predicted. Since different phone contexts provide different information about the pathologies, classification is improved by fusing phone predictions, to achieve a classification accuracy of 83%. The work has implications for the differential diagnosis of voice pathologies and contributes to a better understanding of their impact on speech.
Type: | Proceedings paper |
---|---|
Title: | Automated Voice Pathology Discrimination from Continuous Speech Benefits from Analysis by Phonetic Context |
Event: | 23rd INTERSPEECH Conference: Human and Humanizing Speech Technology |
Open access status: | An open access version is available from UCL Discovery |
DOI: | 10.21437/Interspeech.2022-10154 |
Publisher version: | https://doi.org/10.21437/Interspeech.2022-10154 |
Language: | English |
Additional information: | This version is the version of record. For information on re-use, please refer to the publisher’s terms and conditions. |
Keywords: | Voice pathology discrimination, phonetic context, continuous speech, machine learning |
UCL classification: | UCL UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Div of Psychology and Lang Sciences UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Div of Psychology and Lang Sciences > Speech, Hearing and Phonetic Sciences |
URI: | https://discovery-pp.ucl.ac.uk/id/eprint/10181494 |
Archive Staff Only
View Item |