UCL Discovery Stage
UCL home » Library Services » Electronic resources » UCL Discovery Stage

Computational simulation of tone and focus perception ------ Sequential processing based on direct phonetic perception at syllable level

Chen, Yue; (2025) Computational simulation of tone and focus perception ------ Sequential processing based on direct phonetic perception at syllable level. Doctoral thesis (Ph.D), UCL (University College London). Green open access

[thumbnail of Yue Chen - PhD Thesis.pdf]
Preview
Text
Yue Chen - PhD Thesis.pdf - Accepted Version

Download (5MB) | Preview

Abstract

This thesis uses computational simulations of perception of lexical tone and prosodic focus to address a number of fundamental questions about the basic mechanisms of speech perception: (a) Is it necessary to extract intermediate featural representations of speech prior to perceptual categorisation? (b) Is it necessary to process variability through normalisation? (c) Is it possible to process different communicative functions independently even if they are encoded on the same acoustic dimension? And (d) Is it possible to sequentially process a global prosodic function syllable by syllable in perception? The first study focused on the perception of lexical tone and investigated the need for tonal feature extraction for Mandarin tone recognition. Computational tone recognisers were built with and without feature extraction. Several sets of pitch features were chosen as hypothetical sub-tonal features. The results show that direct tone recognition not only yields better performance than any of the feature extraction schemes, but also requires less computational complexity. It is suggested that prior feature extraction is unlikely to be the operational mechanism of speech perception. The second study focused on the concurrent perception of lexical tone and prosodic focus. Computational models were developed to simulate different perceptual strategies for both tone and focus. The results demonstrate that the neural network-based syllable-by-syllable processing of focus can generate the similar recognition patterns to human perception. With respect to the speech variability, the results show that pitch normalisation schemes do not benefit either tone or focus recognition. Furthermore, there is little difference between recognising tone and focus independently or as a tone-focus combination, suggesting that the two functions are sufficiently separated from each other in their f0 coding, despite sharing the same acoustic dimension.

Type: Thesis (Doctoral)
Qualification: Ph.D
Title: Computational simulation of tone and focus perception ------ Sequential processing based on direct phonetic perception at syllable level
Open access status: An open access version is available from UCL Discovery
Language: English
Additional information: Copyright © The Author 2025. Original content in this thesis is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) Licence (https://creativecommons.org/licenses/by-nc/4.0/). Any third-party copyright material present remains the property of its respective owner(s) and is licensed under its existing terms. Access may initially be restricted at the author’s request.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Div of Psychology and Lang Sciences
URI: https://discovery-pp.ucl.ac.uk/id/eprint/10206690
Downloads since deposit
136Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item