UCL Discovery Stage
UCL home » Library Services » Electronic resources » UCL Discovery Stage

Self-Supervised Solution to the Control Problem of Articulatory Synthesis

Krug, Paul K; Birkholz, Peter; Gerazov, Branislav; Van Niekerk, Daniel R; Xu, Anqi; Xu, Yi; (2023) Self-Supervised Solution to the Control Problem of Articulatory Synthesis. In: Proceedings of the INTERSPEECH 2023. (pp. pp. 4329-4333). ISCA: Dublin, Ireland. Green open access

[thumbnail of krug23_interspeech.pdf]
Preview
Text
krug23_interspeech.pdf - Published Version

Download (566kB) | Preview

Abstract

Given an articulatory-to-acoustic forward model, it is a priori unknown how its motor control must be operated to achieve a desired acoustic result. This control problem is a fundamental issue of articulatory speech synthesis and the cradle of acousticto-articulatory inversion, a discipline which attempts to address the issue by the means of various methods. This work presents an end-to-end solution to the articulatory control problem, in which synthetic motor trajectories of Monte-Carlo-generated artificial speech are linked to input modalities (such as natural speech recordings or phoneme sequence input) via speakerindependent latent representations of a vector-quantized variational autoencoder. The proposed method is self-supervised and thus, in principle, synthesizer and speaker model independent.

Type: Proceedings paper
Title: Self-Supervised Solution to the Control Problem of Articulatory Synthesis
Event: INTERSPEECH 2023
Open access status: An open access version is available from UCL Discovery
DOI: 10.21437/Interspeech.2023-2173
Publisher version: https://doi.org/10.21437/Interspeech.2023-2173
Language: English
Additional information: This version is the version of record. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Acoustic-to-articulatory inversion, VQ-VAE
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Div of Psychology and Lang Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Div of Psychology and Lang Sciences > Speech, Hearing and Phonetic Sciences
URI: https://discovery-pp.ucl.ac.uk/id/eprint/10178237
Downloads since deposit
4,864Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item