UCL Discovery Stage
UCL home » Library Services » Electronic resources » UCL Discovery Stage

Optimal transport for Latent variable models

Gaujac, Benoit; (2023) Optimal transport for Latent variable models. Doctoral thesis (Ph.D), UCL (University College London). Green open access

[thumbnail of Benoit Gaujac Thesis.pdf]
Preview
Text
Benoit Gaujac Thesis.pdf - Accepted Version

Download (24MB) | Preview

Abstract

Generative models are probabilistic models which aim at approximating the process by which a given dataset is generated. They are well suited to the unsupervised learning setup and constitute intuitive and powerful models providing an interpretable representation of the data. However, learning even simple generative models can be challenging in the traditional Maximum Likelihood framework, due to the inflexibility of the training objective used. Motivated by its topological properties, we will show in this thesis how methods based on Optimal Transport can overcome these difficulties and offer competitive alternatives. Firstly, we show that training generative models that combine both discrete and continuous latent variables can be significantly more effective when using Optimal Transport methods. Such intuitive models are highly motivated by the structure of many real-world datasets but remain hard to train with the most common likelihood-based method, often resulting in the collapse of the discrete latent variables. Secondly, we propose a novel approach based on Optimal Transport to training models with fully Markovian deep-latent hierarchies. Probabilistic models with deep latent-variable structures have powerful modelling capacity, but common approaches often fail to leverage deep-latent hierarchies without complex inference and optimisation schemes. Our method successfully leverages the whole hierarchy of the models and shows competitive generative performance while learning smooth latent manifolds through every layers of the latent hierarchy. Finally, we introduce a new training objective to improve the learning of interpretable and disentangled representation of the data. Our method achieves competitive disentanglement relative to state-of-the-art techniques whilst improving the reconstruction and generation performances of the models.

Type: Thesis (Doctoral)
Qualification: Ph.D
Title: Optimal transport for Latent variable models
Open access status: An open access version is available from UCL Discovery
Language: English
Additional information: Copyright © The Author 2023. Original content in this thesis is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) Licence (https://creativecommons.org/licenses/by-nc/4.0/). Any third-party copyright material present remains the property of its respective owner(s) and is licensed under its existing terms. Access may initially be restricted at the author’s request.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery-pp.ucl.ac.uk/id/eprint/10176121
Downloads since deposit
1,764Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item