TediGAN: Text-Guided Diverse Face Image Generation and Manipulation

Advanced search
Browse by:

Department | Year

UCL Theses | Latest

Deposit your research

TediGAN: Text-Guided Diverse Face Image Generation and Manipulation

Xia, W; Yang, Y; Xue, J; Wu, B; (2021) TediGAN: Text-Guided Diverse Face Image Generation and Manipulation. In: (Proceedings) IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE (In press). Green open access

[thumbnail of WeihaoXia-00743-final.pdf]

Preview

Text
WeihaoXia-00743-final.pdf - Accepted Version
Download (2MB) | Preview

Abstract

In this work, we propose TediGAN, a novel framework for multi-modal image generation and manipulation with textual descriptions. The proposed method consists of three components: StyleGAN inversion module, visual-linguistic similarity learning, and instance-level optimization. The inversion module maps real images to the latent space of a well-trained StyleGAN. The visual-linguistic similarity learns the text-image matching by mapping the image and text into a common embedding space. The instancelevel optimization is for identity preservation in manipulation. Our model can produce diverse and high-quality images with an unprecedented resolution at 10242 . Using a control mechanism based on style-mixing, our TediGAN inherently supports image synthesis with multi-modal inputs, such as sketches or semantic labels, with or without instance guidance. To facilitate text-guided multimodal synthesis, we propose the Multi-Modal CelebA-HQ, a large-scale dataset consisting of real face images and corresponding semantic segmentation map, sketch, and textual descriptions. Extensive experiments on the introduced dataset demonstrate the superior performance of our proposed method. Code and data are available at https://github.com/weihaox/TediGAN.

Type:	Proceedings paper
Title:	TediGAN: Text-Guided Diverse Face Image Generation and Manipulation
Event:	IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Dates:	19 June 2021 - 25 June 2021
Open access status:	An open access version is available from UCL Discovery
Publisher version:	https://ieeexplore.ieee.org/Xplore/home.jsp
Language:	English
Additional information:	This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
UCL classification:	UCL UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences > Dept of Statistical Science
URI:	https://discovery-pp.ucl.ac.uk/id/eprint/10131010

Downloads since deposit

9,078Downloads

Download activity - last month

Download activity - last 12 months

Downloads by country - last 12 months

Archive Staff Only

View Item