UCL Discovery Stage
UCL home » Library Services » Electronic resources » UCL Discovery Stage

Understanding Naturalistic Facial Expressions with Deep Learning and Multimodal Large Language Models

Bian, Yifan; Küster, Dennis; Liu, Hui; Krumhuber, Eva G; (2024) Understanding Naturalistic Facial Expressions with Deep Learning and Multimodal Large Language Models. Sensors , 24 (1) , Article 126. 10.3390/s24010126. Green open access

[thumbnail of sensors-24-00126-v2.pdf]
Preview
Text
sensors-24-00126-v2.pdf - Published Version

Download (735kB) | Preview

Abstract

This paper provides a comprehensive overview of affective computing systems for facial expression recognition (FER) research in naturalistic contexts. The first section presents an updated account of user-friendly FER toolboxes incorporating state-of-the-art deep learning models and elaborates on their neural architectures, datasets, and performances across domains. These sophisticated FER toolboxes can robustly address a variety of challenges encountered in the wild such as variations in illumination and head pose, which may otherwise impact recognition accuracy. The second section of this paper discusses multimodal large language models (MLLMs) and their potential applications in affective science. MLLMs exhibit human-level capabilities for FER and enable the quantification of various contextual variables to provide context-aware emotion inferences. These advancements have the potential to revolutionize current methodological approaches for studying the contextual influences on emotions, leading to the development of contextualized emotion models.

Type: Article
Title: Understanding Naturalistic Facial Expressions with Deep Learning and Multimodal Large Language Models
Open access status: An open access version is available from UCL Discovery
DOI: 10.3390/s24010126
Publisher version: http://dx.doi.org/10.3390/s24010126
Language: English
Additional information: Copyright © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Keywords: Automatic facial expression recognition; naturalistic context; deep learning; multimodal large language model
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Div of Psychology and Lang Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Div of Psychology and Lang Sciences > Experimental Psychology
URI: https://discovery-pp.ucl.ac.uk/id/eprint/10184687
Downloads since deposit
60Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item