Ruan, D;
Mo, R;
Yan, Y;
Chen, S;
Xue, J-H;
Wang, H;
(2022)
Adaptive Deep Disturbance-Disentangled Learning for Facial Expression Recognition.
International Journal of Computer Vision
, 130
pp. 455-477.
10.1007/s11263-021-01556-7.
Preview |
Text
DelianRuan-IJCV-accepted.pdf - Accepted Version Download (5MB) | Preview |
Abstract
In this paper, we propose a novel adaptive deep disturbance-disentangled learning (ADDL) method for effective facial expression recognition (FER). ADDL involves a two-stage learning procedure. First, a disturbance feature extraction model is trained to identify multiple disturbing factors on a large-scale face database involving disturbance label information. Second, an adaptive disturbance-disentangled model, which contains a global shared subnetwork and two task-specific subnetworks, is designed and learned to explicitly disentangle disturbing factors from facial expression images. In particular, the expression subnetwork leverages a multi-level attention mechanism to extract expression-specific features, while the disturbance subnetwork embraces a new adaptive disturbance feature learning module to extract disturbance-specific features based on adversarial transfer learning. Moreover, a mutual information neural estimator is adopted to minimize the correlation between expression-specific and disturbance-specific features. Extensive experimental results on both in-the-lab FER databases (including CK+, MMI, and Oulu-CASIA) and in-the-wild FER databases (including RAF-DB, SFEW, Aff-Wild2, and AffectNet) show that our proposed method consistently outperforms several state-of-the-art FER methods. This clearly demonstrates the great potential of disturbance disentanglement for FER. Our code is available at https://github.com/delian11/ADDL.
Type: | Article |
---|---|
Title: | Adaptive Deep Disturbance-Disentangled Learning for Facial Expression Recognition |
Open access status: | An open access version is available from UCL Discovery |
DOI: | 10.1007/s11263-021-01556-7 |
Publisher version: | https://doi.org/10.1007/s11263-021-01556-7 |
Language: | English |
Additional information: | This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions. |
Keywords: | Facial expression recognition, Multi-task learning, Adversarial transfer learning, Multi-level attention |
UCL classification: | UCL UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences > Dept of Statistical Science |
URI: | https://discovery-pp.ucl.ac.uk/id/eprint/10141361 |
Archive Staff Only
View Item |