UCL Discovery Stage
UCL home » Library Services » Electronic resources » UCL Discovery Stage

Factored Neural Representation for Scene Understanding

Wong, Yu‐Shiang; Mitra, Niloy J; (2023) Factored Neural Representation for Scene Understanding. Computer Graphics Forum 10.1111/cgf.14911. (In press). Green open access

[thumbnail of Computer Graphics Forum - 2023 - Wong.pdf]
Preview
Text
Computer Graphics Forum - 2023 - Wong.pdf - Published Version

Download (44MB) | Preview

Abstract

A long-standing goal in scene understanding is to obtain interpretable and editable representations that can be directly constructed from a raw monocular RGB-D video, without requiring specialized hardware setup or priors. The problem is significantly more challenging in the presence of multiple moving and/or deforming objects. Traditional methods have approached the setup with a mix of simplifications, scene priors, pretrained templates, or known deformation models. The advent of neural representations, especially neural implicit representations and radiance fields, opens the possibility of end-to-end optimization to collectively capture geometry, appearance, and object motion. However, current approaches produce global scene encoding, assume multiview capture with limited or no motion in the scenes, and do not facilitate easy manipulation beyond novel view synthesis. In this work, we introduce a factored neural scene representation that can directly be learned from a monocular RGB-D video to produce object-level neural presentations with an explicit encoding of object movement (e.g., rigid trajectory) and/or deformations (e.g., nonrigid movement). We evaluate ours against a set of neural approaches on both synthetic and real data to demonstrate that the representation is efficient, interpretable, and editable (e.g., change object trajectory). Code and data are available at: http://geometry.cs.ucl.ac.uk/projects/2023/factorednerf/.

Type: Article
Title: Factored Neural Representation for Scene Understanding
Open access status: An open access version is available from UCL Discovery
DOI: 10.1111/cgf.14911
Publisher version: https://doi.org/10.1111/cgf.14911
Language: English
Additional information: © 2023 The Authors. Computer Graphics Forum published by Eurographics - The European Association for Computer Graphics and John Wiley & Sons Ltd. This is an open access article under the terms of the Creative Commons Attribution-NonCommercial License (http://creativecommons.org/licenses/by-nc/4.0/).
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery-pp.ucl.ac.uk/id/eprint/10175443
Downloads since deposit
1,292Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item