Mu, J;
Zhong, V;
Raileanu, R;
Jiang, M;
Goodman, N;
Rocktäschel, T;
Grefenstette, E;
(2022)
Improving Intrinsic Exploration with Language Abstractions.
In:
Advances in Neural Information Processing Systems.
NIPS
Preview |
PDF
7115_improving_intrinsic_exploratio.pdf - Published Version Download (2MB) | Preview |
Abstract
Reinforcement learning (RL) agents are particularly hard to train when rewards are sparse. One common solution is to use intrinsic rewards to encourage agents to explore their environment. However, recent intrinsic exploration methods often use state-based novelty measures which reward low-level exploration and may not scale to domains requiring more abstract skills. Instead, we explore language as a general medium for highlighting relevant abstractions in an environment. Unlike previous work, we evaluate whether language can improve over existing exploration methods by directly extending (and comparing to) competitive intrinsic exploration baselines: AMIGo (Campero et al., 2021) and NovelD (Zhang et al., 2021). These language-based variants outperform their non-linguistic forms by 47-85% across 13 challenging tasks from the MiniGrid and MiniHack environment suites.
Type: | Proceedings paper |
---|---|
Title: | Improving Intrinsic Exploration with Language Abstractions |
Event: | 36th Conference on Neural Information Processing Systems (NeurIPS 2022) |
Open access status: | An open access version is available from UCL Discovery |
Publisher version: | https://papers.nips.cc/ |
Language: | English |
Additional information: | This version is the version of record. For information on re-use, please refer to the publisher’s terms and conditions |
UCL classification: | UCL UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science |
URI: | https://discovery-pp.ucl.ac.uk/id/eprint/10173892 |
Archive Staff Only
![]() |
View Item |