Franceschelli, Giorgio;
Musolesi, Mirco;
(2023)
Reinforcement Learning for Generative AI: State of the Art, Opportunities and Open Research Challenges.
Journal of Artificial Intelligence Research
, 79
pp. 417-446.
10.1613/jair.1.15278.
Preview |
Text
Musolesi_Reinforcement Learning for Generative AI_VoR.pdf Download (393kB) | Preview |
Abstract
Generative Artificial Intelligence (AI) is one of the most exciting developments in Computer Science of the last decade. At the same time, Reinforcement Learning (RL) has emerged as a very successful paradigm for a variety of machine learning tasks. In this survey, we discuss the state of the art, opportunities and open research questions in applying RL to generative AI. In particular, we will discuss three types of applications, namely, RL as an alternative way for generation without specified objectives; as a way for generating outputs while concurrently maximizing an objective function; and, finally, as a way of embedding desired characteristics, which cannot be easily captured by means of an objective function, into the generative process. We conclude the survey with an in-depth discussion of the opportunities and challenges in this fascinating emerging area.
Type: | Article |
---|---|
Title: | Reinforcement Learning for Generative AI: State of the Art, Opportunities and Open Research Challenges |
Open access status: | An open access version is available from UCL Discovery |
DOI: | 10.1613/jair.1.15278 |
Publisher version: | http://dx.doi.org/10.1613/jair.1.15278 |
Language: | English |
Additional information: | © 2024 The Authors. Published by AI Access Foundation under Creative Commons Attribution License CC BY 4.0 (https://creativecommons.org/licenses/by/4.0/). |
UCL classification: | UCL UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science |
URI: | https://discovery-pp.ucl.ac.uk/id/eprint/10187531 |
Archive Staff Only
![]() |
View Item |