Caddy, James;
Treude, Christoph;
Wagner, Markus;
Barr, Earl T;
(2025)
The role of surprisal in issue trackers.
Empirical Software Engineering
, 30
, Article 30. 10.1007/s10664-024-10587-w.
(In press).
![]() |
Text
The role of surprisal in issue trackers.pdf - Accepted Version Access restricted to UCL open access staff until 24 November 2025. Download (1MB) |
Abstract
Context: Software development creates and relies on a large volume of information, yet the volume of this information can make it challenging for developers to maintain an overview of all goings-on that a team and external actors contribute to a project. We posit that unexpected or “surprising” events could serve as important signposts amidst this information overload. These unexpected events may indicate underlying anomalies or emergent situations that require immediate attention. To explore this premise, our study leverages the concept of ‘surprisal’ from information theory to identify and quantify these unusual occurrences from the issues and pull requests of popular open-source software repositories.// Objective: Drawing from a previously published research protocol, our study investigates whether a correlation exists between the ‘surprisal’ of issues and their perceived importance or difficulty within software repositories.// Results: We performed a comprehensive analysis of approximately two million issues and pull requests, gathered from 1,270 repositories. Their ‘surprisal’ was then examined in relation to several indicative metrics of difficulty and perceived importance. Our results indicate only a weak correlation. This outcome underscores the need for further research to devise more effective strategies for helping developers prioritise issues.
Type: | Article |
---|---|
Title: | The role of surprisal in issue trackers |
DOI: | 10.1007/s10664-024-10587-w |
Publisher version: | https://doi.org/10.1007/s10664-024-10587-w |
Language: | English |
Additional information: | This version is the author-accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions. |
Keywords: | Self-information, n-gram, GitHub issues |
UCL classification: | UCL UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science |
URI: | https://discovery-pp.ucl.ac.uk/id/eprint/10203223 |
Archive Staff Only
![]() |
View Item |