Kamath, Gaurav;
Schuster, Sebastian;
Vajjala, Sowmya;
Reddy, Siva;
(2024)
Scope Ambiguities in Large Language Models.
Transactions of the Association for Computational Linguistics
, 12
pp. 738-754.
10.1162/tacl_a_00670.
Preview |
Text
tacl_a_00670.pdf - Published Version Download (674kB) | Preview |
Abstract
Sentences containing multiple semantic operators with overlapping scope often create ambiguities in interpretation, known as scope ambiguities. These ambiguities offer rich insights into the interaction between semantic structure and world knowledge in language processing. Despite this, there has been little research into how modern large language models treat them. In this paper, we investigate how different versions of certain autoregressive language models—GPT-2, GPT-3/3.5, Llama 2, and GPT-4—treat scope ambiguous sentences, and compare this with human judgments. We introduce novel datasets that contain a joint total of almost 1,000 unique scope-ambiguous sentences, containing interactions between a range of semantic operators, and annotated for human judgments. Using these datasets, we find evidence that several models (i) are sensitive to the meaning ambiguity in these sentences, in a way that patterns well with human judgments, and (ii) can successfully identify human-preferred readings at a high level of accuracy (over 90% in some cases).1
Type: | Article |
---|---|
Title: | Scope Ambiguities in Large Language Models |
Open access status: | An open access version is available from UCL Discovery |
DOI: | 10.1162/tacl_a_00670 |
Publisher version: | http://dx.doi.org/10.1162/tacl_a_00670 |
Language: | English |
Additional information: | Copyright © 2024 Association for Computational Linguistics. This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. For a full description of the license, please visit https://creativecommons.org/licenses/by/4.0/legalcode. |
UCL classification: | UCL UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Div of Psychology and Lang Sciences UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Div of Psychology and Lang Sciences > Linguistics |
URI: | https://discovery-pp.ucl.ac.uk/id/eprint/10193596 |
Archive Staff Only
![]() |
View Item |