The appeal of the gamma family distribution to protect the confidentiality of contingency tables

Jackson, James; Mitra, Robin; Francis, Brian; Dove, Iain; (2024) The appeal of the gamma family distribution to protect the confidentiality of contingency tables. ArXiv: Ithaca, NY, USA. Green open access

discretisedgammapaper.pdf - Submitted Version


Abstract

Administrative databases, such as the English School Census (ESC), are rich sources of information that are potentially useful for researchers. For such data sources to be made available, however, strict guarantees of privacy would be required. To achieve this, synthetic data methods can be used. Such methods, when protecting the confidentiality of tabular data (contingency tables), often utilise the Poisson or Poisson-mixture distributions, such as the negative binomial (NBI). These distributions, however, are either equidispersed (in the case of the Poisson) or overdispersed (e.g. in the case of the NBI), which results in excessive noise being applied to large low-risk counts. This paper proposes the use of the (discretized) gamma family (GAF) distribution, which allows noise to be applied in a more bespoke fashion. Specifically, it allows less noise to be applied as cell counts become larger, providing an optimal balance in relation to the risk-utility trade-off. We illustrate the suitability of the GAF distribution on an administrative-type data set that is reminiscent of the ESC.
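The dispersion argument in the abstract can be illustrated with a minimal sketch. It assumes the gamma family has variance sigma^2 * mu^nu (the parameterisation used for GAF in the gamlss R package) and that discretisation is done by flooring a continuous gamma draw; neither detail is stated in the abstract, and the function names and parameter values below are hypothetical. The Poisson and NBI variance functions (mu and mu + sigma * mu^2) are the standard ones.

```python
import math
import random


def poisson_var(mu):
    """Poisson: variance equals the mean (equidispersion)."""
    return mu


def nbi_var(mu, sigma):
    """Negative binomial with mean mu: variance mu + sigma * mu^2 (overdispersion)."""
    return mu + sigma * mu ** 2


def gaf_var(mu, sigma, nu):
    """ASSUMED gamma family variance function: sigma^2 * mu^nu."""
    return sigma ** 2 * mu ** nu


def sample_discretised_gamma(mu, sigma, nu, n, rng):
    """Draw n noisy counts centred on the original count mu.

    A gamma with the target mean and variance is sampled and floored.
    Flooring is an ASSUMED discretisation; it shifts the mean down slightly.
    """
    var = gaf_var(mu, sigma, nu)
    shape = mu ** 2 / var   # gamma shape so that mean = shape * scale = mu
    scale = var / mu        # gamma scale so that variance = shape * scale^2 = var
    return [math.floor(rng.gammavariate(shape, scale)) for _ in range(n)]


if __name__ == "__main__":
    rng = random.Random(0)
    sigma, nu = 1.0, 0.5    # hypothetical values; nu < 1 damps noise on large counts
    for mu in (5, 50, 500, 5000):
        sds = (
            math.sqrt(poisson_var(mu)),
            math.sqrt(nbi_var(mu, 0.1)),
            math.sqrt(gaf_var(mu, sigma, nu)),
        )
        draws = sample_discretised_gamma(mu, sigma, nu, 5, rng)
        print(mu, [round(s, 2) for s in sds], draws)
```

Under these assumptions, choosing nu below 1 makes the noise standard deviation grow more slowly than the Poisson's square root of mu, so large counts are perturbed proportionally less, while sigma controls the overall noise level.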

Type: Working / discussion paper
Title: The appeal of the gamma family distribution to protect the confidentiality of contingency tables
Open access status: An open access version is available from UCL Discovery
Publisher version: https://doi.org/10.48550/arXiv.2408.02513
Language: English
Additional information: This is an Open Access paper published under a Creative Commons Attribution 4.0 International (CC BY 4.0) Licence (https://creativecommons.org/licenses/by/4.0/).
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences > Dept of Statistical Science
URI: https://discovery-pp.ucl.ac.uk/id/eprint/10201418