Balancing disclosure risk against the loss of nonpublication

A nondisclosure policy for tabular data or microdata restricts release of information that could be related to a specific individual. Pannekoek and de Waal (1998) describe a rule that suppresses data release when the number of people in a cell defined by a rare characteristic falls below a fixed floor, and show how empirical Bayes methods can be used to improve the estimation of that number. We argue that the nondisclosure problem can be formulated as a decision problem in which one loss is associated with the possibility of disclosure and another with nonpublication of data. This analysis supports a decision on whether to disclose information in each cell, minimizing the expected sum of the two losses. We present arguments for several loss functions, considering both tabular and microdata releases, and illustrate their application to simple simulated data. (1998)

Alan M. Zaslavsky

Journal of Official Statistics

January 1, 1998

1998

Zaslavsky AM and Horton NJ

http://hbanaszak.mjr.uw.edu.pl/TempTxt/ZaslavskyHorton_1998_Balancing%20Disclosure%20Risk%20Against%20the%20Loss%20of%20NonPublication.pdf