The U.S. Census Bureau

Disclosure Risk Assessment in Perturbative Microdata Protection

William E. Yancey, William E. Winkler, Robert H. Creecy

KEY WORDS: additive noise, mixtures, rank swapping, EM Algorithm, record linkage

ABSTRACT

This paper describes methods for data perturbation that include rank swapping and additive noise. It also describes enhanced methods of re-identification using probabilistic record linkage. The empirical comparisons use variants of the framework for measuring information loss and re-identification risk that were introduced by Domingo-Ferrer and Mateo-Sanz.

CITATION:

Source: U.S. Census Bureau, Statistical Research Division

Created: 01-FEB-2002
Last revised: February 01 2002