U.S. Census Bureau

Overview of Record Linkage and Current Research Directions

William E. Winkler

KEY WORDS: approximate string comparison; unsupervised and semi-supervised learning; data extraction and standardization; register maintenance


This paper provides background on record linkage methods that can be used to combine data from a variety of sources, such as person lists and business lists. It also describes some areas of current research.


Source: U.S. Census Bureau, Statistical Research Division

Created: February 8, 2006
Last revised: February 8, 2006