U.S. Department of Commerce

Research Reports

You are here: Census.govSubjects A to ZResearch Reports Sorted by Year › Abstract of RRS2004/02
Skip top of page navigation

An Adaptive String Comparator for Record Linkage

William E. Yancey

KEY WORDS:

ABSTRACT

We develop a string comparator based on edit distance that uses variable edit-step costs derived from training data. Using first and last name data from census files, we compare the performance of this string comparator with one without variable edit step costs and with the Jaro-Winkler string comparator, which is standardly used in the Census Bureau's record linkage software.

CITATION:

Source: U.S. Census Bureau, Statistical Research Division

Created: 25-FEB-2004


Source: U.S. Census Bureau | Statistical Research Division | (301) 763-3215 (or chad.eric.russell@census.gov) |   Last Revised: October 08, 2010