Centers for Disease Control and Prevention
For Immunization Information, call the
CDC-INFO Contact Center:
English and Spanish
800-CDC-INFO
800-232-4636
TTY
888-232-6348

Registries: Immunization Information Systems
Deduplication Test Cases

NIP has developed a toolkit to help immunization information systems (IIS) evaluate their deduplication algorithms. The toolkit lets registries assess how well their system prevents or removes duplicate records, and its data and procedures can help identify strengths and weaknesses in those algorithms. The test data set consists of test cases that represent known duplicate record problems in real data, based on information provided by various IIS personnel. The test cases are fictitious examples; they do not correspond to information on real children. The evaluation tool application calculates sensitivity and specificity values for the IIS's algorithms from the test results. Sensitivity measures how well the system recognizes known duplicate records. Specificity measures how well the system avoids misidentifying non-duplicate records as duplicates: the more often non-duplicate records are misidentified, the lower the specificity.
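As a rough illustration of the two measures described above, the following minimal sketch uses the standard definitions: sensitivity is the share of true duplicates that were flagged, and specificity is the share of non-duplicates that were left unflagged. It is not the Evaluation Tool's own code; the function name, the input format, and the sample outcomes are all hypothetical.

```python
def sensitivity_specificity(results):
    """Compute (sensitivity, specificity) from test outcomes.

    `results` is an iterable of (is_true_duplicate, flagged_as_duplicate)
    boolean pairs, one per test case. This input format is an assumption
    made for illustration; it is not the toolkit's file format.
    """
    tp = fn = tn = fp = 0
    for is_true_duplicate, flagged in results:
        if is_true_duplicate and flagged:
            tp += 1  # duplicate correctly recognized
        elif is_true_duplicate:
            fn += 1  # duplicate missed
        elif flagged:
            fp += 1  # non-duplicate misidentified as a duplicate
        else:
            tn += 1  # non-duplicate correctly left alone

    # Return None when a rate is undefined (no cases of that kind).
    sensitivity = tp / (tp + fn) if (tp + fn) else None
    specificity = tn / (tn + fp) if (tn + fp) else None
    return sensitivity, specificity


# Hypothetical outcomes: 4 true duplicates (3 flagged, 1 missed) and
# 5 non-duplicates (1 wrongly flagged, 4 left alone).
outcomes = (
    [(True, True)] * 3 + [(True, False)]
    + [(False, True)] + [(False, False)] * 4
)
print(sensitivity_specificity(outcomes))
```

With these made-up outcomes, sensitivity is 3 of 4 and specificity is 4 of 5. The actual values for an IIS come from running the Evaluation Tool against the supplied test data set.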

Download deduplication toolkit (.zip file format)
  1. Click on the above link to download a zip file containing the toolkit components:
     • Toolkit User Manual (EvaluationToolManual.doc) - a document that will guide the user through installation of the tool and use of the test data set
     • Evaluation Tool (DupEval.cab) - a program that will calculate sensitivity and specificity values for an IIS's deduplication algorithm
     • Test Data Set (DupTestData.csw) - a file containing test cases representative of known duplicate record problems
  2. Run the setup.exe program to install the Evaluation Tool on a PC. Note that for older Windows 95 systems you may first need to run DCOM98.exe, which is included in the kit.
If you have any questions, please send them via email to siisclear@cdc.gov.



National Immunization Program (NIP)

This page last modified on February 15, 2006

   

Department of Health and Human Services