Registries:
Immunization Information Systems
States/Cities/Territories
|
|
Deduplication
Test Cases |
NIP has developed a toolkit to assist immunization
information systems (IIS) in the evaluation
of their deduplication algorithms. This toolkit
will help registries assess their system's
ability to prevent/remove duplicate records.
The data and procedures in this toolkit can
help identify strengths and weaknesses in the
deduplication algorithms. The test data set
consists of test cases that are representative
of known duplicate record problems in real
data, based on the information provided by
various IIS personnel. These test cases
are fictitious examples; they do not correspond
to information on real children. The
evaluation tool application will calculate
sensitivity and specificity values for the
IIS's algorithms based on the test results.
The sensitivity value measures how well the
system performs at recognizing known duplicate
records. The specificity is the value that
reflects how accurate the duplicate record
detection is by measuring the rate at which
non-duplicate records are misidentified.
- Click
on the above link to download a zip file
containing the toolkit components:
-
Toolkit User Manual (EvaluationToolManual.doc)
- a document that will guide the user through
installation of the tool and use of the
test data set
-
Evaluation Tool (DupEval.cab) - a program
that will calculate sensitivity and specificity
values for an IIS's deduplication algorithm
- Test
Data Set (DupTestData.csw) - a file containing
test cases representative of known duplicate
record problems
- Run
the setup.exe program to install the Evaluation
Tool on a PC. *Note that for older Windows
95 systems you may first need to run DCOM98.exe
which is included in the kit.
If you have any questions, please send them
via email to siisclear@cdc.gov.
|