Due to the fact that DNA typing is only an examination of a DNA sample's sequence and/or length at discrete locations, a match in DNA typing is always a statistical exercise. (Currently, time and expense limit an examination of an individual's entire genome, which would show unique identity for all but identical twins.) In order to determine the probability that a particular genotype might occur at random in a population, population data must be gathered to make an estimate of the frequency of each possible allele and genotype. Usually a sample size of greater than 100 samples is sufficient to make reliable projections about a genotype's frequency in a larger population (see Chakraborty, R. (1992) Human Biology 64:141-159).
The data collected and presented here represent information from published sources (see reference listing). The population sampled is listed by its reference number according to the commercial kit used to collect the data. We hope this information will be helpful to the DNA typing community for comparing results between populations.
Population Survey provided by Brian Burritt (San Diego Police Department)
Reference Listing of Published Sources (219 publications as of 08/27/2008)
Download OmniPop program (~1.7 Mbytes macro-enabled Excel file developed by Brian Burritt)
- OmniPop calculates a user-inputted profile's frequency using allele frequencies from 202 published databases present in the program (updates will be made available in the future)
NIST Population Data collected on U.S. Sample Groups
Programs that permit on-line STR profile frequency calculations
ENFSI DNA WG STR Population Database (http://www.str-base.org/index.php): uses 5,699 samples from 24 European populations with SGM Plus kit loci
Canadian Random Match Calculator for Profiler Plus and COfiler kits - (http://www.csfs.ca/pplus/profiler.htm); see also http://www.csfs.ca/databases/index.htm for information on a variety of populations
Links to Original Information on STRBase regarding Population Data
08/27/08