Published Population Data from a Variety of STR Systems


Due to the fact that DNA typing is only an examination of a DNA sample's sequence and/or length at discrete locations, a match in DNA typing is always a statistical exercise. (Currently, time and expense limit an examination of an individual's entire genome, which would show unique identity for all but identical twins.) In order to determine the probability that a particular genotype might occur at random in a population, population data must be gathered to make an estimate of the frequency of each possible allele and genotype. Usually a sample size of greater than 100 samples is sufficient to make reliable projections about a genotype's frequency in a larger population (see Chakraborty, R. (1992) Human Biology 64:141-159).

The data collected and presented here represent information from published sources (see reference listing). The population sampled is listed by its reference number according to the commercial kit used to collect the data. We hope this information will be helpful to the DNA typing community for comparing results between populations.

Population Survey provided by Brian Burritt (San Diego Police Department)

Reference Listing of Published Sources (219 publications as of 08/27/2008)

Download OmniPop program (~1.7 Mbytes macro-enabled Excel file developed by Brian Burritt)

NIST Population Data collected on U.S. Sample Groups

 

Programs that permit on-line STR profile frequency calculations

ENFSI DNA WG STR Population Database (http://www.str-base.org/index.php): uses 5,699 samples from 24 European populations with SGM Plus kit loci

Canadian Random Match Calculator for Profiler Plus and COfiler kits -  (http://www.csfs.ca/pplus/profiler.htm); see also http://www.csfs.ca/databases/index.htm for information on a variety of populations

 


Links to Original Information on STRBase regarding Population Data


08/27/08

RETURN TO HOMEPAGE