Comparison of Analysis Method
Brian T. Luke (lukeb@ncifcrf.gov)Return to Contents
As stated in Identifying Putative Biomarkers, a total of 10 methods are available in the BioMarker Development Kit (BMDK) to search a dataset of feature intensities for putative biomarkers. Included in the description of each method is an examination of the minimum strength a feature must have to have a 50% probability or better of obtaining a score that is better than a feature with no information. Two forms of a putative biomarker are examined, as shown in the figure to the right. For a feature of type Feature-a, the relative strength of the putative biomarker is determined by Za, which represents the extent to which the maximum intensity for samples in one category exceeds that of the other. For example, if Za=30, the range of intensities for one category is only 70% that of the other, or approximately 85% of all samples have another sample of a different category with the same approximate intensity.
The following table lists the minimum value of Za needed for a peak of type Feature-a to have a 50% probability or higher of obtaining a better score than a non-informative peak as a function of the number of Cases and Controls (Each).
Each | catboot | student | dtgini | dtinfg | nnfeat | chisq | kruswal | kolsmir | extreme | vartest |
30 | 60 | 55 | 50 | 40 | 55 | 65 | 65 | 60 | 45 | 50 |
45 | 50 | 40 | 35 | 35 | 50 | 45 | 55 | 45 | 40 | 45 |
60 | 50 | 35 | 30 | 30 | 45 | 40 | 45 | 45 | 30 | 35 |
90 | 45 | 30 | 25 | 25 | 35 | 35 | 35 | 35 | 20 | 40 |
150 | 35 | 30 | 15 | 15 | 30 | 30 | 25 | 25 | 10 | 30 |
300 | 25 | 20 | 10 | 10 | 20 | 20 | 20 | 20 | 10 | 20 |
For the largest dataset (300 Cases and 300 Controls), dtgini, dtinfg and extreme only require Za to be 10. This means that if one category has a maximum intensity that is 90% of the other so that 95% of all samples have intensity in the overlapped region. For the smallest dataset (30 Cases and 30 Controls) dtinfg requires that one category have a maximum intensity that is 60% of the other while catboot and kolsmir require that it be 40% or less.
For putative biomarkers of type Feature-b, the following table lists the minimum value of 2Zb needed to find at least 50% of Feature-b peaks with scores better than non-informative peaks as a function of the number of Cases and Controls (Each).
Each | catboot | student | dtgini | dtinfg | nnfeat | chisq | kruswal | kolsmir | extreme | vartest |
30 | 95 | 55 | 85 | 70 | 70 | 100 | 80 | 95 | 75 | 60 |
45 | 80 | 50 | 60 | 60 | 60 | 65 | 65 | 70 | 55 | 60 |
60 | 75 | 45 | 55 | 50 | 55 | 55 | 50 | 70 | 55 | 45 |
90 | 60 | 35 | 45 | 40 | 40 | 40 | 40 | 50 | 35 | 45 |
150 | 50 | 30 | 25 | 25 | 35 | 35 | 30 | 40 | 15 | 30 |
300 | 30 | 20 | 15 | 10 | 25 | 20 | 25 | 30 | 10 | 20 |
For the largest dataset dtinfg and extreme have at least 50% probability of identifying a putative biomarker if there is a 95% overlap in the ranges of intensity for the two categories; again with 95% of all samples having intensity in the overlapping region. For the smallest dataset, student has at least a 50% probability of identifying the feature if there is at most a 72.5% overlap in the intensity ranges while for chisq this overlap can be at most 50%.
The major goal of this exercise is to demonstrate that there is a limit to the level of detection of a putative biomarker using any of these methods. For the largest dataset examined, at least 5% of the samples (30 samples) must have an intensity value in a range not covered by samples in the other category. As the sample size decreases, the fraction of samples that must have intensity values in a range not covered by the other category increases. This means that if a single category is composed of multiple States, and at least one of the States contains a small fraction of the samples, identifying a marker for this State may only be likely if the total number of samples in this category, and therefore the number of samples in this State, is reasonably large.
It should be stressed that these limits of detection are only approximate. In some cases one value of Za or 2Zb finds slightly below 50% of the features. Since the tests are only performed at 5% increments, the minimum level shown in these tables identifies significantly more than 50% of the features. In addition, these artificial features are produced using a random number generator with a uniform distribution. If the distribution of intensity values is different from uniform, it may be harder or easier to identify a putative biomarker. For example, if the distributions of intensity values have a Gaussian or normal distribution about the mean, finding a putative biomarker may be considerably harder. Conversely, a putative biomarker would be easier to identify if it has an excess density of intensity values in the range not covered by samples in the other category.
(Last updated 4/30/07)