Appropriateness of some resampling-based inference...[Stat Med. 2007]

SearchDatabase nameforSearch term

Advanced Search (beta)

Display Show

All: 1

Review: 0

Click to change filter selection through MyNCBI.

1: Stat Med. 2007 Feb 28;26(5):1102-13. Links

Appropriateness of some resampling-based inference procedures for assessing performance of prognostic classifiers derived from microarray data.

Lusa L, McShane LM, Radmacher MD, Shih JH, Wright GW, Simon R.

Department of Experimental Oncology, Istituto Nazionale per lo Studio e la Cura dei Tumori, Milano, Italy. lara.lusa@ifom-ieo-campus.it

The goal of many gene-expression microarray profiling clinical studies is to develop a multivariate classifier to predict patient disease outcome from a gene-expression profile measured on some biological specimen from the patient. Often some preliminary validation of the predictive power of a profile-based classifier is carried out using the same data set that was used to derive the classifier. Techniques such as cross-validation or bootstrapping can be used in this setting to assess predictive power, and if applied correctly, can result in a less biased estimate of predictive accuracy of a classifier. However, some investigators have attempted to apply standard statistical inference procedures to assess the statistical significance of associations between true and cross-validated predicted outcomes. We demonstrate in this paper that naïve application of standard statistical inference procedures to these measures of association under null situations can result in greatly inflated testing type I error rates. Under alternatives of small to moderate associations, confidence interval coverage probabilities may be too low, although for very large associations coverage probabilities approach their intended values. Our results suggest that caution should be exercised in interpreting some of the claims of exceptional prognostic classifier performance that have been reported in prominent biomedical journals in the past few years. Copyright (c) 2006 John Wiley & Sons, Ltd.

PMID: 16755534 [PubMed - indexed for MEDLINE]

Bias in error estimation when using cross-validation for model selection. [BMC Bioinformatics. 2006]
Calculating confidence intervals for prediction error in microarray classification using resampling. [Stat Appl Genet Mol Biol. 2008]
On the statistical assessment of classifiers using DNA microarray data. [BMC Bioinformatics. 2006]
Multiple gene expression classifiers from different array platforms predict poor prognosis of colorectal cancer. [Clin Cancer Res. 2007]
Sharp simultaneous confidence intervals for the means of selected populations with application to microarray data analysis. [Biometrics. 2007]
» See all Related Articles...

Display Show

Appropriateness of some resampling-based inference procedures for assessing performance of prognostic classifiers derived from microarray data.

Related Articles