SED navigation bar go to SED home page go to SED seminars page go to NIST home page SED Home Page SED Contacts SED Projects SED Products and Publications Search SED Pages

Statistical Engineering Division Seminar

Analyzing High Dimensional Data with Fewer Observations

Muni Srivastava
University of Toronto
Statistical Engineering Division Seminar
Thursday, September 20, 2007, 10:30-11:30 AM
Building 222, Room A264

Abstract

In this talk I consider the problem of analyzing datasets obtained on several characteristics, m, of an individual, such as in DNA microarrays in which such characteristics are typically in the several thousands. The number, however, of subjects, N, in which these observations can be taken is typically small, often less than 50. Standard multivariate theory requires that m be substantially smaller than N, and very little theory is available to analyze the case when N is less than m. In the last decade or so, results have been obtained for this case, also. I will present some of these results. Using these results, I will analyze two datasets from microarrays. Both datasets require the comparison of two groups, the so-called two-sample test. Often, it is required to find the components (or genes) that may have caused the rejection of the equality of two mean vectors. An alternative to the currently used False Discovery Rate (FDR) will be presented.

NIST Contact: John Lu, (301) 975-2846.

Date created: 9/12/2007
Last updated: 9/12/2007
Please email comments on this WWW page to sedwww@cam.nist.gov.