Using Supervised Principal Components Analysis to Assess Multiple Pollutant Effects Steven Roberts and Michael A. Martin School of Finance and Applied Statistics, College of Business and Economics, Australian National University, Canberra, Australian Capital Territory, Australia Abstract Background: Many investigations of the adverse health effects of multiple air pollutants analyze the time series involved by simultaneously entering the multiple pollutants into a Poisson log-linear model. This method can yield unstable parameter estimates when the pollutants involved suffer high intercorrelation ; therefore, traditional approaches to dealing with multicollinearity, such as principal component analysis (PCA) , have been promoted in this context. Objectives: A characteristic of PCA is that its construction does not consider the relationship between the covariates and the adverse health outcomes. A refined version of PCA, supervised principal components analysis (SPCA) , is proposed that specifically addresses this issue. Methods: Models controlling for long-term trends and weather effects were used in conjunction with each SPCA and PCA to estimate the association between multiple air pollutants and mortality for U.S. cities. The methods were compared further via a simulation study. Results: Simulation studies demonstrated that SPCA, unlike PCA, was successful in identifying the correct subset of multiple pollutants associated with mortality. Because of this property, SPCA and PCA returned different estimates for the relationship between air pollution and mortality. Conclusions: Although a number of methods for assessing the effects of multiple pollutants have been proposed, such methods can falter in the presence of high correlation among pollutants. Both PCA and SPCA address this issue. By allowing the exclusion of pollutants that are not associated with the adverse health outcomes from the mixture of pollutants selected, SPCA offers a critical improvement over PCA. Key words: air pollution, mortality, multiple pollutants, principal components analysis, time series. Environ Health Perspect 114:1877–1882 (2006) . doi:10.1289/ehp.9226 available via http://dx.doi.org/ [Online 24 August 2006] Address correspondence to S. Roberts, School of Finance and Applied Statistics, College of Business and Economics, Australian National University, Canberra ACT 0200 Australia. Telephone: 61 2 6125 3470. Fax: 61 2 6125 0087. E-mail: steven.roberts@anu.edu.au The authors declare they have no competing financial interests. Received 3 April 2006 ; accepted 24 August 2006. The full version of this article is available for free in HTML or PDF formats. |