Testing Information

Descriptions of NTP Study Types

Expanded Overview

Print this page Easy Link

http://ntp.niehs.nih.gov/go/10007

Survival Analyses

The probability of survival is estimated by the product-limit procedure of Kaplan and Meier (1958) and is presented in the form of graphs. Animals found dead of other than natural causes are censored from the survival analyses; animals dying from natural causes are not censored. Statistical analyses for possible dose-related effects on survival uses Cox's (1972) method for testing two groups for equality and Tarone's (1975) life table test to identify dose-related trends. All reported P values for the survival analyses are two sided.

Calculation of Incidence

The incidences of neoplasms or nonneoplastic lesions are the numbers of animals bearing such lesions at a specific anatomic site and the numbers of animals with that site examined microscopically. For calculation of statistical significance, the incidences of most neoplasms and all nonneoplastic lesions are the numbers of animals affected at each site examined microscopically. However, when macroscopic examination is required to detect neoplasms in certain tissues (e.g., harderian gland, intestine, mammary gland, and skin) before microscopic evaluation, or when neoplasms have multiple potential sites of occurrence (e.g., leukemia or lymphoma), the denominators consist of the number of animals on which a necropsy was performed. The survival-adjusted neoplasm rate for each group and each site-specific neoplasm is also determined. This survival-adjusted rate (based on the Poly-3 method described below) accounts for differential mortality by assigning a reduced risk of neoplasm, proportional to the third power of the fraction of time on study, to animals that do not reach terminal sacrifice.

Analysis of Neoplasm and Nonneoplastic Lesion Incidences

The Poly-k test (Bailer and Portier, 1988; Portier and Bailer, 1989; Piegorsch and Bailer, 1997) is used to assess neoplasm and nonneoplastic lesion prevalence. This test is a survival-adjusted quantal-response procedure that modifies the Cochran-Armitage linear trend test to take survival differences into account. More specifically, this method modifies the denominator in the quantal estimate of lesion incidence to approximate more closely the total number of animal years at risk. For analysis of a given site, each animal is assigned a risk weight. This value is one if the animal had a lesion at that site or if it survived until terminal sacrifice; if the animal died prior to terminal sacrifice and did not have a lesion at that site, its risk weight is the fraction of the entire study time that it survived, raised to the kth power.

This method yields a lesion prevalence rate that depends only upon the choice of a shape parameter for a Weibull hazard function describing cumulative lesion incidence over time (Bailer and Portier, 1988). Unless otherwise specified, a value of k=3 was used in the analysis of site-specific lesions. This value was recommended by Bailer and Portier (1988) following an evaluation of neoplasm onset time distributions for a variety of site-specific neoplasms in control F344 rats and B6C3F1 mice (Portier et al., 1986). Bailer and Portier (1988) showed that the Poly-3 test gave valid results if the true value of k was anywhere in the range from 1 to 5. A further advantage of the Poly-3 method is that it does not require lesion lethality assumptions. Variation introduced by the use of risk weights, which reflect differential mortality, was accommodated by adjusting the variance of the Poly-3 statistic as recommended by Bieler and Williams (1993).

Tests of significance include pairwise comparisons of each exposed group with controls and a test for an overall exposure-related trend. Continuity-corrected Poly-3 tests are used in the analysis of lesion incidence, and reported P values are one sided. The significance of lower incidences or decreasing trends in lesions are also identified.

Analysis of Continuous Variables

Two approaches are employed to assess the significance of pairwise comparisons between exposed and control groups in the analysis of continuous variables. Organ and body weight data, which historically have approximately normal distributions, are analyzed with the parametric multiple comparison procedures of Dunnett (1955) and Williams (1971, 1972). Hematology, clinical chemistry, urinalysis, urine concentrating ability, cardiopulmonary, cell proliferation, tissue concentrations, spermatid, and epididymal spermatozoal data, which have typically skewed distributions, are analyzed using the nonparametric multiple comparison methods of Shirley (1977), as modified by Williams (1986) and Dunn (1964). Jonckheere's test (Jonckheere, 1954) is used to assess the significance of the dose-related trends and to determine whether a trend-sensitive test (Williams' or Shirley's test) is more appropriate for pairwise comparisons than a test that does not assume a monotonic dose-related trend (Dunnett's or Dunn's test). Prior to statistical analysis, extreme values identified by the outlier test of Dixon and Massey (1951) are examined by NTP personnel, and implausible values are eliminated from the analysis. Average severity values are analyzed for significance with the Mann-Whitney U test (Hollander and Wolfe, 1973). Because vaginal cytology data are proportions (the proportion of the observation period that an animal was in a given estrous stage), an arcsine transformation is used to bring the data into closer conformance with a normality assumption. Treatment effects are investigated by applying a multivariate analysis of variance (Morrison, 1976) to the transformed data to test for simultaneous equality of measurements across exposure concentrations.

Statistical analysis of immunological data use a tier approach (Luster et al., 1988, Luster et al., 1992). Data are initially tested for homogeneity using Bartlett's Chi Square Test. For data that are determined to be homogeneous, one-analysis of variance (ANOVA) is conducted. If the ANOVA is significant at p < 0.05 or less, Dunnett's multiple range t test is used for multiple treatment-control comparisons. If the data are not homogeneous, nonparametric analysis of variance, the Kruskal-Wallis test, or the Wilcoxon rank sum test is used to compare treatment groups with controls groups. The level of statistical significance is set at p < 0.05 and p < 0.01. Values are routinely presented as mean ± standard error. For host-resistance data, Chi Square analysis or log linear models are used to determine chemical treatment effects on mortality. For comparisons of group survival times, the product limit estimator is used in conjunction with the Mantel-Cox test.