Chapter 11 - Effect of Interviewers on Data Quality - Development of Computer-Assisted Interviewing Procedures for the NHSDA

Computer Assisted Interviewing for SAMHSA's National Household Survey on Drug Abuse

11. Effect of Interviewers on Data Quality

In this chapter, we report the results of two investigations of the effect of the interviewers on data quality. We examine the relationship between interviewer characteristics and performance measures and look at the effects of the mode of interview on interviewer variance, which is the increase in variance due to the correlation of response errors within interviewers.

11.1 Relationship Between Interviewer Characteristics and Interviewer Performance

In this section, we describe the characteristics of the 1997 field experiment's field interviewers (FIs) and examine the relationship of those characteristics to a number of FI performance measures. This chapter describes an exploratory study to determine whether the quality of the NHSDA interviewer workforce can be improved by selecting them on the basis of their background characteristics.

11.1.1 Field Interviewer Characteristics

There were 165 FIs involved in the 1997 field experiment. For most of the interviewers, data from a personal data sheet (PDS) were available. For the purposes of this study, the following variables were created from the PDS: gender, age in years, a four-category ethnicity variable (black, white, Hispanic, and other), and a three-category education-level variable (high school, some college, and 4-year college graduate).

Eighty-one percent (133) of the interviewers were female. Their average age was 50 years, with a standard deviation of approximately 12. Most were white (73%), 17% were black, and the remaining 10% were Asian, Pacific Islander, American Indian, Alaskan Native, or Hispanic. Some 21% attended high school or were high school graduates; 41% had some college or a 2-year degree; and 36% were college graduates or had postgraduate degrees.

The other set of background variables related to field interviewing experience were as follows: a six-level ordered categorical variable reflecting years of survey interviewing experience (none, up to 1 year, 1 to 2 years, 2 to 5 years, 5 to 10 years, more than 10 years). Of the 143 interviewers who provided information on survey interviewing experience, only 21% had no experience, 48% had up to 5 years of experience, and 31% had 5 or more years of experience.

Five binary indicator variables indicated whether the FI had experience as a data collector or supervisor of data collectors in the following five areas: household surveys with face-to-face interviews; listing area segments; computer-assisted personal interviewing (CAPI); computer-assisted telephone interviewing (CATI) or other telephone interviewing; and field reporting with a microcomputer. These field interviewing experience variables are summarized in Exhibit 11.1.1.

Most of the FIs had either household survey face-to-face interviewing experience (70%) or CAPI experience (58%). Also, 79% of the interviewers had either household survey or CAPI experience. Some 49% had experience in both areas.

11.1.2 Methodology

Each interviewer completed a 23-item computer beliefs and attitudes questionnaire at three time points: prior to training, immediately after training, and at the end of the 1997 field experiment study. The time between the pre- and post-administrations was approximately 2 weeks, and the time between the post- and end-of-study administration was approximately 3 months. The 23 items were scored using a 5-point Likert scale: 5 (strongly agree), 4 (agree), 3 (neither agree nor disagree), 2 (disagree), and 1 (strongly disagree).

Twenty-one of these items came from an instrument used by the U.S. Bureau of the Census to study their FIs. Research Triangle Institute (RTI) added two items concerning the use of the Newton in screening. We decided to drop these two items so that our results might eventually be compared with those of the Census Bureau. The 21 items measured, for the most part, beliefs and attitudes about the role of computers in general and specifically with field interviewing. The correlation matrices for the 21 items were factor analyzed separately for each of the three time points. For each administration, two through five factors were extracted and rotated by Varimax. We felt that the best criterion for selecting a particular factor solution (i.e., number of factors) at each time point should be based on how effectively they predicted the FI performance measures.

The factor analyses of the beliefs and attitude questionnaires were undertaken (a) to reduce the number of independent variables in the interviewer performance regression models, (b) to address the problem of multicollinearity, and (c) to simplify the interpretation of results from the regression analyses. The unit of analysis was the interviewer. About 140 interviewers met the criteria to be included in the regression analyses. Therefore, we wanted to minimize the number of independent variables to be included in the regression models.

To model interviewer performance, we needed to control for the segment characteristics in which each interviewer worked. Nine segment characteristics were selected for this purpose:

Quarter 1 through 3 successful screening percentage (number of households screened divided by number of eligible households),
Quarter 1 through 3 successful interview percentage (number of completed interviews divided by the number of people who were selected to be interviewed),
median rent,
median value of owner-occupied dwelling,
percentage of noninstitutionalized Hispanics 12 years of age or older,
percentage of noninstitutionalized black, non-Hispanics 12 years ofage or older,
percentage of urban population in segment,
a binary urbanized area indicator, and
an ordered three-level population density variable.

The performance measures for the interviewers were based upon a single segment, which was the first clean segment in which the interviewer worked. A clean segment was one in which only one interviewer worked. The reason for this definition was to avoid contaminating the performance measures for a particular interviewer with the performance of one or more other interviewers. In most cases, an interviewer worked in only one segment, and that segment was usually clean.

The following interviewer performance measures were used as the five dependent variables in the regression modeling: the percentage of eligible dwelling units that were successfully screened; the percentage of completed interviews from selected persons; the probability of the interviewer having trouble setting up the computer as judged by the respondent; the probability of the interviewer having trouble with the computer during the interview as judged by the respondent; and the number of days from assignment of segment to beginning work in the segment.

Logistic regression analysis was used to model the four binomial outcome variables. For each interviewer, we had the number of trials (e.g., the number of persons selected for an interview) and the number of successful outcomes or events (e.g., the number of completed interviews). All of the logistic regression analyses were adjusted for overdispersion because the interview completion events within interviewers were not independent. A Cox proportional hazards model was used to model the time from assignment to beginning work in the assigned segment.

Because there were eight different factor solutions (two- to five-factor solutions for both the pre-training and post-training beliefs and attitudes questionnaires), we modeled each of the five interviewer performance measures as a function of each of the eight sets of factor scores. Thus, there were a total of 40 regression analyses. In general, the attitude and belief factors were not very predictive of interviewer performance. Overall, the post-training five-factor solution seemed to be the best set of factors for predicting the five performance measures, perhaps because these factors were measured after training and because they were also measured closer to the fieldwork activities on which the performance measures were based. Also, five factors explain more variation in the 21-questionnaire items than two-, three-, or four-factor solutions.

The five rotated factors accounted for 60% of the total variation in the 21 attitude and belief questions. The first factor had high loadings from 10 of the 21 items ranging from 0.55 to 0.78. It explained the most variation in the attitude items. We interpret this factor as a general expression of positive beliefs and attitudes toward computers both in general and specifically withrespect to their utility for interviewing.

The remaining four factors explained considerably less variation and had only between two and four variables loading high (i.e., about 0.50 and above) on them. Factor 2 reflected a disagreement with negative beliefs, such as beliefs that computers threaten interviewers' jobs and the feeling that the interviewer is being watched all the time by using a computer. Factor 3 was defined by three questionnaire items that reflected not being afraid of the responsibility for taking care of the computer and other equipment. Factor 4 was defined by three items that reflected lack of concern about computer breakdowns and keying in data. The last factor, Factor 5, was defined by two items that indicated that the interviewer believed that the respondents will enjoy the interview and that carrying the computer around will not be tiring.

Two logistic regression models, described below, were developed for each of the following four performance measures: the probability of a completed interview, the probability of a successful screening, the probability of having trouble setting up the computer, and the probability of having trouble with the computer during the interview. The independent variables came from three variable sets: nine segment characteristics, eight interviewer background and experience characteristics, and the five-factor post-training belief and attitude factors. Using the same independent variables, the Cox proportional hazards model was used to model the number of days from segment assignment to start of work in the segment.

Because there were a large number of independent variables relative to the number of observations, the following strategy was adopted for the first logistic regression model. The five post-treatment belief and attitude factors were included in all of the models. CAPI experience also was included in all four models. A stepwise logistic regression was run separately for the nine segment characteristics to select segment characteristics for inclusion in one of the two final models for each performance measure. Likewise, a stepwise logistic regression analysis was run on the eight interviewer background and experience variables to select interviewer characteristics for inclusion in one of the two final models for each performance measure. Variables significant at the 0.10 level for each of the two stepwise regression analyses along with the five attitude and belief factors (as well as CAPI experience) were included in a final model for each of the four performance measures.

The second model-building strategy was to do a single stepwise logistic regression analysis for each of the four interviewer performance measures. The variables were selected from the entire pool of 22 independent variables (i.e., the nine segment characteristics, the eight interviewer background and experience characteristics, and the five post-training factors). For all of the stepwise regressions, the 0.15 significance level was chosen for entry into the model, and 0.20 was chosen for removal from the model.

Thus, for each of the four performance measures, two models were developed. One model involved selecting the variables for inclusion in the final model (henceforth referred to as Model 1) by selecting the significant variables separately from the segment and interviewer background characteristics sets by stepwise logistic regressions and then combining the significant variables from these two stepwise regressions with the five post-training attitude and belief factorsand CAPI experience to estimate a final model. The other final model (henceforth referred to as Model 2) involved selecting the variables from a single stepwise logistic regression analysis using the single pool of all 22 variables from all three variable classes.

11.1.3 Results

For modeling the probability of completing an interview, both models indicated that the most important variables were segment characteristics. In both models, there was a highly significant negative relationship between the percentage of Hispanics in the segment and the probability of a completed interview; moreover, there was a highly significant positive relationship between the percentage of black, non-Hispanics in the segment and the probability of a completed interview. For Model 1 (see Exhibit 11.1.2), there were no significant parameters for any of the interviewer background variables or for any of the five belief and attitude factors. For Model 2 (see Exhibit 11.1.3), two interviewer variables were significant: education level and household survey experience. Education level was positively associated with the probability of a completed interview, and household survey experience was negatively associated. (The significance levels were 0.04 and 0.05, respectively.) Factor 2, reflecting disagreement with the belief that computers are watching the interviewer, and reflecting disagreement with the belief that computers may threaten interviewers' job security, was negatively associated (p=.02) with the probability of a completed interview. The direction of two of these relationships (household survey experience and Factor 2) was counterintuitive.

For modeling the probability of completing a household screening, once again both models indicated that the most important variables were segment characteristics. For Model 1 (see Exhibit 11.1.4), percentage Hispanic and median rent were negatively associated with the probability of a completed screening, while median value of owner-occupied housing and the Quarters 1 to 3 completed screening rate were positively associated with the screening rate. None of the interviewer background variables was significant. However, Factor 2 was once again significant, but in the expected direction. Interviewers who disagreed with the negative belief statements had a higher probability of completing household screenings. For Model 2 (see Exhibit 11.1.5), percentage Hispanic and median rent were negatively associated with the probability of a completed screening as in Model 1. The percentage of the segment that was urban had a positive relationship with the screening rate. In this model, interviewers who had computer-assisted interviewing experience had about two times the odds of a completed screening (p=.04) compared to those with non-CAPI experience. Factor 2 was once again significant (p=.01) in the expected direction. A one-scale point increase in Factor 2 increased the odds of a completed screening by about 44%.

Model 1 (see Exhibit 11.1.6), for modeling the probability of having trouble in setting up the computer for the CAPI, indicated that none of the 22 candidate variables was significant. The overall level of significance of the model was only 0.44. For Model 2 (see Exhibit 11.1.7), only two variables entered the model (percentage urban in the segment and interviewer's age). However, neither was significant at the 0.05 level.

For Model 1 (see Exhibit 11.1.8), in modeling the probability of having computer troubleduring the interview, only two segment characteristics were marginally significant. Percentage Hispanic had a negative parameter (p=0.07), and population density had a positive parameter (p=0.09). Factor 5 had a positive relationship (p=0.03) with the probability of having computer problems during the interview. For Model 2 (see Exhibit 11.1.9), percentage Hispanic and population density had negative and positive parameter, respectively, and were significant at the 0.09 and 0.02 levels, respectively. Age and Factor 5 had positive relationships with computer problems. (The significance levels were 0.03 and 0.01, respectively.) Older interviewers had a higher probability of experiencing computer trouble. Factor 5 was defined primarily by the belief that respondents will enjoy the interview and the belief that carrying computers around will not be tiring. It is puzzling why this factor should be positively associated with computer troubles during the interview.

The Cox proportional hazards models indicated that the time from segment assignments to starting work in the segment could not be predicted from these same independent variables.

11.1.4 Discussion

Segment characteristics were the most important predictors of the probability of a successful screening and of the probability of a completed interview. In general, the six survey experience variables were not predictive of interviewer performance. The only exception was the finding that CAPI experience was positively related to the successful screening rate. The five attitude and belief factors were marginally predictive, but in some cases the direction of the relationship was counterintuitive.

In general, there was not a lot of variation in the dependent variables. The screening rates were generally in the 0.90s, the interview rates were in the high 0.70s, and the two computer problem variables were in the 0.10s. Nevertheless, the screening and interview rates were fairly predictable from the segment characteristics. One of the reasons that the survey experience variables were not predictive of performance is that five of the six variables simply indicated whether or not they had any experience in that particular area, but did not indicate the amount of experience with respect to either years or number of surveys in each of the five areas.

Another problem was the large number of independent variables relative to the number of interviewers. With four performance measures and 22 independent variables, we would expect four or five parameters to be significant at the 0.05 level just by chance. The situation is even worse when variables are selected by stepwise procedures. This could be a reason why the relationship of the attitude and belief factors to the performance measures is in some instances counterintuitive. Because the independent variables came from different sources, missing data further reduced the sample sizes for the regression analyses.


Exhibit 11.1.1 Interviewer Experience
Area of Experience	Experience		No Experience
Listing Area Segments	58	(35%)	107	(65%)
Computer-Assisted Personal Interviewing	96	(58%)	69	(42%)
Computer-Assisted Telephone Interviewing	72	(44%)	93	(56%)
Field Reporting with a Microcomputer	33	(20%)	132	(80%)
Household Surveys-Face-to-Face Interviews	115	(70%)	50	(30%)


Exhibit 11.1.2 Probability of Completed Interviews: Model 1
Variable	DF	Parameter Estimate	Standard Error	Wald Chi-Square	PR > Chi-Square	Standardized Estimates	Odds Ratio
Intercept	1	2.061	0.304	45.979	0.000	0.000	0.000
Hispanic % of Civilian, Noninstitutionalized Population >12 years	1	-0.012	0.004	10.732	0.001	-0.141	0.989
Black, Non-Hispanic % of Civilian, Noninstitutionalized Population >12 years	1	0.009	0.004	6.416	0.011	0.148	1.009
Urbanized Area Indicator	1	-0.967	0.281	11.811	0.001	-0.200	0.380
Segment Median Value, Owner-Occupied HUs	1	0.000	0.000	2.535	0.111	0.090	1.000
Computer-Assisted Interview Experience	1	-0.272	0.213	1.632	0.202	-0.065	0.762
Post-Training Factor #1: Positive Toward Computers	1	0.104	0.104	0.989	0.320	0.048	1.109
Post-Training Factor #2: Disagreement with Negative Attitudes	1	-0.177	0.117	2.293	0.130	-0.085	0.838
Post-Training Factor #3: Not Afraid of Caring for Computers	1	-0.003	0.110	0.001	0.975	-0.002	0.997
Post-Training Factor #4: Lack of Concern Re Computer Operations	1	-0.030	0.108	0.076	0.782	-0.013	0.971
Post-Training Factor #5: Rs Like Interview; Easy to Do	1	-0.149	0.101	2.199	0.138	-0.072	0.861

HU=housing unit.

Source: National Household Survey on Drug Abuse: Development of Computer-Assisted Interviewing Procedures; 1997 Field Experiment.


Exhibit 11.1.3 Probability of Completed Interviews: Model 2
Variable	DF	Parameter Estimate	Standard Error	Wald Chi-Square	PR > Chi-Square	Standardized Estimates	Odds Ratio
Intercept	1	-2.883	2.096	1.892	0.169	.	.
Hispanic % of Civilian, Noninstitutionalized Population >12 years	1	-0.016	0.004	16.375	0.000	-0.207	0.984
Black, Non-Hispanic % of Civilian, Noninstitutionalized Population >12 years	1	0.015	0.005	11.028	0.001	0.236	1.015
Quarters 1-3 Segment Person Response Rate	1	0.009	0.006	2.002	0.157	0.075	1.009
Quarters 1-3 Segment DU Screening Rate	1	0.037	0.022	2.892	0.089	0.104	1.038
Segment Median Rent, Renter-Occupied HUs	1	-0.002	0.001	2.751	0.097	-0.174	0.998
Segment Median Value, Owner-Occupied HUs	1	0.000	0.000	8.555	0.003	0.329	1.000
Urbanized Area Indicator	1	-0.943	0.318	8.793	0.003	-0.202	0.389
Education	1	0.260	0.126	4.254	0.039	0.118	1.297
Household Survey Experience	1	-0.600	0.307	3.824	0.051	-0.112	0.549
Post-Training Factor #2: Disagreement with Negative Attitudes	1	-0.290	0.127	5.261	0.022	-0.142	0.748

DU=dwelling unit; HU=housing unit.

Source: National Household Survey on Drug Abuse: Development of Computer-Assisted Interviewing Procedures; 1997 Field Experiment.


Exhibit 11.1.4 Probability of Completed Screenings: Model 1
Variable	DF	Parameter Estimate	Standard Error	Wald Chi-Square	PR > Chi-Square	Standardized Estimates	Odds Ratio
Intercept	1	-2.607	2.498	1.089	0.297	.	.
Hispanic % of Civilian, Noninstitutionalized Population >12 years	1	-0.016	0.006	7.493	0.006	-0.098	0.984
Quarters 1-3 Segment DU Screening Rate	1	0.073	0.026	7.795	0.005	0.116	1.076
Segment Median Rent, Renter-Occupied HUs	1	-0.004	0.001	14.307	0.000	-0.250	0.996
Segment Median Value, Owner-Occupied HUs	1	0.000	0.000	2.812	0.094	0.100	1.000
Percent Urban Population in Segment	1	0.012	0.004	8.582	0.003	0.124	1.012
Population Density	1	-0.580	0.262	4.894	0.027	-0.111	0.560
Computer-Assisted Interview Experience	1	0.352	0.280	1.584	0.208	0.047	1.422
Post-Training Factor #1: Positive Toward Computers	1	0.200	0.145	1.892	0.169	0.052	1.221
Post-Training Factor #2: Disagreement with Negative Attitudes	1	0.444	0.118	14.255	0.000	0.126	1.559
Post-Training Factor #3: Not Afraid of Caring for Computers	1	-0.097	0.153	0.403	0.526	-0.025	0.907
Post-Training Factor #4: Lack of Concern Re Computer Operations	1	-0.070	0.149	0.222	0.638	-0.019	0.932
Post-Training Factor #5: Rs Like Interview; Easy to Do	1	-0.145	0.134	1.182	0.277	-0.042	0.865

DU=housing unit.

Source: National Household Survey on Drug Abuse: Development of Computer-Assisted Interviewing Procedures; 1997 Field Experiment.


Exhibit 11.1.5 Probability of Completed Screenings: Model 2
Variable	DF	Parameter Estimate	Standard Error	Wald Chi-Square	PR > Chi-Square	Standardized Estimates	Odds Ratio
Intercept	1	2.598	0.470	30.560	0.000	.	.
Hispanic % of Civilian, Noninstitutionalized Population >12 years	1	-0.015	0.007	5.302	0.021	-0.080	0.985
Segment Median Rent, Renter-Occupied HUs	1	-0.002	0.001	5.319	0.021	-0.093	0.998
Percent Urban Population in Segment	1	0.011	0.005	5.909	0.015	0.100	1.011
Computer-Assisted Interview Experience	1	0.737	0.346	4.522	0.034	0.083	2.089
Post-Training Factor #2: Disagreement with Negative Attitudes	1	0.363	0.124	8.606	0.003	0.090	1.437

HU=housing unit.

Source: National Household Survey on Drug Abuse: Development of Computer-Assisted Interviewing Procedures; 1997 Field Experiment.


Exhibit 11.1.6 Probability of Problems Setting Up Computer: Model 1
Variable	DF	Parameter Estimate	Standard Error	Wald Chi-Square	PR > Chi-Square	Standardized Estimates	Odds Ratio
Intercept	1	-3.517	1.145	9.439	0.002	.	.
Percent Urban Population in Segment	1	0.008	0.007	1.374	0.241	0.108	1.008
Age	1	0.014	0.018	0.640	0.424	0.063	1.014
Field Reporting Microcomputer Experience	1	-0.686	0.633	1.173	0.279	-0.095	0.504
Computer-Assisted Interviewing Experience	1	0.091	0.424	0.046	0.831	0.016	1.095
Post-Training Factor #1: Positive Toward Computers	1	-0.072	0.220	0.106	0.745	-0.024	0.931
Post-Training Factor #2: Disagreement with Negative Attitudes	1	0.012	0.180	0.004	0.948	0.004	1.012
Post-Training Factor #3: Not Afraid of Caring for Computers	1	-0.075	0.205	0.134	0.714	-0.025	0.928
Post-Training Factor #4: Lack of Concern Re Computer Operations	1	-0.216	0.197	1.204	0.273	-0.071	0.806
Post-Training Factor #5: Rs Like Interview; Easy to Do	1	-0.052	0.193	0.071	0.790	-0.019	0.950

Source: National Household Survey on Drug Abuse: Development of Computer-Assisted Interviewing Procedures; 1997 Field Experiment.


Exhibit 11.1.7 Probability of Problems Setting Up Computer: Model 2
Variable	DF	Parameter Estimate	Standard Error	Wald Chi-Square	PR > Chi-Square	Standardized Estimates	Odds Ratio
Intercept	1	-4.442	1.077	17.007	0.000	.	.
Percent Urban Population in Segment	1	0.013	0.008	3.118	0.077	0.182	1.013
Age	1	0.023	0.016	2.063	0.151	0.100	1.023

Source: National Household Survey on Drug Abuse: Development of Computer-Assisted Interviewing Procedures; 1997 Field Experiment


Exhibit 11.1.8 Probability of Computer Problems During Interview: Model 1
Variable	DF	Parameter Estimate	Standard Error	Wald Chi-Square	PR > Chi-Square	Standardized Estimates	Odds Ratio
Intercept	1	-2.901	0.400	52.523	0.000	.	.
Population Density	1	0.319	0.175	3.331	0.068	0.125	1.376
Hispanic % of Civilian, Noninstitutionalized Population >12 years	1	-0.011	0.007	2.976	0.085	-0.164	0.989
Computer-Assisted Interviewing Experience	1	-0.114	0.280	0.165	0.684	-0.031	0.892
Post-Training Factor #1: Positive Toward Computers	1	0.061	0.152	0.158	0.691	0.032	1.062
Post-Training Factor #2: Disagreement with Negative Attitudes	1	0.102	0.159	0.406	0.524	0.054	1.107
Post-Training Factor #3: Not Afraid of Caring for Computers	1	-0.143	0.143	0.993	0.319	-0.074	0.867
Post-Training Factor #4: Lack of Concern Re Computer Operations	1	0.033	0.140	0.056	0.813	0.017	1.034
Post-Training Factor #5: Rs Like Interview; Easy to Do	1	0.291	0.136	4.559	0.033	0.159	1.337

Source: National Household Survey on Drug Abuse: Development of Computer-Assisted Interviewing Procedures; 1997 Field Experiment.


Exhibit 11.1.9 Probability of Computer Problems During Interview: Model 2
Variable	DF	Parameter Estimate	Standard Error	Wald Chi-Square	PR > Chi-Square	Standardized Estimates	Odds Ratio
Intercept	1	-4.773	0.866	30.399	0.000	.	.
Hispanic % of Civilian, Noninstitutionalized Population >12 years	1	-0.013	0.008	2.793	0.095	-0.197	0.987
Population Density	1	0.404	0.178	5.150	0.023	0.165	1.498
Age	1	0.031	0.014	4.581	0.032	0.210	1.031
Post-Training Factor #1: Positive Toward Computers	1	0.340	0.186	3.348	0.067	0.182	1.405
Post-Training Factor #5: Rs Like Interview; Easy to Do	1	0.377	0.142	7.086	0.008	0.217	1.458

Source: National Household Survey on Drug Abuse: Development of Computer-Assisted Interviewing Procedures; 1997 Field Experiment.

11.2 Effect of Mode of Interview on Interviewer Variance

Interviewer variance or correlated response variance refer to an increase in the variance of estimates due to correlation of response errors within interviewers. It has often been assumed that this correlation arises because the "interviewers influence respondents to make response errors of a similar type" (Groves, 1991, p. 16). Well-developed self-administered questionnaires (SAQs) should be subject to less influence by the interviewers. This is because the interviewer would have less interaction with the respondent during the question-answering process and, thereby, less chance to influence his or her response errors. Thus, as we become more successful in developing questionnaires that respondents can complete without help from the interviewer, we should expect interviewer variance to decrease. ACASI does allow the respondents to operate more independently than the paper answer sheets. Other chapters of this report document the fact that ACASI increases reporting of sensitive behaviors and that respondents find it to be a more private response mode. We also examined how it affected interviewer variance.

Ideally, to assess the impact of mode of interview on correlated measurement errors, we would use a design in which subsamples of the sample segments were randomly assigned to interviewers and, within interviewers, randomly assigned to an interviewing mode (either ACASI or PAPI). The 1997 field experiment did not meet these requirements; however, because the sample segments within primary sampling units (PSUs) were randomly assigned to one of four quarters of data collection and because we were using segments from Quarters 1 through 3 for the 1997 field experiment, the assumption that segments were randomly assigned to mode of interview within PSUs is valid. Segments were not randomly assigned to interviewers.

For evaluating the impact of the interviewing method on the interviewer variance component, we fit a general linear mixed model with fixed and random effects of the form

where Y is an outcome variable, represents fixed effects with respect to explanatory variables X , represents random effects with coefficients Z, and represents random error in each individual observation. For fixed effects, we used the following independent variables: (a) stratum, (b) gender, (c) race, and (d) age category. For the random effects, we considered the following components: (a) PSU and (b) field interviewer ID. We initially tried fitting a single model, which included mode of interview. Because of the fact that overall people gave different responses under ACASI and PAPI interviews, however, the individual or residual variance components for the same response variable were different for the two interview types. Therefore, the model was applied separately for ACASI and PAPI data. We examined several outcome variables using the SAS procedure MIXED. The results are displayed in Exhibit 11.2.1. For the mental health-related outcomes, individuals under the age of 17 years were dropped and the educationvariable was added to the model.

It is not possible to compare the interviewer components directly because the individual and PSU components are different for the two interview types. Thus, for each method, we computed the intraclass correlation to estimate the relative effect of interviewers on variance. This represents the proportion of the population variance that is accounted for by interviewers.

Of the 15 variables examined, for 14 of the variables correlation for PAPI was greater than the correlation for ACASI. For only one of the variables, percentage who did not gain or lose weight when depressed, the PAPI correlation was slightly smaller than ACASI, but both values were very close (0.09869 and 0.09207).

The results indicate that the ACASI method reduced the influence of interviewers on the variance. This is another positive impact of the ACASI methodology.

Because of the difficulties associated with this analysis and the fact that the design was not as well suited as it might be to investigate this issue, we recommend that this analysis be repeated with 1999 NHSDA split-sample data.


Exhibit 11.2.1 Variance Components and Interviewer Intraclass Correlation for the ACASI and PAPI Interviews for Selected Variables
	Variance Components
	Individual	PSU	Interviewer	Rho

ACASI	2247.89	29.293	1.089	0
PAPI	2021.09	6.333	55.252	0.02653
Ever used alcohol
ACASI	2048.74	0.877	27.861	0.01341
PAPI	1587.99	15.024	94.959	0.05592
Ever used marijuana or hashish
ACASI	1764.13	11.639	0	0
PAPI	1863.07	8.530	42.258	0.02208
Ever used hallucinogens
ACASI	808.159	6.061	6.799	0.0083
PAPI	676.95	7.265	6.559	0.0095
Ever used inhalants
ACASI	1020.15	0	15.415	0.01489
PAPI	484.96	3.228	17.242	0.03411
Ever felt sad or depressed for more than 2 weeks
ACASI	1967.36	2.963	0.0	0
PAPI	1241.28	11.222	38.832	0.03007
Did not gain or lose weight when depressed
ACASI	1896.63	0	207.677	0.09869
PAPI	1994.01	0	202.195	0.09207
Felt anxious for more than a month
ACASI	1554.02	0	18.871	0.012
PAPI	910.86	0	40.743	0.04282
Number of days smoked in last 30 days
ACASI	133.723	0	4.525	0.03273
PAPI	121.621	0.796	12.385	0.09188
Number of days had a drink in last 30 days
ACASI	36.218	0.135	0.612	0.01656
PAPI	43.284	0	1.876	0.04154
Number of days used marijuana in last 30 days
ACASI	89.921	0	0	0
PAPI	87.313	0	0.688	0.0078
Number of days used hallucinogens in last 30 days
ACASI	0.455	6.469	0	0
PAPI	11.281	0	0	0
Number of days used inhalants in last 30 days
ACASI	23.568	0	0	0
PAPI	2.389	11.015	14.729	0.523549

ACASI=audio computer-assisted self-interviewing; PAPI=paper-and-pencil interviewing; PSU=primary sampling unit.

Source: National Household Survey on Drug Abuse: Development of Computer-Assisted Interviewing Procedures; 1997 Field Experiment.

This page was last updated on June 16, 2008.

SAMHSA, an agency in the Department of Health and Human Services, is the Federal Government's lead agency for improving the quality and availability of substance abuse prevention, addiction treatment, and mental health services in the United States.

* Adobe™ PDF and MS Office™ formatted files require software viewer programs to properly read them. Click here to download these FREE programs now