Data revision history

State Summary/State Characteristics Longitudinal Data File (PUSUM)

Robert E. Molyneux


Abstract

This page documents a longitudinal set of state summary public library data constructed from the annual Public Library Survey of the National Center for Education Statistics (NCES) under the Federal State Cooperative System (FSCS) program. From this survey, NCES publishes the Public Library Data File, State Summary/State Characteristics Data File, the Public Library Outlet Date File, and the E.D Tabs reports entitled Public Libraries in the United States for the various years. This compilation is of the State Summary/State Characteristics Data File, that is Public Use Summary (hence, I conclude, PUSUM which convention NCES names files in this series). The dataset contains 729 observations (an observation being summary data from the libraries in one state for one year), with 61 variables per observation, although not all variables were collected all years. These data are from the states, the District of Columbia, and occasionally territories.

The summary data were originally compiled and published for the years Fiscal Year (FY) 1987-FY 2005 although for reasons discussed shortly, the set of data discussed here only has the data from FY 1992-FY 2005 or 14 years. The data are available in formats discussed at the bottom of the page under Distribution and Formats.

The state summary data are fundamentally different from the Public Library Data File which is a compilation of data from almost 10,000 public libraries collected from FY 1987-FY 2005. These summary data are the result of analyzing aspects of those data and other data. Given that not all libraries respond to all survey items, NCES has imputed those missing data. Documentation is carried in the various sources cited in the list of NCES Public Library Survey publications elsewhere on this site. This summary dataset ("PUSUM" in NCES parlance) contains imputed data given that it is attempting to compensate for the missing values for purposes of arriving at national assessments of the state of public libraries. Knowing this, could you replicate these summary data using the Public Library Data File either in its unimputed or imputed forms? Probably not. The documentation on how the imputations were done is sketchy but improving. NCES also has access to data suppressed for individual libraries. Data are suppressed in smaller libaries, for instance, salary information when the staff is sufficiently small to cause a privacy concern. Of course, you could do your own imputations. More telling, though, is the fact that these summary files are also constructed using the Restricted Use data, in part. These data are not available for public use. The documentation for several years (here 2005) specifically warns:

Comparisons to data prior to FY 92 should be made with caution, as earlier data files do not include imputations for nonresponse, and the percentage of libraries responding to a given item varied widely among states. (p. 8)

According to the documentation from earlier years, there are supposed to be summary data from FY 1987-FY 1991. 1991 is made up of a summary from the reporting libraries so that comparing FY 1991 with FY 1992 is likely to show discontinuities that are an artifact of the imputations not from any change in the libraries. For FY 1987-FY 1989, in spite of the documentation's saying there are summary data, they are sketchy. FY 1987 has a few variables reported for the 19 states' summaries reported. Summary data for FY 1988 were not with the rest of the FY 1988 data and I don't know where they are. FY 1989 does have summary data in a separate file as does FY 1990. All imputations could be taken out of the post-1992 data but experimenting with that idea resulted in derivative datasets with very few values. So, we get some data imputed by methods that are not clear or we get vanishly few data.

Variables

The tables below list the variables that were dealt with in constructing PUSUM. The intent was to group all like variables in the various years in one variable name. Rarely was there any ambiguity in reading the documentation from year to year.

Two variables have been added to each year's data in order to ease processing: state and year. (NCES variables are in upper case while any non-NCES data are in italics. year was hard coded in one of the programs and is the nominal year of collection and the same as the fiscal year that appears on publications. The NCES library data are issued as annual publications without explicit acknowledgement in the data of previous years or years that might follow. State is the two-digit postal code. This variable was made from various variables in the dataset as noted below. All NCLIS datasets use the same conventions.

A spreadsheet (Open Office, xls) presents these variables in another format.

The following table has details on the variables in PUSUM. The convention followed is to use the 2005 variable name except where the data are no longer collected and the short definitions and notes come, largely, from the documentation for FY 2005. The wording differs on occasion for variables documented in the earlier years but the definitions overall are pretty consistent. Imputation flags listed in the documentation are not included in this list nor in PUSUM.


Identification Variables
Variable Short definition Years History Notes
state Two-letter State Code ALL Created from STABR Used for processing.
year Year of data collection ALL Inserted by code Used for processing.
STABR Two-letter Federal Information Processing Standards (FIPS) State Code ALL STABR (1992-) See PUB_FIPS, below.

Population Variables
Variable Short definition Years History Notes
POPU_LSA Population of the Legal Service Area ALL POPU (1992-97)
POPU_LSA (1998-)
 
POPU_UND Unduplicated population of the legal service area for the library (constructed variable). ALL PERIOD_POP (1992-1997)
POPU_UND (1998-)
 
POPU_ST Official state population estimate ALL PERIOD_EST (1992-1997)
POPU_ST (1998-)
 

Service Outlets
Variable Short definition Years History Notes
CENTLIB Number of central libraries ALL CENTLIB (1992-)  
BRANLIB Number of branch libraries ALL BRANLIB (1992-)  
BKMOB Number of bookmobiles ALL BKMOB (1992-)  

Full-Time Equivalent (FTE) Paid Staff
Variable Short definition Years History Notes
MASTER ALA-MLS Librarians ALL MASTER (1992-)  
LIBRARIA Total number of FTE employees holding the title of librarian. ALL LIBRARIAN (1992-97)
LIBRARIA (1998-)
 
OTHPAID All other paid FTE employees. ALL OTHPAID (1992-)  
TOTSTAFF Total paid FTE employees ALL TOTEMP (1992-97)
TOTSTAFF (1998-)
 

Operating Income
Variable Short definition Years History Notes
LOCGVT Operating income from local government ALL LOCGVT (1992-)  
STGVT Operating income from state government ALL STGVT (1992-)  
FEDGVT Operating income from federal government ALL FEDGVT (1992-)  
OTHINCM Other operating income (i.e., income not included in LOCGVT, STGVT, and FEDGVT) ALL OTHINCM (1992-)  
TOTINCM Total income (Sum of LOCGVT, STGVT, FEDGVT, and OTHINCM) ALL TOTINCM (1992-)  

Operating Expenditures
Variable Short definition Years History Notes
SALARIES Salaries and wages for all library staff ALL SALARIES (1992-)  
BENEFIT Employee benefits for all library staff ALL BENEFIT (1992-)  
STAFFEXP Total staff expenditures (SALARIES plus BENEFIT) ALL TOTEXP (1992-97)
STAFFEXP (1998-)
 
PRMATEXP Operating expenditures for print materials (including books, serial back files, current serial subscriptions, government documents) 2003- PRMATEXP (2003-)  
ELMATEXP Operating expenditures for library materials in electronic format 1995- ELMATEXP (1995-)  
OTHMATEX Operating expenditures for all other library materials (microform, audio, video, DVD, and new formats) 2003- OTHMATEX (2003-)  
TOTEXPCO Total expenditures on library collections. ALL TOTEXPCOL (1992-97)
TOTEXPCO (1998-)
 
OTHOPEXP Other operating expenditures (i.e., expenditures not included in STAFFEXP and TOTEXPCO) ALL OTHOPEXP (1992-)  
TOTOPEXP Total operating expenditures (Sum of STAFFEXP, TOTEXPCO, and OTHOPEXP) ALL TOTOEXP1 (1992-97)
TOTOPEXP (1998-)
 

Capital Expenditures
Variable Short definition Years History Notes
CAPITAL Expenditures for capital outlay ALL CAPITAL (1992-)  

Capital Revenue
Variable Short definition Years History Notes
CAP_REV Capital revenue for major capital expenditures 2003- CAP_REV (2003-)  
FCAP_REV Federal government capital revenue 2005- FCAP_REV (2005-)  
LCAP_REV Local government capital revenue 2005- LCAP_REV (2005-)  
OCAP_REV Other capital revenue 2005- OCAP_REV (2005-)  
SCAP_REV State government capital revenue 2005- SCAP_REV (2005-)  

Library Collections
Variable Short definition Years History Notes
BKVOL Number of books and serial volumes ALL BKVOL (1992-)  
EBOOK Electronic books (E-books) (digital documents, including non-serial government documents in digital format) 2003- EBOOK (2003-)  
AUDIO Number of audio materials ALL AUDIO (1992-)  
VIDEO Number of video materials ALL VIDEO (1992-)  
DATABASE Databases (locally mounted or remote, full-text or not, for which temporary or permanent access rights have been acquired) 2003- DATABASE (2003-)  
SUBSCRIP Number of current serial subscriptions ALL SUBSCRIPT (1992-97)
SUBSCRIP (1998-)
 
ESUBSCRP Current electronic serial subscriptions 2003- ESUBSCRP (2003-)  

Public Service Hours
Variable Short definition Years History Notes
HRS_OPEN Total annual public service hours for all service outlets ALL DUPLI (1992-97) HRS_OPEN (1998-)  

Library Services
Variable Short definition Years History Notes
VISITS Total annual library visits ALL ATTEND (1992-97)
VISITS (1998-)
ATTEND was defined as "Annual attendance in library" which, I understand, is regarded as the same as VISITS today.
REFERENC Total annual reference transactions ALL REFERENCE (1992-1997)
REFERENC (1998-)
 

Circulation
Variable Short definition Years History Notes
TOTCIR Total annual circulation transactions ALL TOTCIR (1992-)  

Inter-Library Loans
Variable Short definition Years History Notes
LOANTO Total annual loans provided to other libraries ALL LOANTO (1992-)  
LOANFM Total annual loans received from other libraries ALL LOANFM (1992-)  

Children's Services
Variable Short definition Years History Notes
KIDCIRCL Total annual circulation (including renewals) of all children's materials in all formats to all users ALL KIDCIRCL (1992-)  
KIDATTEN Total annual attendance at all programs intended primarily for children. Includes adults who attend programs intended primarily for children. ALL KIDATTEN (1992-)  

Electronic Technology
Variable Short definition Years History Notes
ELACCEXP Operating expenditures for electronic access 1995-2002 ELACCEXP (1995-2002)  
ELMATS Number of library materials in electronic format 1995-2002 ELMATS (1995-2002)  
STFTERMS Internet terminals used by staff only 1998-2002 STFTERMS (1998-2002)  
GPTERMS Internet terminals used by general public 1998- GPTERMS (1998-)  
ERES_USR Users of electronic resources per typical week 1999- ERES_USR (1999-)  

Other
Variable Short definition Years History Notes
STARTDAT Reporting period start date ALL PERIOD_PSM (1992-1997)
STARTDAT (1998-)
Month/year format, for example: "07/2003"
ENDDAT Reporting period end date ALL PERIOD_PEM (1992-1997)
ENDDAT (1998-)
Month/year format, for example: "06/2003"
PUB_FIPS Two-digit FIPS State Code. ALL PUB_FIPS (1992-) This is a numerical code (Alabama = 01). STABR uses a character state code (Alabama = AL).
YR_SUB FSCS submission year of public library data in four-digit format (YYYY) ALL YR (1992-97) + 1900
YR_SUB (1998-)
YR is a two-digit year format and YR_SUB is four digits.
OBEREG Bureau of Economic Analysis Code ALL OBEREG (1992-) 00-U.S. Service Schools
01-New England (CT ME MA NH RI VT)
02-Mid East (DE DC MD NJ NY PA)
03-Great Lakes (IL IN MI OH WI)
04-Plains (IA KS MN MO NE ND SD)
05-Southeast (AL AR FL GA KY LA MS NC SC TN VA WV)
06-Southwest (AZ NM OK TX)
07-Rocky Mountains (CO ID MT UT WY)
08-Far West (AK CA HI NV OR WA)
09-Outlying Areas (AS FM GU MH MP PR PW VI)

Distribution and Formats

Available formats
Format Filename Size
SAS pusum.sas7bdat 353KB
CSV pusum.csv 240KB
Open Office pusum.ods 262KB
Excel pusum.xls 378KB
ASCII pusum.asc 469KB
ASCII file format format.asc 3KB

Valid XHTML 1.0!


March 7, 2008
Back to NCES index
NCLIS 30th Anniversary logo Return to NCLIS Homepage