Seasonal Adjustment in the Current Employment Statistics ProgramIntroduction Many Current Employment Statistics (CES) data users are interested in the seasonally adjusted over-the-month changes as a primary measure of overall national economic trends. Therefore, accurate seasonal adjustment is an important component in the usefulness of these monthly data. The CES Program employs concurrent seasonal adjustment methodology to seasonally adjust it’s National estimates of employment, hours, and earnings. Under concurrent methodology, new seasonal factors are calculated each month, using all relevant data up to and including the current month. This documentation discusses in detail the seasonal adjustment methodology and software employed by the Current Employment Statistics Program. It is important to note that this describes seasonal adjustment only as it relates to the CES Program’s implementation. There are other aspects of seasonal adjustment that are not discussed here. Getting Started with X-12 ARIMA The CES Program uses X-12 ARIMA software developed by the U.S. Census Bureau to seasonally adjust the monthly estimates. The X-12 ARIMA software is available on the U.S. Census Bureau web site at http://www.census.gov/srd/www/x12a/. The site contains the following information:
This documentation describes how the CES Program employs X-12 ARIMA for seasonal adjustment purposes. Specifically, it describes the input files used in the CES Program’s implementation, and commands used to invoke the software. This is not a substitute for formal X-12 ARIMA training. There are other uses and features of X-12 ARIMA that are not discussed in this documentation. The U.S. Census Bureau offers more intensive training for X-12 ARIMA and seasonal adjustment. Contact the Census Bureau or visit their website at http://www.census.gov for more details. Current Employment Statistics Seasonal Adjustment ProceduresFor published series, the CES Program generally seasonally adjusts at the 3-digit NAICS level and aggregates to the higher levels. In a few cases, the CES Program seasonally adjusts published series at a level lower than the 3-digit NAICS level. However, these lower level cells are not used when aggregating to the higher levels -- for published series, aggregation begins at the 3-digit NAICS level. The CES Program’s current implementation of seasonal adjustment controls for several calendar effects:
More information about the calendar-related fluctuations in CES data is available on the BLS website at http://www.bls.gov/ces/cesfltxt.htm . All controllable variables remain fixed during the year. For example, the ARIMA model, outliers, transformation specification, and historical data are held constant, and the same calendar treatments are used throughout the year. Once a year, as part of the annual CES benchmark procedure, all seasonal adjustment specifications are reviewed for each series. Any changes are implemented and kept constant until the next annual benchmark. Also during the annual benchmark, estimates for the five most recent years are re-seasonally adjusted using the new specifications. After 5 years of revisions, seasonally adjusted data are frozen. The CES Program uses the following input files when seasonally adjusting estimates:
More details follow on each input. Specification File An input specification file, or a "spec" file, is a text file used to specify program operations. The spec file is composed of functional units called specifications (or "specs"). Each spec unit comprising the spec file controls the options for a specific function. There are 15 different specs that can be used in a spec file; however, the CES Program’s implementation typically employs only 8 specs. These specs are:
Each spec used by the CES Program when seasonally adjusting estimates is covered in greater detail at the end of this document in Appendix A, entitled "Anatomy of a Spec File". In the CES Program’s implementation, each 3-digit employment series has its own spec file ending in a ".spc" file extension. The ".spc" extension is not recognizable by all operating systems and usually needs to be opened with a text editor such as TextPad, Wordpad, or Notepad. Also, it is important to remember that when running X-12 ARIMA in DOS, the name of the spec file must be 8 characters or less. This is a limitation of DOS, not X-12 ARIMA. All of the spec files currently used in production can be downloaded from http://www.bls.gov/ces/cesseasadj.htm. Input Data File The input data file consists of non-seasonally adjusted CES estimates at the 3-digit NAICS level and is referred to in the SERIES spec of the spec file. The CES implementation reads input data from a text file in "free format" style. In the free-format style, data are delimited with either tabs or spaces, and only the input data is included -- dates and other descriptive information are excluded. Instead, information describing the data is specified in the SERIES spec using the START and PERIOD arguments. The full path and name of the input data file is specified using the FILE argument (see Figure 1).
Figure 1. Input Data File Specifications
CES data can be extracted from the BLS website at http://www.bls.gov/ces/home.htm . However, in some cases, data extracted from the BLS website will differ from what the CES Program actually uses in seasonal adjustment. In particular, data extracted from the BLS website will reflect any strikes or other prior adjustments that have taken place. Before running seasonal adjustment, the CES Program will reverse these effects so that they will not be considered when calculating the seasonal factors. Also, the CES Program uses unrounded data when running seasonal adjustment -- data on the BLS website are rounded.
Prior Adjustment File As mentioned in the previous section, in some cases the CES Program will modify the non-seasonally adjusted estimates (input data) before running X-12 ARIMA. This is done to ensure that non-seasonal events such as strikes are not included in the calculation of the seasonal factors. Once the seasonal factors are calculated, they are applied to the original data -- i.e., the non-seasonally adjusted data that reflects the non-seasonal event -- to calculate the seasonally adjusted estimate. To read more about the impact of strikes on CES data, visit the BLS website at http://www.bls.gov/ces/cesstrk.htm. The latest prior adjustment file used in the seasonal adjustment of CES data can be downloaded from http://www.bls.gov/ces/cesseasadj.htm. The first column contains the 14-digit BLS NAICS tabcode. The second column contains the year, and the next 12 columns represent the months of the year in sequential order (January through December). The file contains both positive and negative numbers. The positive numbers reflect a strike and are added to the non-seasonally adjusted data before running X-12 ARIMA. The negative numbers reflect the buildup of employment associated with the decennial census and are subtracted from the non-seasonally adjusted data before calculating the seasonal factors. Figure 2. Prior Adjustment File Format
User-Defined Regression Variable File As mentioned earlier, the CES Program’s current implementation of seasonal adjustment controls for several non-economic calendar related fluctuations in the estimates. This is done with the inclusion of user-defined regression (or "dummy") variables. The dummy variables are defined in the REGRESSION spec of the spec file. The dummy files vary depending upon the type of calendar event being treated. Table 1 lists the dummy files used and the calendar event(s) they are used to treat. Table 1. Dummy Files with Calendar Treatment
The dummy values are usually 1 and 0, with weights assigned so that the effect over a 10 year period sums to zero. The latest user-defined regression files used in the seasonal adjustment of CES data can be downloaded from http://www.bls.gov/ces/cesseasadj.htm. Metafile The metafile is a text file ending in a ".mta" file extension and is used when running X-12 ARIMA on more than one series. It is essentially a list of the complete path and filename -- without the extension -- of all of the input specification (spec) files. Only one spec file is listed per row. As with the individual spec files, it is important to remember that when running X-12 ARIMA in DOS, the name of the metafile must be 8 characters or less. Running X-12 ARIMA Running X-12 on a Single Series Use the following command at the DOS prompt when running X-12 ARIMA on a single series:
{path1\}x12a {path2\}specfile name -options
Example: At the DOS prompt, type: c:\x12s\x12a c:\x12\nov02\AE113310 -w (where AE113310.spc is the series you want to adjust) Running X-12 on a Multiple Series Use the following command at the DOS prompt when running X-12 ARIMA on more than one series: {path1\}x12a -m {path2\}metafile name -options
Example: At the DOS prompt, type: c:\x12s\x12a -m c:\x12\nov02\pubAE -w (where pubAE.mta is the metafile you are using) Output from X-12 ARIMA When X-12 ARIMA is run, several output files are generated by default. The output files are saved in the same location as the input specification files.
More details follow on each of the output files. Main Output File (*.out) The X-12 ARIMA output is written to a text file ending in a ".out" extension. Output from the CES implementation contains many different tables and statistics, including:
Individual specs in the spec file control their contribution to this output using optional PRINT arguments. For example, within the X11 spec, BRIEF specifies that only certain tables or plots are printed, while the minus sign in front of a name (such as -SPECSA or -SPECIRR) means that particular table or plot should be suppressed from the output. In this example, without the options -SPECSA and -SPECIRR, both of the plots would be printed by default under the BRIEF option. Figure 3. The PRINT Argument in the X11 Spec
It is important to remember that every time X-12 ARIMA is run on a particular series, the *.out file is overwritten, unless an alternate name or directory is specified. Error Output File (*.err) Input errors are written to a text file ending in an ".err" extension. If the error is fatal, ERROR: will be displayed before the error message. If the error is not fatal, WARNING: will be printed before the message. Non-fatal errors (or warnings) will not stop the program, but should be an alert to use caution and to check input and output carefully. It is important to remember that, as is the case with all output files, every time X-12 ARIMA is run on a particular series, the *.err file is overwritten, unless an alternate name or directory is specified. Log Output File (*.log) A summary of modeling and seasonal adjustment diagnostics are written to a text file ending in a ".log" extension. Individual specs in the specification file control their contribution to this output using optional SAVELOG arguments. When X-12 ARIMA is run on an individual spec file, the log file is stored with the same name and directory as the spec file. However, when X-12 is run using a metafile, the log file is stored with the same name and directory as the metafile. As is with all output files, every time X-12 ARIMA is run, the *.log file is overwritten unless an alternate name or directory is specified. Other Output Files Other output files are generated as specified in the spec file using the SAVE argument. In the CES Program’s implementation, the following additional output files are generated:
Indirect Seasonal Adjustment The CES Program generally seasonally adjusts published series directly at the 3-digit NAICS level and aggregates to the higher levels. However, there are some exceptions to this rule. In a few of the All Employees series, the CES Program will seasonally adjust at a level lower than the 3-digit NAICS level. In these instances, the CES Program seasonally adjust the 3-digit series indirectly; i.e., all of the component (lower level) series are seasonally adjusted directly and aggregated up to the composite (3-digit) level. Indirect seasonal adjustment is performed on these series because some of the individual component series that aggregate to the composite series exhibit different seasonal patterns that may be masked if seasonally adjusted directly at the aggregate level. The spec file for the composite series differs somewhat from normal CES implementation. The most significant difference is at the beginning of the spec file, where the SERIES spec is replaced with the COMPOSITE spec. Running X-12 employing the COMPOSITE spec produces an indirect seasonal adjustment of the composite series as well as a direct adjustment. Output from the indirect adjustment is saved under non-standard file extensions.
The COMPOSITE spec is covered in greater detail at the end of this document in Appendix A. Spec files for the component series are constructed the same way as a standard X-12 ARIMA run (because they are only adjusted directly, as is standard practice). Below is a current list of industries that are indirectly seasonally adjusted, along with their component series. Keep in mind that, for any given series, not all of the component series are published. In the table below, component series published with the first preliminary data release are denoted with an asterisk (*). Table 2. CES Series Indirectly Seasonally Adjusted
Appendix A: An Overview - Anatomy of a Spec FileInput File used for Seasonal Adjustment of Current Employment Statistics Data For published series, the Current Employment Statistics (CES) Program generally seasonally adjusts at the 3-digit NAICS level and aggregates to the higher levels. One of the main inputs to the seasonal adjustment process is a unique file called a specification (spec) file. The spec file contains a set of specifications or specs that give X-12 ARIMA various information about the data and the desired seasonal adjustment options and output. Each spec inside the spec file controls options for a specific function. For example, the SERIES spec contains specifications on the location and format of the data, while the X11 spec sets seasonal adjustment options such as seasonal adjustment transformation mode, output files to save, and diagnostic statistics to print. More detail on each spec used by CES follows. 1. SERIES spec SERIES{ TITLE = "Logging" START = 1993.01 PERIOD = 12 SAVE = A1 PRINT = BRIEF NAME = '10113310 - AE' FILE = 'c:\AE10113310.dat'} The main function of the SERIES spec is to specify details about the input data series such as the name, format, and location of the data. The Current Employment Statistics (CES) implementation employs 7 options or arguments with the SERIES spec.
2. TRANSFORM spec TRANSFORM{FUNCTION = LOG} The main function of the TRANSFORM spec is to transform or adjust the time series prior to estimating a regARIMA model. The CES implementation employs 1 argument with the TRANSFORM spec.
3. REGRESSION spec REGRESSION{ VARIABLES = (AO1995.02 AO1996.01 AO1999.01) USER = (dum1 dum2 dum3 dum4 dum5 dum6 dum7 dum8 dum9 dum10 dum11) START = 1986.01 FILE = 'c:\FDUM8606.dat' USERTYPE = TD SAVE = (TD AO) } The main function of the REGRESSION spec is to specify the regression components of a regARIMA model. The CES implementation employs up to 6 options with the REGRESSION spec.
Note: Not every option is used in every spec file. For example, if no predetermined outliers exist, then the VARIABLES argument will not be used. Likewise, if we are not treating a particular series for calendar effects, then the USER, START, FILE, and USERTYPE arguments will not be used. 4. ARIMA spec ARIMA{MODEL = (2 1 0) (0 1 1)} The main function of the ARIMA spec is to specify the ARIMA part of a regARIMA model. The CES implementation employs 1 option with the ARIMA spec.
5. ESTIMATE spec ESTIMATE{MAXITER = 1000} The main function of the ESTIMATE spec is to estimate the regARIMA model specified by the REGRESSION and ARIMA specs. The CES implementation employs 1 argument with the ESTIMATE spec.
6. FORECAST spec FORECAST{MAXLEAD = 24} The main function of the FORECAST spec is to generate forecasts (and/or backcasts) for the time series model given in the SERIES spec using the estimated regARIMA model. The CES implementation employs 1 argument with the FORECAST spec.
7. OUTLIER spec OUTLIER{ CRITICAL = 3.5 TYPES = AO } The main function of the OUTLIER spec is to perform automatic detection of point outliers, temporary change outliers, level shifts, or any combination of the three. The CES implementation uses this spec to automatically detect point outliers only. CES employs 2 arguments with the OUTLIER spec.
8. X11 spec X11{ MODE = MULT PRINT = (BRIEF -SPECSA -SPECIRR) SAVE = (D10 D11 D16) APPENDFCST = YES FINAL = USER SAVELOG = (Q Q2 M7 FB1 FD8 MSF) } The function of the X11 spec is to control certain aspects of the seasonal adjustment process. For example, the CES implementation uses the X11 spec to control the type of seasonal adjustment decomposition calculated (mode). CES employs 6 arguments with the X11 spec.
As previously mentioned, the CES Program generally seasonally adjusts published series at the 3-digit NAICS level and aggregates to the higher levels. However, there are a few cases in which CES seasonally adjusts published series at a level lower than the 3-digit NAICS level. In these instances, CES seasonally adjusts the 3-digit NAICS level indirectly; i.e., all of the component or lower level series are seasonally adjusted directly aggregated up to the 3-digit level. When this happens, the SERIES spec is replaced by the COMPOSITE spec in the specification file of the 3-digit series. 9. COMPOSITE spec COMPOSITE{ TITLE = "Construction of buildings" SAVE = (ISF ISA CMS) PRINT = BRIEF NAME = '20236000 - AE' SAVELOG = (INDTEST INDQ) } The COMPOSITE spec is used as part of the procedure for obtaining both indirect and direct adjustments of a composite series data series. This spec is required for obtaining composite adjustments and is used in place of the SERIES spec. The COMPOSITE spec can also specify details about the input data series such as the name of the series and which tables are to be printed or stored. The Current Employment Statistics (CES) implementation employs 5 options or arguments with the COMPOSITE spec.
Last Modified Date: February 6, 2009 |
Tools |
Calculators |
Help |
Info |