Posted: September 30, 2010
PDF of this report (16pp.)
Social Security celebrated its 75th birthday on August 14, 2010. Over the last 75 years, we have collected data to carry out our mission. Our data are about people-their wages, their identifying information, their employers, their addresses, and much more. The first regulation we published included a commitment to the public to safeguard the personal information entrusted to us. This commitment is as solid as it was 75 years ago and is further strengthened by privacy laws. We cannot publicly release much of our data because it is protected by the Privacy Act, the Internal Revenue code, the HIPAA, and other statutes. While some of the data can be anonymized, much of it cannot. Our inventory planning recognizes these constraints and all releases will protect privacy in accordance with all applicable laws.
In the first days of his administration, President Obama issued a memorandum for the heads of executive departments and agencies announcing a commitment to transparency and open government. In this document, the President instructed the Director of the Office of Management and Budget (OMB) to issue instructions for implementing the open government principles. The Open Government Directive (M-10-06) released on December 8, 2009 provided specific requirements agencies and departments must follow in order to fully adopt the open government principles. It also mandated the development and publication of an Open Government Strategic Plan.
Under the transparency principle, OMB directed agencies to explain in detail in their Open Government Plan how they would improve transparency. Specifically, the Directive said, "To increase accountability, promote informed participation by the public, and create economic opportunity, each agency shall take prompt steps to expand access to information by making it available online in open formats." We recognize that foundational steps to becoming more transparent and increasing accountability is to inventory agency high value information currently available for download and to identify high value information not yet available and establish a reasonable timeline for publication online in open formats [1].
We have completed a number of actions to increase transparency by making information available at Socialsecurity.gov and Data.gov.
We posted 22 datasets on Data.gov. All datasets relate to our core mission and align with our strategic goals and performance measures. The following information in Charts 1 and 2 shows the datasets already posted as well as those scheduled for release by the end of FY 2011.
Chart 1 -- Social Security Datasets on Data.gov |
|||||
Datasets |
How Dataset Meets High-Value Criteria |
||||
|
Increase agency accountability and responsiveness |
Improve public knowledge of agency and its operations |
Further core mission of agency |
Create economic opportunity |
Respond to identified public need and demand |
Hearing Process
|
♦ |
♦ |
♦ |
|
♦ |
Disability Decisions
|
♦ |
♦ |
♦ |
|
♦ |
2008 Freedom of Information Act (FOIA) Annual Report |
♦ |
|
|
|
♦ |
National Beneficiary Survey
|
♦ |
♦ |
♦ |
|
♦ |
Average Monthly Payments for Supplemental Security Income, by State or Other Area, Eligibility Category, and Age, December 2008 |
|
|
|
|
♦ |
Supplemental Security Income Payments by Type of Payment, Sex, Eligibility Category, and Age, December 2008 |
|
|
|
|
♦ |
Supplemental Security Income Payments, Recipients by State and Other Area, Eligibility Category, and Age, December 2008 |
|
|
|
|
♦ |
Average Monthly Social Security Benefits of Disabled Beneficiaries and Nondisabled Dependents by Basis of Entitlement, Age, and Sex, December 2008 |
|
|
|
|
♦ |
Number of Disabled Workers Receiving Social Security Benefits by Sex, State or Other Area, and Age, December 2008 |
|
|
|
|
♦ |
Supplemental Security Income Public-Use Microdata File, 2001 Data |
♦ |
♦ |
|
|
|
Old-Age, Survivors, and Disability Insurance Public-Use Microdata File, 2001 Data |
♦ |
♦ |
|
|
|
Benefits Data from the Benefits and Earnings Public-Use file, 2004 |
♦ |
♦ |
|
|
|
Earnings Data from the benefits and Earnings Public-Use File, 2004 |
♦ |
♦ |
|
|
|
Chart 2--Planned High-Value Datasets and Information for Data.gov |
|||||
High-Value Criteria |
|||||
|
Increase agency accountability and responsiveness |
Improve public knowledge of agency and its operations |
Further core mission of agency |
Create economic opportunity |
Respond to identified public need and demand |
New Disability Determination Services Processing
State Disability Determination Services Budget Information |
♦ |
♦ |
♦ |
|
♦ |
Freedom of Information Act Report for 2010 |
♦ |
|
|
|
♦ |
Retirement Claims Filed and Cleared (aggregated information) |
♦ |
♦ |
♦ |
|
♦ |
Number and Percentage of Retirement Claims Filed via Internet |
♦ |
♦ |
♦ |
|
♦ |
Internet Usage for Selected Online Transactions |
♦ |
♦ |
♦ |
|
♦ |
Quality Workload Statistics |
♦ |
♦ |
♦ |
|
♦ |
Earnings Public Use File (Demographic and earnings information for a sample of all social security numbers) |
♦ |
♦ |
|
|
|
Benefits and Earnings Public Use File updated (Social Security Benefit Information and Longitudinal Earnings Information of Beneficiaries only) |
♦ |
♦ |
|
|
|
National Survey of Children and Families |
♦ |
♦ |
|
|
|
Datasets from Statistical Modernization Initiative |
♦ |
♦ |
|
|
|
Field Office Waiting Time |
♦ |
♦ |
♦ |
|
♦ |
Social Security 800 Number Call Volume and Busy Rate |
♦ |
♦ |
♦ |
|
♦ |
Speed in Answering Social Security 800 Number Calls |
♦ |
♦ |
♦ |
|
♦ |
As our program responsibilities grow, data and databases also grow and change. Historically, we developed our programmatic software at different times for different purposes, and deployed it in a stovepipe environment [2]. In order to give a comprehensive and transparent picture of our data, we will need to derive datasets for Data.gov from several sources (e.g. Case Processing Management System, Appeals Review Processing System, etc.) as we continue conversion from our older, homegrown database structures {e.g. Master Data Access Method (MADAM)} to modern relational database technologies.
Going forward we will explore incorporating data transparency considerations into the systems development lifecycle so that providing high value data in standard formats for public use is a more efficient, automated process. Building steps into the lifecycle involves working with and through several governance groups. For example, the lifecycle Change Control Boards determine the appropriate placement of each step in the lifecycle relative to other tasks and data needs; the Deputy Commissioner for Systems Management Steering Committee determines the resource and schedule impact on project activities. We tentatively plan to identify a selection of pilot projects and evaluate their results prior to implementing any significant lifecycle changes.
In addition to the database strategy referenced in the paragraph above, we have made progress in overhauling information sharing through continued development of a business intelligence (BI) architecture. The process began in the late 1990's with internal collaboration and the use of technology solutions to address data integration and workload management challenges. Once the BI architecture was in place, we transitioned workloads incrementally, workload by workload, based upon value to us, and governed by our architectural review board. Master data management is an important component of building the BI architecture and provides consistency and data quality.[3] It is through the BI architecture that we will be able to produce summarized datasets for Data.gov.
Chart 3 shows three different methods for deriving datasets for Data.gov. Line 1 reflects the use of the BI architecture. Using master data management in the BI architecture, we summarize and aggregate data, making it available for Data.gov without PII. Besides the datasets available through the BI architecture model, we produce additional datasets through data extraction from the program specific data we maintain. Line two shows how we use anonymization methods to develop public use files as well as summarized reports. Line three illustrates a similar process to produce public reports from the survey data we collect, along with program specific data we maintain.
While not reflected in the Chart 3 diagram, we also produce datasets in response to Freedom of Information Act (FOIA) requests. Since we generate the data and information in response to individual requests, there is not just one method for responding to FOIA requests. We look at each information release to determine if it is appropriate for Data.gov.
Based upon a review of our data, we identified high value data categories, shown below, that will improve public knowledge of our operations and our programs. For all of the categories, we will extract reports that contain summarized data so that even if the public combines our datasets with other data, no violations of privacy can occur. We will verify and certify the quality of the data in accordance with the Information Quality Act and the Data Quality Framework developed by our Executive Accountable for Publicly Disseminated Federal Spending Information Integrity. In addition, our Open Government Executive Steering Committee will advise, and as needed, concur, on the appropriate level of data (local, regional, national, etc.) and frequency of release (e.g. weekly, annually, etc). As needed, we will refer datasets to disclosure review boards for clearance should there be any questions regarding security and/or privacy. The Checklist for Data.gov Submission (see Appendix) will be signed off by the appropriate officials before the data are made available to Data.gov.
The following list is grouped according to major mission and program areas, not by priority. These areas have high value data that are available and not released. These areas supplement the extensive data already released on Disability Determination Services and Office of Disability Adjudication and Review case processing, as well as statistical information that our researchers make available on a regular basis. Our first priority for earliest release, in FY 2012, will be the data that have already been determined to be the "official agency measure" in our Unified Measurement System and available through the BI Architecture, as well as data in support of Performance.gov metrics. The second priority will be budget information, in consultation with the Senior Accountable Official on the Quality of Federal Spending Information. The third priority will be administrative information that includes data about staffing and human resources. By focusing efforts in these areas, by the end of FY 2012, we will provide a transparent picture of program activities, our budget, and our staff. We will also release as soon as possible, without respect to the above priorities, those datasets requested by our key audiences. (See our Communications Plan, Appendix C, in the Open Government Plan.)
The following chart reflects the inventory of high value data not released, but available as explained in the paragraph above.
Chart 4--Future New High Value Datasets
|
Information Technology Systems | Social Security Advisory Board |
Reimbursable activity | Automation Investment Fund |
Delegated Buildings | Disaster Relief |
Construction | Recovery Act funds |
Low Income Subsidy | Office of the Inspector General |
Continuing disability review allotments | Other allotments and targeted appropriations |
Supplemental Security Income non-disability redetermination allotments |
Continuing our long tradition of work in the research community, we will develop and release additional statistical tables, studies and research working papers as part of our ongoing mission. Research and statistical products by our Office of Research, Evaluation, and Statistics are available at: http://www.ssa.gov/policy/. Our regular publications include, for example, the Annual Statistical Supplement to the Social Security Bulletin. The Supplement is a major resource for data on the nation's social insurance and welfare programs. The majority of the statistical tables present information about the programs we administer. In 2009, our Office of Research, Evaluation and Statistics began a 3-year initiative to modernize how we produce statistical tables and publications. The new process will include the development of summary data sets that can be converted to machine readable format (e.g., comma separated value). Consistent with our commitment to transparency, we will include the new datasets in the Data.gov inventory. As reflected in our Open Government Plan, as well as Chart 2 of this document, we plan to release the first dataset based upon this new modernized process in FY 2011. As the modernization continues, and new datasets become available, we will review and include them in the Data.gov inventory.
One of our ongoing initiatives that foster the use of our data is the Retirement Research Consortium (RRC). The Retirement Research Consortium (RRC) consists of three multidisciplinary centers housed in three separate institutions (Boston College, the University of Michigan, and the National Bureau of Economic Research). We fund the activity through cooperative agreements and awarded approximately $7.5 million to the RRC in FY 2009, the last year of the current 5-year award. We expect funding to continue at that level for each of the remaining years of the award.
The RRC has three main goals:
To meet these goals, the centers perform many activities. They conduct research, prepare policy briefs and working papers, hold an annual conference, and provide research and training support for young scholars. A link to recent RRC research is provided below:
http://www.ssa.gov/policy/rrc/subjects.htmlThe Trustees Report and other actuarial information is available at: http://www.socialsecurity.gov/OACT/pubs.html
Other useful data are currently available from the Office of the Chief Actuary using the following link:
This link provides drill down capability (from 1967 forward) for information on the number of beneficiaries, average benefits, and type of beneficiaries. http://www.socialsecurity.gov/OACT/ProgData/icp.html
Chart 5- Information That May Be High Value but Not Currently Available
|
This document provides the results of our complete data inventory in accordance with the Open Government Directive. However, our information and data will continue to evolve. Therefore, we will need to refresh the inventory periodically as new programs, applications, research, surveys, and public engagements change our data holdings. In some cases, we will retire and consolidate data holdings.
In addition, until the launch of Data.gov, we developed our methods and processes for gathering data, extracting data, and displaying data to support internal management, operational, and executive needs. For example, within the organization today, we do not build data gathering for public release into the project life cycle, and extraction methods have been primarily for executive decision-making or specific research. Today, where the data are available, we will reformat and present it in ways that benefit the public, while at the same time protecting their personal information.
These methods will need to evolve to incorporate transparency of the data (i.e. public release in standard formats) into the requirements for new data gathering efforts, with an ultimate goal of operating with a more agile process that considers both internal and external use of data in the future. In the future, we will specify the format and presentation of the data upfront so that preparation of data for public release will be both quicker and easier. We are exploring putting the appropriate tasks into the Planning and Analysis (P&A) and Construction phases of our systems project life cycle to identify and verify the data that would be published as well as to conduct data validation and certification. Our goal is to integrate this process with other appropriate data, testing and certification activities rather than add another stand-alone activity.
[1] High value information is information that can be used to increase agency accountability and responsiveness; improve public knowledge of the agency and its operations; further the core mission of the agency; create economic opportunity; or respond to need and demand as identified through public consultation.
[2] Social Security Administration, Information Technology Vision 2009-2014, February 2009, page 14.
[3] Case Study: BI Strategy Prepares US Social Security Administration for the Future, Gartner Research, Bill Gassman, July 31, 2009, page 4.
[4] This chart does not include datasets produced in response to FOIA requests. The numerous datasets related to our budget process, Annual Performance Plan, Performance.Gov and Priority Goals are also not included.
We used our idea-sharing tool, "OpenSocialSecurity," to capture your ideas for creating our Open Government Plan. We received many ideas on how we could increase our transparency, participation, collaboration and innovation.
Under certain circumstances, Social Security can provide a speaker to discuss our programs, benefits, and services. We provide speakers free of charge, and they are generally available during business hours only. Please note that we are not able to fulfill all requests.
The Administration is committed to creating an open and transparent government through the following initiatives:
Alan Lane
Associate Commissioner for Open Government
Address:
6401 Security Blvd
West High Rise Rm. 1126
Baltimore, MD 21235
Email: open.government@ssa.gov