README Census 2000 Summary File 4 Delivered via FTP CONTENTS About the FTP Application Other Sources of the Data File Naming Convention Technical Documentation Data Records and Segmentation File Record Layout FTP File Transfer UnZipping the Files Spreadsheet and Data Base Aids For step-by-step instructions for moving the data and the structure into a spreadsheet (including screen shots), please see www.census.gov/support/SF4ASCII.html. Structure files in Access2000 and other formats are available at http://www.census.gov/support/2000/SF4/. We are unable to provide one-on-one support for applications of the data to specific spreadsheets or data base software. ABOUT THE FTP APPLICATION - The application is intended for experienced users of census data, compressed files, and spreadsheet/database software. - FTP users should have a fast file transfer capability. - Users of the FTP application need to unzip the compressed file after downloading, then import it into the spreadsheet/database software of their choice for data analysis and table presentation. OTHER SOURCES OF THE DATA - American FactFinder at factfinder.census.gov . - This system provides Internet access to all tables plus additional derived tables called Quick Tables and Geographic Comparison Tables. - The system can create thematic maps on various data items. - The system can create reference maps defining the geographic area. - Tables are available on American FactFinder on the morning of public release. DVD/CD-ROM - CD-ROMs with software are available for individual states on or shortly after the public release date. - DVDs are created after all states are released. - They can be purchased online ($50 for CD-ROM; $70 for DVD) from the Census Catalog (www.census.gov, select Census Catalog from left sidebar) or ordered by telephone from the Customer Services Center (301-763-INFO). - DVDs and CD-ROMs contain the same software. - Software is proprietary but are in a format that can easily be imported into data bases or spreadsheets. FILE NAMING CONVENTIONS - The naming convention for geographic header files is ssgeo_uf4.zip - ss is USPS state abbreviation - The codes are in technical documentation on page 7-1, located at http://www.census.gov/prod/cen2000/doc/sf4.pdf - geo_uf4.zip is a constant across SF4 geographic header names. - Naming convention for SF4 data files is ssiiiyy_uf4.zip. - ss is USPS state abbreviation - iii is the characteristic iteration (total population, race groups, American Indian and Alaska Native tribes, Hispanic/Latino, and ethnic groups. ) - Characteristic iteration codes are in the Appendix H of the technical documentation, which is available at http://www.census.gov/prod/cen2000/docs/sf4.pdf - page=278. - yy is the number of the file - Valid codes are 01 through 38. See below for distribution of tables across files. - _uf4.zip is a constant across SF4 data file names TECHNICAL DOCUMENTATION - The complete technical documentation for SF4 is available at http://www.census.gov/prod/cen2000/doc/sf4.pdf. DATA RECORDS AND SEGMENTATION Table distribution across data files is as follows: --------------------------------------------------------------------------- Number Of Starting Ending data matrix matrix File name items number number --------------------------------------------------------------------------- ssgeo.uf4 ssiii01.uf4 220 PCT1 PCT4 ssiii02.uf4 249 PCT5 PCT16 ssiii03.uf4 208 PCT17 PCT34 ssiii04.uf4 189 PCT35 PCT37 ssiii05.uf4 244 PCT38 PCT45 ssiii06.uf4 240 PCT46 PCT49 ssiii07.uf4 214 PCT50 PCT61 ssiii08.uf4 248 PCT62 PCT67 ssiii09.uf4 220 PCT68 PCT71 ssiii10.uf4 221 PCT72 PCT76 ssiii11.uf4 106 PCT77 PCT78 ssiii12.uf4 234 PCT79 PCT81 ssiii13.uf4 99 PCT82 PCT84 ssiii14.uf4 223 PCT85 PCT86(pt.) ssiii15.uf4 237 PCT86(pt.) ssiii16.uf4 250 PCT87 PCT103 ssiii17.uf4 207 PCT104 PCT120 ssiii18.uf4 185 PCT121 PCT131 ssiii19.uf4 157 PCT132 PCT137 ssiii20.uf4 213 PCT138 PCT143 ssiii21.uf4 144 PCT144 ssiii22.uf4 247 PCT145 PCT150 ssiii23.uf4 244 PCT151 PCT156 ssiii24.uf4 228 PCT157 PCT162 ssiii25.uf4 246 PCT163 PCT208 ssiii26.uf4 49 PCT209 PCT213 ssiii27.uf4 240 HCT1 HCT9 ssiii28.uf4 199 HCT10 HCT18 ssiii29.uf4 222 HCT19 HCT22 ssiii30.uf4 165 HCT23 HCT25 ssiii31.uf4 236 HCT26 HCT29 ssiii32.uf4 250 HCT30 HCT39 ssiii33.uf4 187 HCT40 HCT55 ssiii34.uf4 222 HCT56 HCT61 ssiii35.uf4 145 HCT62 HCT70 ssiii36.uf4 236 HCT71 HCT81 ssiii37.uf4 218 HCT82 HCT86 ssiii38.uf4 238 HCT87 HCT110 -------------------------------------------------------------------------- Five fields are carried over from the geographic header file into each data file. - These fields are file identification (FILEID), state abbreviation (STUSAB), characteristic iteration (CHARITER), characteristic iteration file sequence number (CIFSN) and logical record number (LOGRECNO). - These five fields appear in the geographic header record in a fixed field format. - These five fields appear in the 38 data files in a comma delimited format. - These fields are used to "match" records in the 38 data files for a particular characteristic iteration to the geographic information in the geoheader. - A file set structure schematic appears in the technical documentation which is located at http://www.census.gov/prod/cen2000/doc/sf4.pdf. FILE RECORD LAYOUT - For a layout of the individual tables for each file, see the technical documentation at www.census.gov/prod/cen2000/doc/sf4.pdf. FTP FILE TRANSFER - Summary File 4 (SF4) FTP directory is at ftp://ftp2.census.gov/census_2000/datasets/Summary_File_4. - Each state directory provides all files available for the identified state. - The directory for each state has one geographic header file and 38 files for each iteration which meets the criteria for inclusion in this product. - There is a potential (but not likely) for 9,000 files for each state--one geographic header file and 38 data files for each of the 336 iterations. - The chart on page 3 of this document lists the table numbers available in each of the 38 files. - Once uncompressed, the files are in a flat ASCII format. - No software is provided. DOWNLOADING MULTIPLE FILES - UNIX environment--"mget" subcommand allows transfer of multiple files using the wildcard character. - Example: ftp> prompt off ftp>mget ne* (for this example, Nebraska is selected). - Windows Environment--many FTP products have been developed which have the capability to download multiple files with a single command. - We used the ws_ftp product in testing the download. - A demonstration copy is available at http://www.ipswitch.com/ - An Internet search using the term "download multiple files" yielded other similar products. - For step-by-step instructions with screen shots, please see http://www.census.gov/support/2000/SF4/ UNZIPPING THE FILES - Files compress at approximately 95% compression. - Any standard UnZIP software package can be used. - In testing we used PKZIP for Windows. It's available at www.pkware.com. - Unzipped file is in flat ASCII format. - For step-by-step instructions with screen shots, please see http://www.census.gov/support/2000/SF4/ UNZIPPED FILES - Geographic header file has fixed fields. - File data dictionary is at http://www.census.gov/prod/cen2000/doc/sf4.pdf See chapter 7 for the data dictionary. - Data files (files 01-38) have comma delimited fields - Fields from the geographic header file carried over to the data files are comma delimited in the data files. SPREADSHEET AND DATA BASE AIDS - For step-by-step instructions with screen shots for moving the data and structure to a spreadsheet, please see www.census.gov/support/SF4ASCII.html. - Structure files in Access2000 and other formats are available at http://www.census.gov/support/2000/SF4/ - We are unable to provide one-on-one support for applications of the data to specific spreadsheets or data base software.