Epidemiology and Genetics Research Branch

What Data are in the LI GIS?

The LI GIS for researchers consists of more than 80 datasets on Nassau and Suffolk counties, and to a lesser extent on surrounding areas, including:

  • Geographic data, including location of roads, parks and landmarks
  • Demographic data such as age, sex, and income of the population
  • Health outcome data, including relative breast cancer incidence
  • Environmental data, including land use; land cover; railroads; traffic; water use; potential sources of water pollution; releases of chemicals into water, air, and soil; toxic chemicals; hazardous and municipal waste; and radiation

The researchers also have access to statistical tools and extensions that allow them to investigate possible relationships between environmental exposures and breast cancer rates.

The LI GIS for the public contains a smaller sample of these data to give the public a window into how a GIS works. The public can view breast cancer rates alongside either hazardous waste sites or pesticide detections on the same map. Individual cancer risk cannot be deduced from the public maps. Statistical analyses are needed to determine any possible health implications or disease risk.

Sources of the Data Include:

  • State Health Departments
  • U.S. Geological Survey
  • U.S. Postal Service
  • U.S. Bureau of the Census
  • U.S. Environmental Protection Agency
  • U.S. Department of Agriculture
  • U.S. Department of Commerce

Metadata Browser

The Metadata Browser provides information about the datasets in the LI GIS for researchers. Metadata are data about data. The Metadata Browser is a useful resource for researchers who require detailed information about the data and how they are organized in the Data Warehouse. This information helps researchers assess the usefulness and relevance of the data for their purposes.

The Metadata Browser has four areas:

  • Data Warehouse – Describes how the datasets are integrated into the LI GIS
  • Federal Geographic Data Committee (FGDC) Reports – Metadata on the geographic data
  • Source Datasets – Metadata about the source datasets, which are the raw materials used to build the Data Warehouse
  • Data Quality Summaries – Reports on the origin of the data, the method and purpose of data collection, any temporal issues, and a general assessment of the quality of the data. Also included are issues to consider when choosing which types of data to use in research, and privacy or data ownership concerns.

In most cases, data for the LI GIS were collected for purposes other than health-related research. The user should review the metadata carefully, paying special attention to accuracy, consistency, quality, and use constraints.

Some Sources of Data Provided for the LI GIS