Integrative Data Analysis

Integrative data analysis (IDA) refers to a set of strategies in which two or more independent data sets are pooled or combined into one and then statistically analyzed. IDA approaches differ from and offer advantages over other methodological techniques that also strive to build cumulative knowledge bases, such as meta-analysis.

In meta-analysis, summary statistics across multiple studies are pooled together. Because IDA techniques pool original raw data, there is no loss of individual information as found within meta-analytic approaches, which allows researchers to find out what works, for whom, and in which contexts. In addition, the use of IDA affords expanded inquiry within many areas of health behavior research. IDA can be used to incorporate big data that were not originally intended for the examination of theoretically relevant measures. For example, searches on Google for health-related topics could be used as an objective measure of information seeking that could supplement what is gleaned from a self-report data source such as the Health Information National Trends Survey (HINTS). 

Data integration typically takes one of two forms:

  1. merging data by common data elements (units of information that are shared or widely used across data collection efforts.), where these elements are often multi-item scales or indices but can be individual items; or
  2. linking data sets through a common factor at the record level (e.g., linking across data through demographic information) such as that seen in the Surveillance, Epidemiology, and End Results (SEER)-Medicare data set, or at multiple levels such as the environmental or policy level (e.g., linking state- or county-level information with individual-level data). 

The Behavioral Research Program seeks to promote the use of IDA to answer novel cancer control questions to accelerate scientific discovery.

Funding Opportunities in Integrative Data Analysis

NCI Small Grants Program for Cancer Research
PAR-20-052 (NCI Omnibus R03 Clinical Trial Optional)

Secondary Analyses of Existing Datasets of Tobacco Use and Health
RFA-OD-19-022 (R21 Clinical Trial Not Allowed)

Accelerating the Pace of Drug Abuse Research Using Existing Data
PAR-18-062 (R01 Clinical Trial Optional)

Research Answers to National Cancer Institute's (NCI) Provocative Questions
RFA-CA-20-004 (R01 Clinical Trial Optional)
RFA-CA-20-005 (R21 Clinical Trial Optional)
Provocative Question #7: What methods can be developed to integrate patient-generated health data into electronic health records?

Secondary Analysis and Integration of Existing Data to Elucidate the Genetic Architecture of Cancer Risk and Related Outcomes
PA-17-239 (R01)

Accelerating the Pace of Child Health Research Using Existing Data from the Adolescent Brain Cognitive Development (ABCD) Study
PAR-19-162 (R01-Clinical Trial Not Allowed)
PAR-19-163 (R21-Clinical Trial Not Allowed)

Publications

Contact

Richard Moser, Ph.D.

Richard Moser, Ph.D.

Fellowship Training and Research Methods Coordinator

Last Updated
September 24, 2020