CGCI Publication Guidelines

Like other National Cancer Institute large-scale genomics initiatives, the Cancer Genome Characterization Initiative (CGCI) is a community resource project. After validation, data are rapidly made available for use by other researchers. To act in accord with the Fort Lauderdale principles and support the continued prompt public release of large-scale genomic data prior to publication, researchers preparing manuscripts that (1) contain descriptions of CGCI data and/or (2) are of comparable scope to an initial CGCI disease-specific comprehensive global analysis publication, are required to coordinate their independent reports with CGCI's publication schedule. Journal editors who receive such manuscripts are strongly encouraged to contact the Office of Cancer Genomic (OCG; ocg@mail.nih.gov) to ensure that their publication timeline is in line with CGCI.

Comparable scope is defined as global genome-wide analysis of CGCI disease-specific data or data across diseases from one or more platforms (for disease types that encompass CGCI subprojects, see table below). As described below, publications of comparable scope are restricted prior to the release of first disease-specific manuscript.

As a general reminder, all use and publications based on CGCI data are required to follow the Data Use Limitations outlined in the CGCI Data Use Certification at the database of Genotypes and Phenotypes (dbGaP).

     Use of CGCI Data in Manuscripts Prior to Publication of Initial CGCI Disease-Specific Integrated Analysis

     Use of CGCI Data in Manuscripts After Each Initial CGCI Disease-Specific Integrated Analysis Publication

     Use of CGCI Data for Research Purposes other than Publication

     CGCI Program Attribution in Publications and Presentations

Use of CGCI Data in Manuscripts Prior to Publication of Initial CGCI Disease-Specific Integrated Analysis

Similar to standard practices of other large-scale genomic projects, the National Cancer Institute does not consider the deposition of CGCI data into its own data portal or public databases as the equivalent of publication in a peer-reviewed journal. Therefore, although the data are available to others, the data producers consider these data as unpublished and expect that the data will be used in accord with standard scientific etiquette and practices concerning unpublished data.

The first paper authored by each CGCI subproject team (https://ocg.cancer.gov/programs/cgci/projects) includes the data and analysis from a majority (~90%) of qualified CGCI cases. Specifically, these manuscripts report on the comprehensive, integrated analysis of multiple CGCI subproject datasets, including, but not limited to:

  • Copy number variation, gene and miRNA expression, structural variants and DNA sequence/mutation analysis or a combination thereof; or
  • An analysis of data from a single platform across more than one tumor type, where not all the subproject teams have published their first comprehensive analyses papers for the tumor types being analyzed

Investigators may only publish a manuscript before the CGCI subproject team has published the global analysis on that tumor type IF the publication uses a very limited dataset (less than 5 genes). Investigators should submit their abstract and request to OCG by emailing ocg@mail.nih.gov.

Prior to the publication of comprehensive analysis on a specific tumor type, available datasets are subject to the same standard principles of scientific etiquette that apply to use of unpublished findings from other sources. Below is the list of tumor datasets that are available for use now. The list will be updated when the first global manuscript for each tumor is accepted for publication. For questions, do not hesitate to contact ocg@mail.nih.gov.

Tumor Type/ Data Use Limitation Subproject Status Data Availability Status
Burkitt lymphoma/ general research Pediatric cases are completed. Adult cases are in progress. Pediatric data are available at NCI's Genomic Data Commons (GDC) and OCG Data Coordinating Center (DCC). EBV sequences are available as BAM alignments from the public directory at the DCC.
Non-Hodgkin lymphoma/ general research Completed; all raw and analyzed data are available. Open and controlled access to mRNA-seq data and whole exome, and whole genome sequencing data are available through NCBI's Sequence Read Archive and OCG DCC.
HIV+ associated cervical cancer/ general research In progress

Controlled-access whole genome sequencing, RNA-Seq, miRNA Seq and Targeted sequencing data are available at NCI's Genomic Data Commons (GDC). Controlled-access ChIP-Seq data and open-access methylation array data are available at the DCC. Both open and controlled-access higher level (analyzed) sequencing data are available at OCG DCC.

HIV+ associated lung cancer/ general research In progress Data not available.
HIV+ associated diffuse large B cell lymphoma/ general research In progress Data not available.

Use of CGCI Data in Manuscripts After Each Initial CGCI Disease-Specific Global Analysis Publication

There are no restrictions on the use of CGCI data in manuscripts after the initial CGCI global analysis publication if the data are used specifically within the Data Use Limitations. 

Use of CGCI Data for Research Purposes other than Publication

If data are used within the Data Use Limitations, there are no restrictions on the use of CGCI data for legitimate research purposes not involving publication or public presentation. For example, researchers may use CGCI data in research grant applications at any time with appropriate acknowledgement (see below), regardless of whether the initial CGCI disease-specific global analysis has been published.

CGCI Program Attribution in Publications and Presentations

OCG requests that authors who use data from CGCI acknowledge the appropriate subproject team(s) in their work by referencing the CGCI dataset in accordance with the CGCI Data Use Certification at dbGaP. Inclusion of CGCI subproject team members as authors is not required. 

If investigators desire to collaborate with CGCI members, they should contact the specific subproject team. Authors are also encouraged to recognize the contribution of the appropriate specimen donors and research groups via the acknowledgements section in their publication. Similarly, OCG requests that journal editors and reviewers attempt to ascertain if CGCI is cited and if appropriate acknowledgements are made.

For questions, please contact the OCG by emailing ocg@mail.nih.gov.

Last updated: June 22, 2020