Systems Biology Knowledgebase for a New Era in Biology
A Genomics:GTL Report from the May 2008 Workshop
Your comments and suggestions are requested and may be submitted here.
To promote development of a data and information management system, or knowledgebase, DOE’s Office of Biological and Environmental Research (OBER) hosted a workshop May 28–30, 2008, in Washington, D.C. Experts from scientific disciplines relevant to DOE missions and from the enabling technologies (e.g., bioinformatics, computer science, database development, and systems architecture) met to determine the opportunities and requirements for developing and managing this knowledgebase for OBER’s Genomics:GTL program (GTL).
Workshop participants defined the proposed GTL Knowledgebase, or GKB, as an informatics resource that would focus on DOE science-application areas yet also be widely and easily applicable to all systems biology research. Also discussed were requirements for effective development of data capabilities for systems biology that could be applied specifically to plants and microbes (i.e., bacteria, archaea, fungi, and protists—unicellular eukaryotes such as microalgae) as well as to three areas of science related to DOE missions: (1) researching and developing biofuels, (2) advancing fundamental understanding of the global carbon cycle, and (3) understanding and using biological systems for environmental remediation. Participants were organized into working groups based on four knowledgebase themes: data, metadata, and information; data integration; database architecture and infrastructure; and community and user issues.
The workshop highlighted DOE’s unique and extensive data-management needs as a foundation of mission-inspired systems biology research. These needs require a principal GTL data resource, the GKB, with critical links to complementary systems supported by other agencies and community organizations worldwide. This knowledgebase would facilitate a new level of scientific inquiry by serving as a central component for the integration of modeling, simulation, experimentation, and bioinformatic approaches. The GKB also would be a primary resource for data sharing and information exchange among the GTL community. Furthermore, not only would the GKB allow scientists to expand, compute, and integrate data and information program wide, it also would drive two classes of work: experimental design and modeling and simulation. Integrating data derived from computational predictions and modeling, as envisioned in the knowledgebase project, would increase data completeness, fidelity, and accuracy. These advancements in turn would greatly improve modeling and simulation, leading to new experimentation, analyses, and mechanistic insight. Scientists’ ever-increasing exploitation of the dynamic linkages among data integration, experimentation, and modeling and simulation—aided by the GKB—will advance efforts to achieve a predictive understanding of the functions of biological systems. The knowledgebase, therefore, must serve multiple roles, including (1) a repository of data and results from high-throughput experiments; (2) a collection of tools to derive new insights through data synthesis, analysis, and comparison; (3) a framework to test scientific understanding; (4) a heuristic capability to improve the value and sophistication of further inquiry; and (5) a foundation for prediction, design, manipulation, and, ultimately, engineering of biological systems to meet national needs in bioenergy, environmental remediation, and carbon cycling.
Suggested citation for this report: U.S. DOE. 2009. U.S. Department of Energy Office of Science Systems Biology Knowledgebase for a New Era in Biology: A Genomics:GTL Report from the May 2008 Workshop, DOE/SC-113, U.S. Department of Energy Office of Science (http://genomicsgtl.energy.gov/compbio/).
Download full document
- fast download PDF (3771 kb)
- high resolution PDF (14,262 kb)
Order printed copies or submit comments and suggestions.
Download document by sections (fast download PDFs)
- Cover and Table of Contents (PDF, 208 kb)
- Executive Summary (PDF, 276 kb)
- Introduction (PDF, 1256 kb)
- Data, Metadata, and Information (PDF, 393 kb)
- Data Integration (PDF, 472 kb)
- Database Architecture and Infrastructure (PDF, 367 kb)
- GTL Knowledgebase Community and User Issues (PDF, 143 kb)
- Appendices
- Information and Data Sharing Policy (PDF, 112 kb)
- Use Case Scenarios of Systems Biology Investigations Utilizing the GTL Knowledgebase (PDF, 811 kb)
- Systems Biology for Bioenergy Solutions (PDF, 217 kb)
- Opportunities and Requirements for Research in Carbon Cycling and Environmental Remediation (PDF, 127 kb)
- Summary List of Findings from Introduction (PDF, 89 kb)
- Bibliography (PDF, 100 kb)
- Descriptions of a Selected Sampling of Databases Having Relevance to the GTL Knowledgebase (PDF, 146 kb)
- Genomics:GTL Systems Biology Knowledgebase Workshop: Agenda, Participant List, and Biosketches (PDF, 177 kb)
- Glossary (PDF, 127 kb)
- List of Web Addresses (PDF, 88 kb)
- Acronyms and Abbreviations (PDF, 137 kb)
- Information and Data Sharing Policy (PDF, 112 kb)
*Note to readers: The following notice applies to Fig. 1.3, which is used with permission from Science and AAAS for this report.
Readers may view, browse, and/or download material for temporary copying purposes only, provided these uses are for noncommercial personal purposes. Except as provided by law, this material may not be further reproduced, distributed, transmitted, modified, adapted, performed, displayed, published, or sold in whole or in part, without prior written permission from the publisher.