DOE Genomes
Human Genome Project Information  Genomics:GTL  DOE Microbial Genomics  home
-
HGP Home
Human Genome News Archive Edition

Vol.10, No.3-4   October 1999 
Available in PDF 
 
In this issue... 

DOE '99 Oakland Highlights
* Meeting Intro
* HGP Progress, Challenges
* Joint Genome Institute
*
JGI Facility Opening
*
Sequencing
*
Device Speeds Sequencing
* Chromosome 19 Genes
* Functional Genomics
* Sample Prep System
* Informatics
* Education & Bioethics
* Microbial Genomes
* Next Meeting Feb. 2000

Genome Project
* Accelerated Timetable
* FY1999 Budgets
* JGI Refits Facility
* Progress

In the News 
* Drosophila Near Completion
* Directors, Researchers Receive Awards
* Chromosomes 21, 22 Near Finish
* Human MHC Region Sequenced
* Mouse Probes Aid Mapping
* Genetic Testing Committee
* DOE Labs & Biomedicine
* Biomedical Engineering Proposals
* Drug Firms Create SNP Resource
* SBIR 1999 Awards
* HUGO News
* 1999 Hollaender Winners

Microbial Genomics 
* EcoCyc Database for E. coli
* Developing EcoCyc
* Thermotoga Sequence
* Clostridium Sequenced
* Microbial Gene Finder
* Neisseria Sequence
* Microbial Web Sites

Ethical, Legal, and Social Issues
* Education, Counseling Foundation
* ELSI Studies
* ELSI Retrospective
* Protecting Privacy

Informatics 
* MGI 2.2 Released
* DNA Repository

Web, Other Resources, Publications 
* Metabolic Pathways
* New Staden Package
* PDB Newsletter
* Polymorphism Database
* Bacterial Protein Database
* DOE BER Publications
* HGP Information Web
* Biotechnology Business Webs
* Behavioral Genetics
* Office of Science Reports
* Articles on Genetics

Funding 
* Genome Research Funding

Meeting Calendars & Acronyms 
* Genome & Biotech Meetings 
* Training Courses & Workshops 
* Acronyms 


*HGN archives & subscriptions 
*HGP Information home

Informatics

Report from 1999 DOE Genome Meeting

Oakland presentations emphasized that genome-sequencing projects are producing data at a rate exceeding current analytical and data-management capabilities. Additionally, some current computing problems are expected to scale up exponentially as the data increase.

Genome Annotation Consortium

Ed Uberbacher, Jay Snoddy, and Phil LoCasio (all at Oak Ridge National Laboratory) offered an update on progress at ORNL and the multi-institutional Genome Annotation Consortium (GAC), which was established to address massive computational and informational challenges.

The goals of this work are to develop a system for whole-genome annotation that (1) organizes various types of data around genome frameworks that can be cross-indexed, compared, and cross-navigated and (2) allows multiple analytical methods to be applied to the same data. Steps in the annotation process include the following:

  • retrieving data and assembling genomes;
  • computationally finding genes and other sequence-level features;
  • computationally determining homology, function, and other relationships;
  • genome-wide structural modeling of gene products;
  • analyzing and modeling pathways and systems; and
  • managing, accessing, and visualizing data.
Snoddy, Uberbacher, and LoCasio discussed the growing complexity and expense involved in biological computing for genome assembly and annotation. They noted that assembly problems will increase as billions of nucleotides are entered as draft sequences into the sequence databases by mid-2000, when the daily assembling of new data alone will require over 1600 workstations.

Other significant computational challenges include integrating the major community maps, which often have inconsistences and discrepancies, and performing comprehensive sequence analyses for gene modeling, which requires the time-consuming application of several algorithms. Furthermore, completing some desired analyses for protein classification currently could require about 70 days on a 1024-node processor. Challenges are similar for such other comparative processes as genome-to-genome alignment for studying mouse and human synteny. As sequence numbers and lengths increase, challenges become even greater for making phylogenetic gene and species trees. Meeting these and other high-performance biological computing needs, the speakers emphasized, demands a centralized approach with advanced infrastructure and specialized facilities.

Uberbacher gave an overview of GAC progress in developing tools, servers, and special data views to serve the community. Achievements include establishment of data-acquisition and semiautomated sequence-assembly components and modules that are integrated to allow comprehensive genome-wide analysis. He noted that the computation-based GRAIL-EXP is finding about 10 times more human genes than investigators had identified previously, as indicated in the GenBank annotation. All human and microbial gene-analysis tools are available to researchers. At present, the Oak Ridge group is focusing on urgent annotation challenges from the massive sequencing ramp-up under way at the DOE Joint Genome Institute.

For More Information:

Multiple Genome Analysis: WIT

A poster by Natalia Maltsev (Argonne National Laboratory, ANL) and colleagues described ANL's WIT system. WIT was designed and implemented to support genetic sequence and comparative analysis of sequenced genomes and metabolic reconstructions from sequence data. It now contains data from 34 genomes (some incomplete).

The authors believe that parallel analysis of a large number of phylogenetically diverse genomes can add much to the understanding of higher-level functional subsystems and major physiological designs. They reported a new method for using conserved clusters of genes from numerous genomes to predict functional coupling between genes. Although early results are encouraging, investigators believe the precision of prediction and the amount of accessible functional coupling will increase dramatically as more genomes are added. They emphasized that this class of data may well become a significant resource for establishing the function of hypothetical proteins, better understanding the functions of paralogous genes, and reconstructing connections in higher-level functional subsystems.


The electronic form of the newsletter may be cited in the following style:
Human Genome Program, U.S. Department of Energy, Human Genome News (v10n3-4).

Return to Top of Page

Acronym List

Send the url of this page to a friend


Last modified: Wednesday, October 29, 2003

Home * Contacts * Disclaimer

Base URL: www.ornl.gov/hgmis

Office of Science Site sponsored by the U.S. Department of Energy Office of Science, Office of Biological and Environmental Research, Human Genome Program