In this issue...
1997 Santa Fe Highlights
Patrinos Address
Joint Genome Institute (JGI) Comes of Age
JGI Sequencing
JGI Informatics
JGI and Bermuda Quality Sequence
Grants Awarded for JGI Collaboration
JGI Sequencing Clones
Sequencing Strategies,Tools
Gene-Discovery Resources
Sequencing at NIH NHGRI
Functional Genomics
Data Surge Challenges Informaticists
Genome Annotation: Informatics Advances Needed for Age of Functional Genomics
ELSI: Rapid Progress Accelerates Societal Impact of Genome Research
1999 DOE HGP Meeting Set for California
Human Genome Project Administration
New 5-Year Goals, Project Midpoint
DOE, NIH Discuss Informatics
JASON Group Review
BER Genome Instrumentation Research
In the News
Private-Sector Sequencing Plan
Bang for the Buck: Government-Backed Research Underpins Potentially High Payoff Ventures
Palmisano Joins DOE OBER
DNA Files series to be on NPR
HUGO Addresses Sample Collection
Sickle Cell Mice May Lead to New Treatments
TIGR Sequencing 6 More Microbes
Tuberculosis Microbe Sequenced
C. Elegans Sequencing Nears Finish
HGMIS Website Restructured
cDNA Cloning Workshop Identifies Critical Issues
Survey Identifies Growing Need for Synchrotron Analyses
NCGR Announcements
Publications
Report on Functional Consequences of Gene Expression
Book on Tuskegee Conference
Book Focuses on Biomarker Implications, Conference Proceedings
Genome Analysis Protocol Handbook
Software and the Internet
Mouse Genome Informatics Release 2.0
New System Identifies Polymorphisms
DOE Supports Web Site for 1997 AAAS Genome Symposium
Expressed Human Genome Database
Funding
DOE ELSI
NIH NHGRI
NHGRI Initiates Mailing List
U.S. Genome Research Funding
Meeting Calendars & Acronyms
Genome and Biotechnology Meetings
Training Courses and Workshops
Acronyms
HGN archives and subscriptions
HGP Information home
|
Genome Annotation: Informatics Advances Needed for Age of Functional Genomics
Sharply increasing rates of sequence-data production are placing greater and greater demands on information systems for new ways to view and better understand the meaning of the growing strings of As, Cs, Ts, and Gs piling up in GenBank and community databases (see article.). Enriching data with such information as gene features and locations, gene-control regions, related sequences, gene-expression patterns, gene and protein families, pathways, and phenotypes can help pave the way for a successful transition from the current structural genomics phase of DNA mapping and sequencing to functional genomics studies.
Genome Annotation Consortium
Ed Uberbacher (Oak Ridge National Laboratory) described several pilot projects in the multi-institutional Genome Annotation Consortium, which was established to minimize some problems posed by genome-scale sequencing and to build a shared infrastructure for integrating diverse biological information. Four basic components of the pilot projects are daily sequence and biological data acquisition from 19 major genome centers; automated data analysis to link biological information to sequence using tools for exon prediction, gene modeling, and sequence comparison; a storage, maintenance, and update component; and a series of methods for browsing, querying, and accessing other tools of value to researchers. An important goal is to build a level of interoperation using CORBA, which has not yet been implemented into the system.
In outlining some current challenges in sequence annotation, Uberbacher noted that no community-wide annotation processes exist and that much of the annotation does not describe the methods and evidence used to create the data. Moreover, even if the sequence were annotated extensively when submitted to the database, long-term update and maintenance are challenges. New ESTs that may be important to understanding a genomic region of interest, for example, may have been entered into the dbEST database but are not represented in the original annotation. Annotation by end users is difficult because it requires multiple tools that use different formats and lack interoperability.
Genome Channel
Uberbacher also demonstrated the Genome Channel, a prototype graphical user tool for browsing and querying the annotated reference genome (http://compbio.ornl.gov/tools/channel/index.html). The Java interface relies on a number of underlying data resources, analysis tools, and data-retrieval agents to provide an up-to-date view of genomic sequences as well as computational and experimental annotation. Designed to be simple enough for a layperson, the channel also offers sophisticated capabilities for hypothesis testing. The system had about 6000 GRAIL-EXP and 4000 GENSCAN predicted human genes as of June.
Return to Top of Page
The electronic form of the newsletter may be cited in the following
style:
Human Genome Program, U.S. Department of Energy,
Human Genome News (v9n3).
|