Version 2.5.2.0 CRISP Logo CRISP Homepage Help for CRISP Email Us

Abstract

Grant Number: 5R01LM006916-03
Project Title: BIOINFORMATICS, DISORDERED PROTEINS, AND FUNCTION
PI Information:NameEmailTitle
DUNKER, A. KEITH. kedunker@iupui.edu PROFESSOR

Abstract: A major objective of bioinformatics is to predict protein structure/function from sequence; the most successful current methods use sequence homology. These current homology-based methods explicitly or implicitly assume the paradigm sequence yields structure yields function. Since protein function depends on flexibility, movement, or even lack of structure (disorder) in some cases, combining various motion predictions with structure predictions should improve prediction of function. Algorithms for predicting flexibility from sequence and local disorder have been developed, as have algorithms to predict sequences that snitch between two structured states. Sequence complexity might also be an indirect measure of mobility and still other measures could be discovered by the study of known examples. We propose to determine the interplay of homology-based predictions that use sequence information only with explicit representations of both structure information and motion information as determined from amino acid sequence. Structural information will be represented by helix, sheet, etc., predictions, and by hydrophobic moment calculations. Motion information will be represented by flexibility prediction, switch sequence prediction, order / disorder prediction, sequence complexity, and new measures if any are discovered in the course of this work. Since different functions would involve motional parameters to different extents, the plan is to apply this combined approach on a sequence family basis. Novel comparisons, called Attribute Profiles, are proposed for the representation of structure and motion information. Sets of Gribskov/Eisenberg 1D Profiles and associated Attribute Profiles will be combined into single predictions using neural network data models. Prediction outcomes will be compared with experimental observations and evaluated using the jackknife method. If successful, this work will improve prediction of protein function from amino acid sequence, which is important given the significant numbers of protein sequences with undetermined functions coming out of the various genome projects. Motion and disorder are important pieces that need to be added to the characterization of protein structure/function. For example, taxol has been shown recently to bind to a disordered loop in Bcl-2. Alzheimer disease, transmissible spongiform encephalopathies, Parkinson disease and infectious agents such as Staphylococcal aureus, foot-and-mouth disease virus, and (perhaps) HIV depend critically on disordered regions of protein. Thus, understanding the interplay of sequence, flexibility, order / disorder, complexity, structure and function clearly relates to human health.

Public Health Relevance:
This Public Health Relevance is not available.

Thesaurus Terms:
informatics, protein sequence, protein structure function
artificial intelligence, computer assisted sequence analysis, computer system design /evaluation, conformation, hydropathy, structural biology

Institution: WASHINGTON STATE UNIVERSITY
423 Neill Hall
PULLMAN, WA 99164
Fiscal Year: 2002
Department: SCHOOL OF MOLECULAR BIOSCIENCES
Project Start: 01-MAY-2000
Project End: 30-APR-2003
ICD: NATIONAL LIBRARY OF MEDICINE
IRG: BLR


CRISP Homepage Help for CRISP Email Us