Version 2.5.2.0 CRISP Logo CRISP Homepage Help for CRISP Email Us

Abstract

Grant Number: 5R01LM005770-08
Project Title: COMPUTATIONAL APPROACHES TO PROTEIN SEQUENCE ANALYSIS
PI Information:NameEmailTitle
STATES, DAVID J. dstates@umich.edu PROFESSOR

Abstract: DESCRIPTION (adapted from the Abstract): The large and growing databases of known protein sequences represent a knowledge base with the power to revolutionize biology, biochemistry, and biotechnology. These sequencing efforts have highlighted the growing gap between the sequence data and our ability to analyze this data. We are generally interested in answering specific questions about structure, function, and mechanisms. Much information can come from the identification of homologous proteins about which more is known. Identifying distant homologs is still difficult, even with the advent of new profile methods. Another powerful approach is to predict the tertiary structure. While progress is being made, we are still far from being able to reliably predict structures based on sequence data alone. Both of these techniques can be assisted by an analysis of the evolutionary record encoded in the sequences of available homologous proteins. We still do not have a good understanding of how to interpret this record, partially due to a lack of good models of the evolutionary process. Optimal score functions for the identification of distant homologies will be developed and analyzed, and the optimization techniques will be applied to the creation of optimal score functions for alignment of known homologs. Models of amino acid site substitutions will be used to create protein profiles that will allow the identification of further homologs and analogs. Optimization procedures will be developed for the identification of tertiary structures in proteins, including encoding the evolutionary patterns of sidechain conservation and variation. These techniques will be applied to the "inverse-felding" process, that is, identifying sequences that are likely to fold into a given structure. Simple models of the evolutionary process will be developed to examine how observed properties of proteins can be understood in an evolutionary context. These models will be elaborated to include the effect of population dynamics on the evolutionary process, as well as selective pressure resulting from the need for the protein to be functional. These models will be used to explore which protein properties are likely to be inherent, and to understand how much information can be derived for proteins based on information about known homologs.

Public Health Relevance:
This Public Health Relevance is not available.

Thesaurus Terms:
biochemical evolution, computer assisted sequence analysis, protein folding, protein sequence, protein structure
chemical model, computer simulation, conformation, molecular dynamics, peptide analog
statistics /biometry, thermodynamics

Institution: UNIVERSITY OF MICHIGAN AT ANN ARBOR
3003 SOUTH STATE STREET, Room 1040
ANN ARBOR, MI 481091274
Fiscal Year: 2002
Department: HUMAN GENETICS
Project Start: 01-APR-1995
Project End: 31-MAR-2005
ICD: NATIONAL LIBRARY OF MEDICINE
IRG: BLR


CRISP Homepage Help for CRISP Email Us