Version 2.5.2.0 CRISP Logo CRISP Homepage Help for CRISP Email Us

Abstract

Grant Number: 1R21LM008309-01A1
Project Title: Analysis and Annotation Pipeline for Functional Genomics
PI Information:NameEmailTitle
OCHS, MICHAEL F. mfo@jhu.edu ASSOCIATE PROFESSOR

Abstract: DESCRIPTION: This grant proposes the development of an extendable, scalable automated data analysis pipeline for functional genomics data. Functional genomics, including microarrays and proteomics, is evolving quickly, with data sets increasing rapidly in size and new analysis methodologies appearing monthly. Because there are no de facto standards for addressing typical experimental questions, the application of multiple analyses is desirable, but rarely performed due to the effort required. Furthermore, the analysis of functional genomics data is generally a multi-step process, with many possible methods in use at each step (e.g., for image analysis, data normalization, statistical analysis, data mining), leading to a combinatorial explosion of effort when using multiple analyses. The functional genomics data pipeline proposed in this application will provide the ability to automatically perform multiple analyses, will provide easy extendibility for adding new functions and data types, will provide a distributed computing environment to provide adequate computational power, and will integrate automated annotation to allow analyses to be guided by biological knowledge. The system will utilize Enterprise Java Beans to provide a robust server architecture, Java server pages for dynamic generation of web interfaces, and object oriented design patterns to optimize the software architecture. The system will be extendable during operation through use of the Strategy design pattern coupled to the Java reflection mechanism. Functional genomics data sets will be encapsulated within data objects that include links to the NCI caBIO objects to utilize the NCI Center for Bioinformatics data resources. In addition, annotations will be retrievable from web sites and through the Distributed Annotation System. Documentation and testing will proceed in parallel with development, and will integrate end users during design and deployment to tune the user interface. The final system will provide dramatic improvements in researchers' abilities to fully explore their growing data sets and to interpret their experimental results in light of the larger biological knowledge bases. It will be fully supported and released to the community open source.

Public Health Relevance:
This Public Health Relevance is not available.

Thesaurus Terms:
automated data processing, computational biology, computer data analysis, computer system design /evaluation, functional /structural genomics
Internet, data management, genetic mapping, image processing, information dissemination, information system, statistics /biometry
biotechnology, clinical research, human data

Institution: FOX CHASE CANCER CENTER
333 Cottman Avenue
PHILADELPHIA, PA 191112434
Fiscal Year: 2005
Department:
Project Start: 01-FEB-2005
Project End: 31-JAN-2007
ICD: NATIONAL LIBRARY OF MEDICINE
IRG: BLR


CRISP Homepage Help for CRISP Email Us