Version 2.5.2.0 CRISP Logo CRISP Homepage Help for CRISP Email Us

Abstract

Grant Number: 5K22LM009105-02
Project Title: Protecting Genetic Privacy through Risk Assessment
PI Information:NameEmailTitle
LIN, ZHEN zhen.lin@unc.edu

Abstract: DESCRIPTION (provided by applicant): Free access to research data is vital to promote scientific discovery. However, privacy concerns revolve around publicly sharing biomedical data. Sharing such data puts at risk the identity and health information of individuals who have volunteered to anonymously release their information for medical research. One type of genomic sequence data that are generated rapidly by high-throughput methods is single nucleotide polymorphisms (SNPs). SNPs merit tremendous research attentions. Free exchange of these personal genotypes also poses difficult challenges for protecting privacy and information security. To deal with the challenges, I propose an investigation to acquire an accurate assessment of the privacy risk assumed by research subjects whose SNPs are disseminated in public biomedical databases. This knowledge will provide database privacy officers and policy makers the information that they need in protecting privacy of research subjects. In particular, I will develop methods to examine linkage disequilibrium (LD) patterns among SNPs throughout the genome, and I will compile a "risk map" detailing the genomic locations most likely to threaten privacy. Because of LD, a small set of tag SNPs can capture the majority of SNP information content in the genome. They are thus valuable tools in genetics to reduce the effort necessary to map genes to diseases and phenotypes. Only the tag SNPs, rather than the entire gnome, needs to be examined. However, because of that very attribute, they are also the high-risk ones that would lead to individual identifications. Therefore, it is important to study the relationship between tag SNPs and privacy. I have previously developed methods to find tag SNPs with good performance. I propose improving these tagging methods as well as developing new ones to compile a comprehensive list of tag SNPs in the human genome. I will evaluate the ability of tag SNPs in disclosing individuals. I have also previously established an initial probabilistic model for the risk assessment. I propose to further develop a knowledgebase of tag SNPs with their locations and frequencies, and an automatic risk assessment tool that utilizes the probabilistic risk assessment model and the tag SNP knowledgebase. I will evaluate the usability and functionality of the risk assessment tool by applying to existing public genomic databases. I will also make the resulting methods available on the web for real-time tag SNP detection and risk assessment and will distribute the tools and software for open source development.

Public Health Relevance:
This Public Health Relevance is not available.

Thesaurus Terms:

There are no thesaurus terms on file for this project.

Institution: UNIVERSITY OF NORTH CAROLINA CHAPEL HILL
Office of Sponsored Research
CHAPEL HILL, NC 27599
Fiscal Year: 2007
Department: NONE
Project Start: 15-SEP-2006
Project End: 14-SEP-2009
ICD: NATIONAL LIBRARY OF MEDICINE
IRG: ZLM1


CRISP Homepage Help for CRISP Email Us