National Science Foundation


Award Abstract #0415175
Reconciling Semantic Heterogeneity by Leveraging Past Experience


NSF Org: IIS
Division of Information & Intelligent Systems
Initial Amendment Date: August 24, 2005
Latest Amendment Date: August 1, 2007
Award Number: 0415175
Award Instrument: Standard Grant
Program Manager: Sylvia J. Spengler
IIS Division of Information & Intelligent Systems
CSE Directorate for Computer & Information Science & Engineering
Start Date: September 1, 2005
Expires: August 31, 2009 (Estimated)
Awarded Amount to Date: $270,000
Investigator(s): Dan Suciu, suciu@cs.washington.edu (Principal Investigator)
Alon Halevy (Former Principal Investigator)
Sponsor: University of Washington
4333 Brooklyn Ave NE
SEATTLE, WA 98195 206/543-4043
NSF Program(s): INFORMATION & KNOWLEDGE MANAGE
Field Application(s): 0104000 Information Systems
Program Reference Code(s): HPCC,9218
Program Element Code(s): 6855

ABSTRACT

The goal of this research project is to develop methods for bridging semantic heterogeneity. Semantic heterogeneity arises when data must be shared among multiple data sources and applications that use different terminologies. For example, companies own large numbers of databases and need to coordinate among them in order to leverage their value. Similarly, large-scale scientific projects and coordination among government agencies require sharing data across multiple repositories. The approach consists of collecting a large number of schemas in a particular domain and attempting to learn the patterns, and the variations on those patterns, that database designers use in that domain. By leveraging such patterns, it is possible to match previously unseen database schemas in the domain. The techniques are validated by developing systems for matching disparate schemas and by applying the techniques to searching the growing number of web services available on the World Wide Web. One of the systems being built in this research is a search engine for web services that attempts to get at the underlying meaning of web-service operations; it will be available from the University of Washington (http://data.cs.washington.edu/schemaMatching/index.htm). The results of the project will provide a set of online services as well as public data sets that can be used by the research community. Possible direct applications of this research include biomedical informatics and deep-web search.
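As a rough illustration of the corpus-based idea described in the abstract, the following toy sketch learns which name tokens designers tend to use for each concept in a small corpus of previously matched schemas, then uses those statistics to match two schemas it has not seen. It is not the project's actual method: the corpus, the concept labels, the token-frequency scoring, and the function names (tokens, learn_patterns, best_concept, match_schemas) are all assumptions made for this example.

```python
import re
from collections import Counter, defaultdict

def tokens(name):
    """Split an attribute name such as 'phoneNumber' or 'zip_code' into lowercase tokens."""
    parts = re.findall(r"[A-Z]+(?![a-z])|[A-Za-z][a-z]*|\d+", name)
    return [p.lower() for p in parts]

def learn_patterns(corpus):
    """Count, per concept, the tokens used in past attribute names (hypothetical corpus format)."""
    profiles = defaultdict(Counter)
    for schema in corpus:
        for attr, concept in schema.items():
            profiles[concept].update(tokens(attr))
    return profiles

def best_concept(attr, profiles):
    """Return the concept whose token profile best covers attr, or None if no token is known."""
    if not profiles:
        return None
    toks = tokens(attr)
    scores = {
        concept: sum(counts[t] for t in toks) / sum(counts.values())
        for concept, counts in profiles.items()
    }
    concept = max(scores, key=scores.get)
    return concept if scores[concept] > 0 else None

def match_schemas(schema_a, schema_b, profiles):
    """Pair attributes of two unseen schemas that map to the same learned concept."""
    by_concept = {}
    for attr in schema_b:
        concept = best_concept(attr, profiles)
        if concept is not None:
            by_concept.setdefault(concept, attr)
    matches = {}
    for attr in schema_a:
        concept = best_concept(attr, profiles)
        if concept in by_concept:
            matches[attr] = by_concept[concept]
    return matches

# Made-up "past experience": schemas whose attributes were already labeled with concepts.
corpus = [
    {"cust_name": "name", "phone_no": "phone", "zip": "postal_code"},
    {"customerName": "name", "telephone": "phone", "postal_code": "postal_code"},
    {"full_name": "name", "phoneNumber": "phone", "zipCode": "postal_code"},
]
profiles = learn_patterns(corpus)

# Two previously unseen schemas from the same (made-up) domain.
result = match_schemas(
    ["client_name", "zip_code", "phone"],
    ["name", "telephone_number", "postal"],
    profiles,
)
print(result)
```

In this sketch, "zip_code" and "postal" share no tokens, yet they are paired because the corpus shows both vocabularies being used for the same concept. That is the sense in which past schemas can help match new ones; a realistic matcher would also draw on data types, instance values, and schema structure.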


PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

(Showing: 1 - 11 of 11).

Franklin, Michael, Halevy, Alon, Maier, David.  "From databases to dataspaces: a new abstraction for information management,"  SIGMOD Record,  v.34,  2005,  p. 27.

Halevy, Alon, Franklin, Mike, Maier, David.  "Principles of Dataspace Systems,"  PODS,  v.1,  2006,  p. 1.

Jayant Madhavan, Shawn Jeffery, Shirley Cohen, Xin Dong, Alon Halevy, David Ko, Cong Yu.  "Web-Scale Data Integration: You can Afford to Pay as You Go,"  CIDR,  2007.

Jayant Madhavan, Shirley Cohen, Xin Dong, Alon Halevy, Shawn Jeffery, David Ko, Cong Yu.  "Structured Data Meets the Web: A Few Observations,"  IEEE Data Eng. Bulletin,  v.29,  2006,  p. 19.

Jing Liu, Xin Dong, Alon Halevy.  "Answering Structured Queries on Unstructured Data,"  WebDB,  2006.

Michael Cafarella, Christopher Re, Dan Suciu, Oren Etzioni, Michele Banko.  "Structured Querying of Web Text Data: A Technical Challenge,"  Proceedings of the Conference on Innovative Data Systems Research,  v.1,  2007,  p. 1.

Michael Cafarella, Dan Suciu, Oren Etzioni.  "Navigating Extracted Data with Schema Discovery,"  Proceedings of the Tenth International Workshop on the Web and Databases,  2007.

Michael Cafarella, Edward Chang, Andrew Fikes, Alon Halevy, Wilson Hsieh, Alberto Lerner, Jayant Madhavan, S. Muthukrishnan.  "Data Management Projects at Google,"  SIGMOD Record,  v.37,  2008.

Michael Cafarella, Oren Etzioni, Dan Suciu.  "Structured Queries Over Web Text,"  IEEE Data Eng. Bulletin,  v.29,  2006,  p. 4.

Xin Dong, Alon Halevy.  "Indexing Dataspaces,"  SIGMOD,  2007.

Xin Dong, Alon Halevy, Cong Yu.  "Data Integration with Uncertainties,"  VLDB,  2007.



 

Please report errors in award information by writing to: awardsearch@nsf.gov.

 

 

Last Updated: April 2, 2007