Semantic Technologies for Knowledge Synthesis

Knowledge Synthesis

NISAC analysts and modelers require vast amounts of data and information for their critical infrastructure interdependency work. The knowledge synthesis effort provides an access-controlled, organized, and searchable view on the data for rapid retrieval and update through a suite of Web-based tools and programmatic interfaces. Four semantic technology projects are underway to help improve information access:

Ontology Development

An ontology is a map of the key terms in a domain of knowledge and the relationships among the terms. It forms both a vocabulary and a cognitive framework for reasoning about the domain. By structuring domain knowledge and making relationships both within and among subject areas explicit, NISAC analysts are able to categorize and access information more efficiently.Diagram of semantic ontology of hazardsAdditionally, the ontology is flexible enough to be adjusted as the understanding of the domain evolves. High level structural components of the current ontology for the critical infrastructure protection domain include hazards, critical assets, geographical location, consequences, and organizations.

Semantic Navigation

The NISAC Critical Infrastructure Protection Knowledge Management (CIP KM) Portal is a one-stop shop where NISAC analysts, modelers, and administrators can contribute and share information. Semantic navigation Image of sematic navigation elements in Critical Infrastructure Protection categoriesallows the Knowledge Management Portal to be searched in terms of concepts and relationships, rather than by keyword only. This approach overcomes the semantic ambiguity and unrecognized synonym issues associated with keyword search. As a result, analysts can navigate the portal at the “speed of click” using categories drawn from the domain of critical infrastructure protection.

 

Keyword Expansion

A step along the way to full synonym resolution is the expansion of keywords used to search the CIP KM Portal. Each keyword is expanded to include synonyms related to the concept being searched, in priority order. For example, a search on the term “bird flu” brings back documents containing that phrase and also those containing “avian influenza” and “H5N1.” The synonyms are drawn from those commonly used by critical infrastructure protection analysts, as well as a more general set of synonyms taken from a public domain thesaurus.

Automatic Semantic Metadata Determination

As new documents are added to the CIP KM Portal, they must be tagged under metadata categories defined within the ontology in order to be available for semantic navigation. Diagram of automatic semantic metadata determinationThis is a tedious and error-prone process when carried out manually. A system is being developed to automatically recommend the appropriate metadata categories using a text analysis tool. However, the person submitting the information will review the recommendations and make the final decision. The benefits of automatic semantic metadata determination include:

An extension of this application is being designed to target information on the Internet, making a tremendous amount of other data resources available to users of the CIP KM Portal.