GLOSSARY OF XML-STANDARDS-RELATED TERMS

Cocoon A web development framework built around the concepts of separation of concerns and component-based web development, focusing on XML and XSLT publishing and written in Java.
http://cocoon.apache.org/
Dublin Core A simple and standardized set of fields for describing objects. Dublin Core is widely used to describe digital materials such as video, sound, image, text and composite media like Web pages.
http://dublincore.org/
EAD Encoded Archival Description: A nonproprietary standard structure for machine-readable finding aids such as inventories, registers, indexes, and other documents created by archives and libraries, to support the indexing and display of extensive and interrelated (often hierarchical) holdings.
http://www.loc.gov/ead/
JHOVE JSTOR/Harvard Object Validation Environment: An extensible framework for digital object validation. Documents (in formats such as AIFF, ASCII, Bytestream, GIF, HTML, JPEG, JPEG2000, PDF, TIFF, UTF-8, WAV, and XML) are analyzed and checked for being well-formed (consistent with the basic requirements of the format) and valid (internally consistent).
http://hul.harvard.edu/jhove/
JPEG2000 An image compression standard which improves on the JPEG standard for compression. In addition to a better compression algorithm, it allows more sophisticated progressive downloads and display of multiple sizes or portions of an image without the need for additional derivative copies.
http://www.jpeg.org/jpeg2000/
LCP Library of Congress Presents: Music, Theater and Dance. A Web-accessible performing arts digital library designed using XML and open source tools as well as merging standards such as METS and MODS.
http://www.loc.gov/rr/perform/ihas/
Lucene An open source search engine originally implemented in Java and supported by the Apache Software Foundation. Lucene has been ported to Perl, C#, C++, Python, Ruby, and PHP. Lucene's API is agnostic of file format, so text from any file can be indexed as long as its textual information can be extracted.
http://lucene.apache.org/
MADS Metadata Authority Description Schema: An authority element set that may be used to provide metadata about agents (people, organizations), events, and terms (topics, geography, genres, etc.). MADS is a subset of MARC 21 and was created to serve as a companion to MODS.
http://www.loc.gov/standards/mads/
METS Metadata Encoding and Transmission Standard: An XML standard for encoding descriptive, administrative, and structural metadata regarding objects within a digital library.
http://www.loc.gov/standards/mets/
MIX

Metadata for Images in XML: A schema for technical data elements required to manage digital image collections. http://www.loc.gov/standards/mix/
The NISO Draft Standard Data Dictionary can be found at:
http://www.niso.org/standards/resources/Z39_87_trial_use.pdf

MODS Metadata Object Description Schema: An XML structure derived from MARC21 that uses English-like elements instead of tag numbers as elements to describe objects. Not all MARC21 tags are translated into MODS. It was designed as a compromise between the complexity of MARC and the simplicity of Dublin Core metadata.
http://www.loc.gov/standards/mods/
MySQL An open source database similar to Oracle which uses SQL (Standard Query Language [see below] ) to retrieve records. My SQL is used by Google for its AdWords program.
http://www.mysql.com/
OCR Optical Character Recognition. Software designed to read and translate scanned images of text into machine-editable text for searching and display.
PREMIS Preservation Metadata Implementation Strategies: A data dictionary to define an implementable set of core preservation elements for digital preservation repositories.
http://www.loc.gov/standards/premis/
SQL Structured Query Language: Pronounced "sequel," SQL is a language for talking to, retrieving and formatting information from relational databases (such as the Voyager Oracle database).
http://www.sql.org/
SRU

Search Retrieval via URL: A standard search protocol for Internet search queries. It utilizes CQL (Common Query Language), a standard query syntax for expressing search queries. CQL is often used to query library catalogs via z3950 gateways.
http://www.loc.gov/standards/sru/

VHP Veterans History Project. Another digital library Web site built using the XML/XSLT/Cocoon framework. VHP contains collections of veterans and civilians documenting wartime and/or service. It showcases interviews with veterans along with digitized collections of their letters, diaries, artwork, photos, etc.
http://www.loc.gov/vets/

Webcast: Using <METS> and <MODS> to Create
XML Standards-Based Digital Library Applications