GLOSSARY OF XML-STANDARDS-RELATED TERMS
Cocoon | A
web development framework built around the concepts of separation of
concerns and component-based web development, focusing on XML and XSLT
publishing and written in Java. http://cocoon.apache.org/ |
Dublin Core | A simple and standardized
set of fields for describing objects. Dublin Core is widely used to
describe digital materials such as video, sound,
image, text and composite media like Web pages. http://dublincore.org/ |
EAD | Encoded Archival Description: A nonproprietary standard structure for
machine-readable finding aids such as inventories, registers, indexes,
and other documents created by archives and libraries, to support the
indexing and display of extensive and interrelated (often hierarchical)
holdings. http://www.loc.gov/ead/ |
JHOVE | JSTOR/Harvard Object
Validation Environment: An extensible framework for digital object
validation. Documents (in formats such as AIFF, ASCII,
Bytestream, GIF, HTML, JPEG, JPEG2000, PDF, TIFF, UTF-8, WAV, and XML)
are analyzed to determine whether they are well-formed (consistent with the basic
requirements of the format) and valid (internally consistent). http://hul.harvard.edu/jhove/ |
JPEG2000 | An image compression
standard that improves on the original JPEG standard. In addition
to a better compression algorithm, it allows
more sophisticated progressive downloads and display of multiple sizes
or portions of an image without the need for additional derivative copies. http://www.jpeg.org/jpeg2000/ |
LCP | Library of Congress
Presents: Music, Theater and Dance. A Web-accessible performing arts
digital library designed using XML and open source tools
as well as emerging standards such as METS and MODS. http://www.loc.gov/rr/perform/ihas/ |
Lucene | An open source search
engine originally implemented in Java and supported by the Apache Software
Foundation. Lucene has been ported to Perl, C#,
C++, Python, Ruby, and PHP. Lucene's API is agnostic of file format,
so text from any file can be indexed as long as its textual information
can be extracted. http://lucene.apache.org/ |
MADS | Metadata Authority
Description Schema: An authority element set that may be used to provide
metadata about agents (people, organizations),
events, and terms (topics, geography, genres, etc.). MADS is a subset
of MARC 21 and was created to serve as a companion to MODS. http://www.loc.gov/standards/mads/ |
METS | Metadata Encoding and Transmission Standard: An XML standard for encoding
descriptive, administrative, and structural metadata regarding objects
within a digital library. http://www.loc.gov/standards/mets/ |
MIX | Metadata for Images in XML: A schema for technical data elements required
to manage digital image collections. http://www.loc.gov/standards/mix/ |
MODS | Metadata Object Description Schema: An XML structure derived from MARC 21
that uses English-like element names instead of numeric tags to
describe objects. Not all MARC 21 tags are translated into MODS. It was
designed as a compromise between the complexity of MARC and the simplicity
of Dublin Core metadata. http://www.loc.gov/standards/mods/ |
MySQL | An open source database
similar to Oracle that uses SQL (Structured Query Language; see
the SQL entry below) to retrieve records. MySQL is used by
Google for its AdWords program. http://www.mysql.com/ |
OCR | Optical Character Recognition. Software designed to read and translate scanned images of text into machine-editable text for searching and display. |
PREMIS | Preservation Metadata Implementation Strategies: A data dictionary
to define an implementable set of core preservation elements for digital
preservation repositories. http://www.loc.gov/standards/premis/ |
SQL | Structured Query
Language: A language, often pronounced "sequel," for
querying relational databases and for retrieving and formatting the information they hold
(such as the Voyager Oracle database). http://www.sql.org/ |
SRU | Search/Retrieve via URL: A standard search protocol for Internet search
queries. It uses CQL (Contextual Query Language, originally named Common Query
Language), a standard syntax for expressing search queries. CQL is often used to
query library catalogs via Z39.50 gateways. |
VHP | Veterans History Project.
A digital library Web site that, like LCP, is built using the XML/XSLT/Cocoon framework.
VHP contains collections from veterans and
civilians documenting wartime service. It showcases interviews
with veterans along with digitized collections of their letters, diaries,
artwork, photos, etc. http://www.loc.gov/vets/ |
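
As a rough illustration of the SRU entry above, the sketch below builds a searchRetrieve URL that carries a CQL query. The endpoint `http://sru.example.org/catalog`, the helper name `build_sru_url`, and the sample query are assumptions made for illustration only; the parameter names (`operation`, `version`, `query`, `maximumRecords`) follow SRU 1.1, but a real server may expect different versions or indexes.

```python
from urllib.parse import urlencode, urlsplit, parse_qs


def build_sru_url(base_url, cql_query, maximum_records=10):
    """Build an SRU searchRetrieve URL (hypothetical helper, not a library API)."""
    params = {
        "operation": "searchRetrieve",   # the SRU search operation
        "version": "1.1",                # assumed protocol version
        "query": cql_query,              # the CQL query string
        "maximumRecords": str(maximum_records),
    }
    # urlencode percent-encodes the CQL query so it is safe inside a URL.
    return base_url + "?" + urlencode(params)


# Hypothetical endpoint; a CQL query takes the form: index relation term.
url = build_sru_url("http://sru.example.org/catalog", 'dc.title = "civil war"')
print(url)

# Decoding the URL again recovers the original CQL query unchanged.
decoded = parse_qs(urlsplit(url).query)
print(decoded["query"][0])
```

The point of the sketch is that the whole search request, including the CQL query, travels in the URL itself, which is what lets SRU work over plain HTTP.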
Webcast: Using <METS> and <MODS> to Create XML Standards-Based Digital Library Applications
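
To make the relationship between the METS and MODS entries above more concrete, here is a minimal sketch of a METS document that wraps a MODS descriptive record. The function name `build_mets_with_mods`, the identifier `DMD1`, and the sample title are hypothetical, and the output is not validated against either schema; it only shows the general shape of one standard embedded in the other.

```python
import xml.etree.ElementTree as ET

# Namespace URIs published by the Library of Congress for METS and MODS.
METS_NS = "http://www.loc.gov/METS/"
MODS_NS = "http://www.loc.gov/mods/v3"

# Use readable prefixes when the tree is serialized.
ET.register_namespace("mets", METS_NS)
ET.register_namespace("mods", MODS_NS)


def build_mets_with_mods(title):
    """Return a METS element whose descriptive metadata section holds a MODS record."""
    mets = ET.Element(f"{{{METS_NS}}}mets")

    # Descriptive metadata section: METS wraps the MODS record in mdWrap/xmlData.
    dmd = ET.SubElement(mets, f"{{{METS_NS}}}dmdSec", {"ID": "DMD1"})
    wrap = ET.SubElement(dmd, f"{{{METS_NS}}}mdWrap", {"MDTYPE": "MODS"})
    xml_data = ET.SubElement(wrap, f"{{{METS_NS}}}xmlData")

    # The MODS record uses English-like element names such as titleInfo/title.
    mods = ET.SubElement(xml_data, f"{{{MODS_NS}}}mods")
    title_info = ET.SubElement(mods, f"{{{MODS_NS}}}titleInfo")
    ET.SubElement(title_info, f"{{{MODS_NS}}}title").text = title

    # Structural map: a single division pointing back at the descriptive metadata.
    struct_map = ET.SubElement(mets, f"{{{METS_NS}}}structMap")
    ET.SubElement(struct_map, f"{{{METS_NS}}}div", {"DMDID": "DMD1"})
    return mets


mets_doc = build_mets_with_mods("Sample title")
print(ET.tostring(mets_doc, encoding="unicode"))
```

Keeping the MODS record inside `mdWrap` reflects the division of labor described in the glossary: METS carries the structure of the digital object, while MODS supplies the description.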