Annex A: GEO Attribute Set
Contents
The GEO Profile requires clients and servers to support the GEO Attribute Set {ANSI-standard-Z39.50 3 9}, as specified in this Annex. For interoperability with Bib-1 and GILS clients, GEO servers must also recognize the Bib-1 {ANSI-standard-Z39.50 3 1} and GILS {ANSI-standard-Z39.50 3 5} Attribute Sets.
A.1. Attribute Types
The GEO Profile requires clients and servers to support the following Bib-1 Attribute Types:
- Use (1)
- Relation (2)
- Structure (4)
Optionally, a server may also support the Truncation (5) Attribute Type.
A.2. Use Attributes
The GEO Attribute Set inherits some Use Attributes from the Bib-1 and GILS Attribute Sets. Use Attributes whose values are between 1 and 1999 are from the Bib-1 Attribute Set, and Use Attributes between 2000 and 2999 are from the GILS Attribute Set. Additional GEO Use Attributes that cannot be mapped to Bib-1 or GILS are numbered from 3000 through 3999. With the exception of the Any and Anywhere Use Attributes, all Use Attributes map to an element in the GEO Schema (see Annex C). Not all Use Attributes are expected to be available for search. See Annex B for a list of the mandatory Use Attributes.
For convenience, the Use Attributes are tabulated as follows:
The tables below provide the name and value of each Use Attribute and its corresponding SGML element tag from the GEO Schema.
A.2.1 By Attribute Value
A.2.2. By Element Name
A.2.3. By SGML Tag
A.3. Relation Attributes
Relation attributes describe the relationship of the access point (left side of the relation) to the search term as qualified by the attributes (right side of the relation), e.g., Publication Date <= 1975. The Relation Attributes adopted by the GEO Profile are specified in the table below. They are divided into several classes of relations, based on the properties of the access point and search term. For example,
- Numeric: Less Than (1), Less Than or Equal (2), Equal (3), Greater Than or Equal (4), Greater Than (5), and Not Equal (6).
- Textual: Equal (3) and Not Equal (6).
- Spatial: Overlaps (7), Fully Enclosed Within (8), Encloses (9), Fully Outside Of (10), and Near (11).
- Temporal: Before (14), Before or During (15), During (16), During or After (17), and After (18).
Textual relations apply to all words within the search access point, including any subelements of the access point. For example, if the search term "water" is specified with a Use Attribute of Title (4), a Relation Attribute of Equal (3), and a Structure Attribute of Word (2), then all records containing the term "water" anywhere in the Title element, and any subordinate elements of Title, should match the query term.
Value |
Name |
Semantics |
---|---|---|
1 | Less Than | The access point value is less than the search term value. |
2 | Less Than or Equal | The access point value is less than or equal to the search term value. |
3 | Equal | The access point value is equal to the search term value (subject to possible qualification by the Truncation and Structure Attributes). |
4 | Greater Than or Equal | The access point value is greater than or equal to the search term value. |
5 | Greater Than | The access point value is greater than the search term value. |
6 | Not Equal | The access point value is not equal to the search term value (subject to possible qualification by the Truncation and Structure Attributes). |
7 | Overlaps | The access point region has a geometric area in common with the search term region. Given a search term region of S and access point region of T, the following algebra expresses the conditions required: {S(North) >= T(South)} and {S(South) <= T(North)} and {S(East) >= T(West)} and {S(West) <= T(East)}. |
8 | Fully Enclosed Within | The access point region is fully enclosed within the search term region. |
9 | Encloses | The access point region fully encloses the search term region. |
10 | Fully Outside Of | The access point region has no geometric area in common with the search term region. |
11 | Near | The access point region falls within a default distance of the search term region. The default distance is defined by the server. |
12 | Members Contain | The access point element or one of its subordinate elements is equal to the search term value (subject to possible qualification by the Truncation and Structure Attributes). |
13 | Members Not Contain | The access point element and all of its subordinate elements are not equal to the search term value (subject to possible qualification by the Truncation and Structure Attributes). |
14 | Before | The access point date (or date range) is before the search term date (or date range). |
15 | Before or During | The access point date (or date range) is before or during the search term date (or date range). |
16 | During | The access point date (or date range) is during the search term date (or date range). |
17 | During or After | The access point date (or date range) is during or after the search term date (or date range). |
18 |
After | The access point date (or date range) is after the search term date (or date range). |
A.4. Structure Attributes
The Structure Attribute specifies the type of search term (e.g., a single word, a phrase, several words to be treated as multiple single terms, etc.). The table below outlines the Structure Attributes used by GEO clients and servers.
Value |
Name |
Semantics |
---|---|---|
1 | Phrase | A Phrase consists of one or more groups of characters separated by whitespace (for example, ASCII hex "20"). The value to be searched is exactly as it appears in the search term with respect to order and adjacency. Word(s) in the phrase may be explicitly truncated with a Truncation Attribute. |
2 | Word | A Word consists of a group of non-whitespace characters. It specifies the exact text of the value to be searched, unless the word is explicitly truncated with a Truncation Attribute. A word search term contains no whitespace. |
6 | Word List | A Word List consists of one or more words separated by whitespace (for example, ASCII hex "20"). No order of the words is implied. The attributes (other than structure) that are associated with the search term apply to each word in the Word List. Any words in a Word List may be explicitly truncated. The relationship between the words in a word list is server-specific. |
103 | Always Matches | The Always Matches Structure Attribute indicates that the server is to ignore the supplied search term. If the Use Attribute is Any or Anywhere, then all records are to be selected. If a Use Attribute other than Any or Anywhere is supplied, all records are selected for which the field corresponding to the supplied Use Attribute is meaningful. For example: if the Use Attribute is Title, all records that have a title field are selected. |
104 | URx | A URx is a document identifier, for example, an identifier extracted from a Z39.50 URL. |
109 | Numeric String | A Numeric String is a character string that represents a number. |
201 | Coordinate String | A Coordinate String is a character string containing an ordered list of coordinates as y (latitude) and x (longitude) pairs expressed with a space or comma delimiter between the y and x and a space between the pairs, as so: y,x y,x y,x ... or 45.003,-102.32 46.007,-103.45 46.141,-103.79... If a Coordinate String is used to define an enclosing region, the terminal pair shall be encoded with the same values as the initial pair to define full enclosure. A Coordinate String with only two pairs entered shall represent a bounding rectangle where the northwest and southeast corners of a described area, using a system orthogonal to the axes of latitude and longitude. The Use Attribute Bounding Coordinates (2060) can thus be queried as a shortcut to querying the four properties that it references (i.e. North, West, South, and East Bounding Coordinates). A search region defined by the area of 23 degrees North to 5 degrees South and 70 degrees West to 10 degrees East could be expressed as either "23 -70 -5 10" or "23,-70 23,10 -5,10, -5,-70 23,-70". |
204 | Composite | A Composite element is a grouping element that contains multiple searchable elements. In an object-oriented view, it represents a parent element that contains valued child elements. A GEO server may support the query of compound elements by passing the query as text to all appropriate child elements it contains. This allows search at higher levels of abstraction than a single Use Attribute but at something less than the entire document. |
210 | Date String | A Date String is a character string that represents a single date using CCYYMMDD format, or a beginning and ending date range using CCYYMMDD/CCYYMMDD format, where CC, YY, MM, DD are the two-digit representation of the century, year, month, and day, respectively. The month (MM) and day (DD) are optional. |
A.5. Truncation Attributes
A Truncation Attribute specifies whether one or more characters may be omitted in matching the search term. If a word in a search term is right truncated, the word is treated both as a complete word and as the beginning of a longer word. Right truncation may also be indicated using the asterisk character "*" at the end of a word in a query term.
Right Truncation is only applicable to textual query terms, that is, query terms whose Structure Attribute is Phrase (1), Word (2), or Word List (6). The table below specifies the Truncation Attributes recommended for use by GEO clients and servers.
Value |
Name |
Semantics |
---|---|---|
1 | Right Truncation | In combination with Word or Phrase, the last word of the term is right truncated. In combination with Word List, each word in the word list is right truncated. |
100 | Do Not Truncate | No truncation is to be applied. |
Top of Annex A |