Library of Congress >> MARC >> Authority >> 01X-09X >> 066

066 - Character Sets Present (NR)


MARC 21 Authority - Full
October 1999
Link disclaimer
First Indicator
Undefined
# - Undefined

Second Indicator
Undefined
# - Undefined


Subfield Codes
$a - Primary G0 character set (NR)
$b - Primary G1 character set (NR)
$c - Alternate G0 or G1 character set (R)

FIELD DEFINITION AND SCOPE

Used in records encoded with characters from sets other than ISO 10646 (or Unicode) to specify the character sets for data content that are present in the record. The field alerts users that special processing may be required. A detailed description of the standard escape sequences used in MARC records is provided in MARC 21 Specifications for Record Structure, Character Sets and Exchange Media.

Codes for identifying character sets are all but the first character of the escape sequences that designate the sets (the first character is the escape character, hex 1B).


GUIDELINES FOR APPLYING CONTENT DESIGNATORS

INDICATORS

SUBFIELD CODES

$a - Primary G0 character set
Code is composed of the Intermediate and Final characters of the escape sequence that designates and invokes the default G0 character set.
Since MARC Latin (including ASCII, MARC Greek, MARC subscript, or MARC superscript) is the MARC default set, if it is the primary set, it does not need to be identified in this subfield.
066 ##$a(N
[The Intermediate character in the designation sequence is hex 28 (ASCII graphic "(" opening parenthesis) that identifies the character set as one byte per character and its use as a G0 set, and the Final character is hex 4E (ASCII graphic "N") that identifies the Basic Cyrillic set.]
066 ##$a$1
[The Intermediate character in the designation sequence is hex 24 (ASCII graphic "$") that identifies the character set as multiple bytes per character and its use as a G0 set, and the Final character is hex 31 (ASCII graphic "1") that identifies the Chinese, Japanese, Korean character set.]
$b - Primary G1 character set
Code is composed of the Intermediate and Final characters of the escape sequence that designates and invokes the default G1 character set.
Since ANSEL is the MARC default set if it is the primary extended set it does not need to be identified in this subfield.
066 ##$b$)1
[The Intermediate characters in the designation sequence are hex 24 hex 29 (ASCII graphics "$)") that identify the character set as multiple bytes per character and its use as a G1 set, and the Final character is hex 31 (ASCII graphic "1") that identifies the East Asian character code for bibliographic use (ANSI/NISO Z39.64).]
066 ##$b)Q
[The Intermediate character in the designation sequence is hex 29 (ASCII graphic ")") that identifies the character set as one byte per character and its use as a G1 set, and the Final character is hex 51 (ASCII graphic "Q") that identifies the Extended Cyrillic character set.]
$c - Alternate G0 or G1 character set
Code is composed of the Intermediate and Final characters of each escape sequence that will be used to designate an alternate graphic character set used in the record.
Intermediate character(s) indicate whether the set is single or multibyte and whether it will be designated as a G0 or G1 set. The subfield is repeated for each additional character set present.
066 ##$c)2
[The Intermediate character in the designation sequence is hex 29 (ASCII graphic ")" ) that identifies the character set as the G1 set and one byte per character, and the Final character is hex 32 (ASCII graphic "2") that identifies the Hebrew character set.]

(03/03/2008) Contact Us