PROPOSAL NO: 97-5

DATE: December 15, 1996
REVISED:

NAME: Changes to the USMARC Classification Format for Multilingual Classification Schemes

SOURCE: Decimal Classification Division, Library of Congress

SUMMARY: This proposal suggests changes in the USMARC classification format to accommodate multilingual editions of a classification scheme. Several additions are suggested: 1) add subfield $n for a textual note in field 084 (Classification scheme and edition) to allow for the ability to specify a source edition which may vary within a translation; 2) allow for a new subfield for the specification of whether the edition is authorized or unauthorized in a new subfield $f; 3) addition of field 686 for Relationship to Source Note to show the variations in different editions and how they relate to the source edition.

KEYWORDS: Field 084 (Classification); Field 686 (Classification); Classification Scheme and Edition; Relationship to Source Note

RELATED:

STATUS/COMMENTS:

12/15/96 - Forwarded to USMARC Advisory Group for discussion at the 1997 Midwinter MARBI meetings.

2/17/97 - Results of USMARC Advisory Group discussion - Approved.

It was suggested that a code be used in subfield $f Authorization (e.g. "u" for unauthorized). Some concern was expressed that the information in field 084 is repeated in every record, but it was pointed out that records from different translations could be mixed in the same database.

2/26/97 - Results of final LC review - Approved.


PROPOSAL NO. 97-5: Changes to the USMARC Classification Format 
for Multilingual Classification Schemes

1.       BACKGROUND

         The Dewey Decimal Classification is used by over 200,000
libraries in 135 countries and has been translated into over 30
languages.  The current edition, Edition 20, has been translated
into Italian, Spanish, and Turkish.  These translations are very
close in structure and content to the English-language standard
edition with minor cultural adaptations. There is also an
intermediate French edition based on Abridged Edition 12 with
excerpts from Edition 20.  Abridged Edition 12 has been translated
or is in the process of being translated into Arabic, French,
Greek, Hebrew, Italian, and Persian. Work is already underway on a
translation of Edition 21 into Russian.  It is expected that
translations of Edition 21 will also appear in Arabic, Chinese,
French, Italian, and Turkish, plus excerpts (the major revisions)
in Spanish.  

          Classification schemes are unique among authority control
systems in that they may retain the same controlled vocabulary
(notation) and meaning when translated into other languages or
linked with other thesauri.  Over a decade ago,  Elaine Svenonius
(1983) noted Dewey's  potential  as a switching language in
multilingual databases.  In order for this potential to be
realized, there must be explicit links between the English-language
standard editions of Dewey and each translation, and documentation
on the nature of those links.  It is likely that the USMARC Format
for Classification Data will be used for the development of
explicit links between different translations.  

         An IFLA working group has reviewed the MARC format for
adequacy for international classification systems and compatibility
with UNIMARC, and made preliminary recommendations for extensions
to the MARC format.  A paper will be presented a later MARBI
meeting addressing changes needed for accommodating the Universal
Decimal Classification (UDC).  This paper discusses additional data
elements needed to link records from a translation to a standard
edition.  
(http://www.nlc-bnc.ca/ifla/VII/s29/projects/rep0796.htm)


2.       DISCUSSION

Ability to specify source edition.  There is a need to specify the
source edition for a translation.
Source edition may vary within a translation; for example, the new
Spanish edition is a translation of Edition 20, but contains parts
of Edition 21 such as the revised area table for the former Soviet
Union and the Table 6 expansions for North and South American
native languages.

Field 084 is used for Classification Scheme and Edition and is
defined as follows:

Indicators
   First          Type of edition
   0                Full
   1                Abridged
   8                Other


   Second         Undefined
   #              Undefined

   Subfield Codes
   $a    Classification scheme code (from USMARC Code List for
Relators, Sources, Description)
   $b    Edition title (NR)
   $c    Edition identifier (NR)
   $e    Language code (NR)

In the case of the Spanish translation mentioned above, it is
important to note the source on which it was based and the
variations incorporated.  A new subfield $n for a variations note
could be used to show the variation.  This information would appear
in every classification record that is part of this edition.  In
addition, a new subfield $d could be added for Source edition
identifier.  

Examples:
084 8             $addc $bSistema de Clasificación Decimal $c20 $d21 $n
                  contains parts of edition 21 in revised area table for
                  former Soviet Union and Table 6 expansions for North and
                  South American native languages

084 8             $addc $bClassification décimale de Dewey $cintermédiaire
                  $d12 $n based on Abridged Edition 12 with extensions from
                  DDC 20
 

Authorized/unauthorized.  It is also necessary to specify whether
a translation is authorized or unauthorized.  This information is
useful from the perspective of quality (the assistance the
translators' received in preparing the translation) and to
distinguish it from an authorized translation  in the same
language.  This information could be added as a new subfield for
authorization in field 084.
Examples:
084 8             $addc $bSistema de Clasificación Decimal $c20 
$funauthorized

Relationship of number to source edition number.
Another change needed in the format is the ability to show the
relationship of the number to the source edition.  There are three
types of relationships possible between the number and the source
edition: expansion (link to base number); authorized option (link
to option); adaptation.    Even if based on a standard edition,
there may be variations in the meanings of some numbers or
expansions.  For example, Table 2 in Edition 20 contains area
notation 4541 for the province of Bologna.  In the Italian edition,
area notation 4541 has a 26-number expansion for the parts of the
province of Bologna.  It would be useful to know that this is an
expansion, and the base number in the standard edition on which the
expansion is based.

Each edition of the DDC contains options to address cultural
differences or to provide a method for emphasizing topics of local
importance.  If a scheme employs a standard option, it would be
useful to have a link to that instruction or number in the standard
edition.

Sometimes, translators adapt parts of the Classification to meet
local needs.  For example, the religion schedule in the Persian
edition reflects the needs of an Islamic majority.  It would be
useful to document an adaptation as such while retaining the link
to the source edition.

Because there is a need to provide a linking mechanism to other
numbers and no existing field contains the same type of
information, a new field could be added to the classification
format for Relationship to Source.  It might be defined as follows:

686  Relationship to Source Note
Indicator 1  Type of relationship
   0              Number from other source edition
   1              Expansion
   2              Option
   3              Adaptation, other
   $a    Number in edition described in field 084--single number or
         beginning number of span (R)
   $b    Number in primary source edition--single number or beginning
         number of span (R)
   $c    Number in edition described in field 084, number in primary
         source edition, or number where instructions are found--ending
         number of span (R)
   $i    Explanatory text (R)
   $o    Number where instructions are found--single number or
         beginning number of span (R)
   $t    Topic (R)
   $z    Table identification (R)
   $2    Edition identifier (R)
   $5    Institution to which field applies (R)
   $8    Link and sequence no. (NR)

EXAMPLES:

Number from other source edition                     
084 8#              $addc$bSistema de Clasificación$c20$ncontains parts of
                    edition 21 in revised Table 2 notation for former
                    Soviet Union and Table 6 expansions for North and South
                    American Languages$espa
153 ##              $z2$4771$hEuropa    Europa Occidental$hEuropa oriental 
                      Rusia$hUcrania$jProvincia de Crimea
686 0#              $221

Expansion
084 8#            $addc$bClassificazione Decimale Dewey$c20$eita
153 ##            $z2$a454126$hEuropa    Europa occidentale$hPenisola
                  italiana e isole adiacenti  Italia $hRegione dellýEmilia-
                  Romagna e San Marino$hProvincia di Bologna$hNordovest
                  della provincia di Bologna$jCrevalcore
686 1#            $z2$b4541

Option
084 8#            $addc$bClassificazione Decimale Dewey$c20$eita
153 ##            $a222.86$hReligione$hBibbia$hLibri storici dellýAntico
                  Testamento$hNeemia (Esdra 2)$jTobia
683 2#            $i(Opzione: Classificare in$a229.22)$p253
686 2#            $o229.22


084 8#            $addc$bDewey Onlu Sýnýflama ve Baýýntýlý Dizin$c20$etur
153 ##            $$a412$hDil ve dilbilim$hBelirli diller$hTürk
                  dili$h$jStandart Türkçeýnin kökenbilimi (etimolojisi)
686 2#            $b494.352$o410

Adaptation, other
084 8#            $addc$bDewey Onlu Sýnýflama ve Baýýntýlý Dizin$c20$etur
153 ##            $z2$a56226$hTablo: Coýrafi Alanlar, Tarihi Dönemler,
                  Kiýiler$hAsya Doýu (Orient) Uzakdoýu$hOrta Doýu (Yakin
                  Doýu)$hEge Bölgesi (Batý Anadolu) ve Marmara Bölgesi
                  $hMarmara Bölgesi$jýstanbul
686 3#            $tComprehensive works and European portion of Istanbul
                  province$z2$b49618
686 3#            $tAsian portion of Istanbul province$z2$b563

084 8#            $addc$bClassificazione Decimale Dewey$c20$eita
153 ##            $a641.815$hTecnologia (Scienze applicate)$hEconomia
                  domestica e vita familiare$hCibi e bevande
                  (Alimenti)$hConservazione, immagazzinamento, cucina degli
                  alimenti$hCucina di specifici tipi di piatti$hPiatti
                  preliminari e di accompagnamento$jPane e affini
680 1#            $iEsempî: cialde, crackers, crˆpes, focacce, panini,
                  pizze, schiacciate
686 3#            $tPizza$b641.824

084 8#            $addc$bClassificazione Decimale Dewey$c20$eita
153 ##            $a641.824$hTecnologia (Scienze applicate)$hEconomia
                  domestica e vita familiare$hCibi e bevande
                  (Alimenti)$hConservazione, immagazzinamento, cucina degli
                  alimenti$hCucina di specifici tipi di piatti$hPiatti
                  principali$jSformati di carne e torte di formaggio
686 3#            $tPizza$a641.815


Script.  In order for the classification format to be used
internationally, it is necessary to provide deails about the
translation itself, such as script or romanization system.  DDC is
already published in other scripts (Arabic, Russian).  This issue
will be explored in a later discussion paper concerning script and
romanization in the authority format.  In addition, some editions
contain text in more than one language.  Thus, it is desirable to
make field 084 subfield $e (Language code) repeatable.


3.       PROPOSED CHANGES

The following is presented for consideration:

   -     In the USMARC Classification Format, define the following in
         field 084 (Classification Scheme and Edition:
                  $d       Source edition identifier
                  $f       Authorization
                  $n       Variations
         Make the following subfield in field 084 repeatable:
                  $e       Language code
   See Attachment A for a description of this field if this proposal
is approved.

   -     In the USMARC Classification Format, define a new field 686
         for Relationship to Source Note.
   See Attachment B for a description of this field if this proposal
   is approved.

------------------------------------------------------------------
                                                    ATTACHMENT A
<  > indicates addition; [  ] indicates deletion

084   Classification Scheme and Edition  (NR)

Indicators

  First          Type of edition
     0             Full
     1             Abridged
     8             Other

  Second         Undefined
     #             Undefined

Subfield Codes

     $a      Classification scheme code (NR)                           
     $b      Edition title (NR)
     $c      Edition identifier  (NR)
     <$d     Source edition identifier  (NR)>
     <$f     Authorization  (NR)>                          
     $e      Language  code (NR)
     <$n     Variations  (R)>



FIELD DEFINITION AND SCOPE

        This field contains information about the authoritative
classification scheme and edition that contains the classification
number(s) and term(s) in the record.  It also may indicate the
edition title, date, and language of a particular version of the
classification scheme.  If a library creates its own record for a
classification number maintained by another classification source,
the classification scheme on which it is based is specified in
field 084 and the library creating the record is identified in
field 040 (Record Source).



GUIDELINES FOR APPLYING CONTENT DESIGNATORS

INDICATORS

First Indicator - Type of edition 
        The first indicator position contains a value that specifies
        the type of edition containing the classification data.

        0 - Full
             Value 0 indicates that the classification data is contained
             in the full edition of the classification scheme.  This
             value is also used for classification schemes not issued in
             an abridged edition.


               084                0#$addc$c20
               153                ##$a616.9792$hTechnology (Applied
                                  sciences)$hMedical sciences. 
                                  Medicine$hDiseases$kSpecific diseases$hOther
                                  diseases$hDiseases of the immune
                                  system$hImmune deficiency diseases$jAcquired
                                  immune deficiency syndrome (AIDS)

               084                0#$alcc
               153                ##$aN6370$cN6494$hVisual arts$hHistory$hModern
                                  art$jBy century

        1 - Abridged
             Value 1 indicates that the classification data is from an
             abridged edition of the classification scheme.

               084                1#$addc$c11
               153                ##$a323.3$hSocial sciences$hPolitical science
                                  (Politics and government)$hRelation of state
                                  to its residents$hRelation of state to social
                                  aggregates$jOther social aggregates

        8 - Other
             Value 8 indicates that the classification data is contained
             in an edition other than those specified by the other
             values.  The edition is specified in subfield $b (Edition
             title) or subfield $c (Edition identifier).

               084                8#$audc$cInternational medium edition
               153                ##$a512.5$hMathematics and natural
                                  sciences$hAlgebra$jGeneral algebra

Second Indicator - Undefined
        The second indicator position is undefined and contains a
        blank ($).


 SUBFIELD CODES

$a - Classification scheme code
        Subfield $a contains a variable-length alphabetic USMARC code
        that identifies the classification scheme used to formulate
        the classification number and caption in field 153
        (Classification Number).  The code is based on the general
        classification scheme used without regard to the particular
        edition or adaptation of the scheme.  A classification number
        or span that has been adapted in some way from the information
        in the authoritative classification scheme is coded for the
        scheme in this subfield and the NUC symbol or name of the
        library that made the adaptation is contained in field 040
        (Record Source).  The source of the classification scheme code
        is USMARC Code List for Relators, Sources, Description
        Conventions that is maintained by the Library of Congress.

           084     0#$addc$c20
           153     ##$a323.32$hSocial sciences$hPolitical science
                   (Politics and government)$hCivil and political
                   rights$hCivil and political rights of other social
                   aggregates $jSocioeconomic classes

           084     0#$alcc
           153     ##$aHE381$hTransportation and communications$hWater
                   transportation$hWaterways$jGeneral works

           040     ##$aDNLM$cDNLM
           084     0#$alcc
           153     ##$aSF887$hAnimal culture$hVeterinary
                   medicine$hVeterinary medicine of special organs,
                   regions, and systems$hUrinary and reproductive
                   organs$jObstetrics
           753     ##$aAbortion, Veterinary
                 [This record is created by NLM for use in the NLM index
                 to refer users to an LCC number.  The basic
                 classification scheme is identified in field 084 and
                 agency that created the record is in field 040 (Record
                 Source).]

           084     8#$audc$cInternational medium edition
           153     ##$a642.12$hHousekeeping. Home economics. Domestic
                   science$hFood. Cooking. Dishes.
                   Meals$hMeals and mealtimes. Tableware$jMorning meal.
                   Breakfast


$b - Edition title
        Subfield $b contains the title of the edition when a USMARC
        code has not been assigned to the scheme or further
        information needs to be given about the edition.

           084     8#$addc$bSistema de Clasificación Decimal$c1980$espa
           153     ##$a331.012$hCiencias sociales$hEconom¡a$hEconom¡a
                   laboral$hFilosof¡a y
                   teor¡a$jSatisfacciones del trabajo
                 [Data is from the Spanish edition of the Dewey Decimal
                 Classification.]


$c - Edition identifier
        Subfield $c contains the edition number, date, or other
        textual designation of the classification scheme edition
        contained in the classification record.

           084     0#$addc$c20
           153     ##$a401.3$hLanguage$hPhilosophy and
                   theory$jInternational languages

           084     0#$anlm$c4th ed., rev.
           153     ##$aWQ160$hObstetrics$jMidwifery


<$d     Source edition identifier
           Subfield $d contains the edition number, date, or other
           textual designation of the classification scheme edition used
           as the primary source for the edition identified in subfield
           $c.  Subfield $d is not used if it would be the same as
           subfield $c.  Subfield $d contains the edition on which the
           current edition is based.

             084 8#               $addc $bSistema de Clasificación Decimal $c20
                                  $d21 $n contains parts of edition 21 in revised
                                  area table for former Soviet Union and Table 6
                                  expansions for North and South American native
                                  languages>


$e - Language code 
        Subfield $e contains the USMARC code for the language of the
        classification scheme edition when the language is other than
        English. The source of the codes is USMARC Code List for
        Languages that is maintained by the Library of Congress.


<$f - Authorization
        Subfield $f contains an indication of whether the translation
        has been authorized, i.e., done with the approval of the
        producer of the source edition.  If this subfield is not used,
        it is assumed to be authorized.

           084 8#                 $addc$bSistema de Clasificación
                                  Decimal$c20$funauthorized>

<$n - Variations
        Subfield $n contains general information about variations in
        this edition from the primary source edition.  Field 686
        Relationship to Source Note contains specific information
        about the relationship of a particular number to the source
        edition. 

           084 8#                 $addc$bSistema de Clasificación$c20$ncontains
                                  parts of edition 21 in revised Table 2 notation
                                  for former Soviet Union and Table 6 expansions
                                  for North and South American Languages$espa>



SCHEME-SPECIFIC CONVENTIONS

DEWEY DECIMAL CLASSIFICATION

Only the standard abridged edition uses value 1 in the first
indicator position.


RELATED USMARC FIELD/DOCUMENT

        040      Record Source
        USMARC Code List for Languages 
        USMARC Code List for Relators, Sources, Description
        Conventions

-------------------------------------------------------------------

                                                    ATTACHMENT B

686          Relationship to Source Note  (R)

Indicators

  First          Type of relationship
     0             Expansion not based on other source edition
     1             Option
     2             Adaptation
     3             Number from other source edition

  Second         Undefined
     #             Undefined

Subfield Codes
     $a      Number in edition described in field 084--single number or
             beginning number of span (R)
     $b      Number in primary source edition--single number or beginning
             number of span (R)
     $c      Number in edition described in field 084, number in primary
             source edition, or number where instructions are found--
             ending number of span (R)
     $i      Explanatory text (R)
     $o      Number where instructions are found--single number or
             beginning number of span (R)
     $t      Topic (R)
     $z      Table identification (R)
     $2      Edition identifier (R)
     $5      Institution to which field applies (R)
     $8      Link and sequence no. (NR)



FIELD DEFINITION AND SCOPE

  This field contains information about the relationship of a number
to the source edition when the number is 
different from the standard number for the same topic in the
primary source edition.  This field is used for 
numbers based on a source other than the primary source,
expansions, implemented options, and adaptations.

  The information in this field is intended primarily for computer
processing or to guide classifiers and is often 
not written in a form adequate for public user display.


GUIDELINES FOR APPLYING CONTENT DESIGNATORS

INDICATORS

First Indicator - Type of relationship
     The first indicator position contains a value that indicates the
     type of relationship between the number in the 153 field and the
     standard number for the same topic in the source edition.

     0 - Number from other source edition
     Value 0 indicates that the classification number in field 153 is
     based on a source other than the primary source.  If the
     classification number in field 153 is the implementation of an
     option described in the other source edition, use indicator
     value 2.

     1 - Expansion
     Value 1 indicates that the classification number in field 153
     represents a more specific number in the same hierarchy as the
     standard number in the primary source edition for the topic
     identified in subfield $t. If this number is based on another
     source edition, use indicator value 0.

     2 - Option
     Value 2 indicates that the classification number in field 153
     represents the implementation of an option described in the
     primary source or other source edition.  

     3 - Adaptation, other
     Value 3 indicates that the classification number in field 153 is
     different from the number in the primary source edition for the
     topic identified in subfield $t, and none of the types of
     relationships described with indicator values 0-2 is applicable.


SUBFIELD CODES
$a -       Number in edition described in field 084--single number or
           beginning number of span 
           Subfield $a contains the number in the edition described in
           field 084 for the topic identified in subfield $t.  Subfield
           $a is not used if it would be the same as the number in field
           153.

$b -       Number in primary source edition--single number or beginning
           number of span 
           Subfield $b contains the standard number in the primary
           source edition for the topic identified in subfield $t, or if
           there is no subfield $t, then in subfield $j of field 153. 
           Subfield $b is not used if it would be the same as subfield
           $o (Number where instructions are found).

$c -       Number in edition described in field 084, number in primary
           source edition, or number where instructions are found--
           ending number of span 
           Subfield $c contains the ending number of a classification
           number span cited in field 686. The beginning number of the
           span is recorded in subfields $a, $b or $o.

$o -       Number where instructions are found--single number or
           beginning number of span
           Subfield $o contains the number in the source edition where
           instructions are given for the option that is being
           implemented in the number in field 153.  This subfield is
           used only for an option described in the primary source
           edition or another source edition (indicator value 2).

$t -         Topic
           Subfield $t contains the topic that is being added to or
           subtracted from the meaning of the number in field 153.
           Subfield $t is not used if it would be the same as subfield
           $j (Caption) in field 153.

$i -         Explanatory text
           Subfield $i contains the explanatory text in field 686.

$z -         Table identification
           Subfield $z contains the identification of the table to which
           a classification number recorded in field 686 belongs, if the
           classification number is part of a table.  For a
           classification number span, subfield $z is given only once,
           before the first number.

$2 -       Edition identifier 
           Subfield $2 contains the edition number, date, or other
           textual designation of the classification scheme edition
           used as the source for the classification number in field
           153 when that source is not the primary source.  This
           subfield is used with indicator value 0, and with indicator
           value 2 when the option is described in the other source
           edition. This subfield is not used to record the edition
           identifier of the primary source edition; that edition
           identifier is recorded in subfield $c or $d of the 084
           field.

$5 -         Institution to which field applies
           Subfield $5 contains the USMARC code of the organization to
           which the Relationship of Source note applies. The source
           of this code is USMARC Code List for Organizations that is
           maintained by the Library of Congress. 


$8 -         Link and sequence number
           Subfield $8 contains data that is used to sequence a 686
           field with other related 6XX or 76X fields.  The subfield
           is structured as follows:

                 .

           The linking number is a variable length whole number.  The
           linking number is the same for each 6XX or 76X field being
           linked to this field.

           A variable length sequence number is added to control the
           display sequencing of fields with identical linking
           numbers.  A sequence number is separated from a linking
           number by a decimal point.  A sequence number may itself be
           a decimal number.

           For examples of the use of this subfield see field 763
           (Internal Subarrangement or Add Table Entry).


SCHEME-SPECIFIC CONVENTIONS

DEWEY DECIMAL CLASSIFICATION

Only longer numbers in the same hierarchy are expansions.


Go to:


Library of Congress
Library of Congress Help Desk (09/02/98)