The Library of Congress >> Especially for Librarians and Archivists >> Standards
MARC Standards
MARC 21 HOME >> Specifications >> Character Sets >> Part 5

MARC 21 Specifications for Record Structure, Character Sets, and Exchange Media

CHARACTER SETS AND ENCODING OPTIONS: Part 5

MARC-8 Code Tables

December 2007

INTRODUCTION

The MARC-8 repertoire and encoding are specified by the collection of character sets named below together with the escape sequences described in Part 2. Mappings between valid MARC-8 code points and their UCS/Unicode equivalents are provided in tables on this site. Only MARC-8 code points included in the tables should be used. XML and comma-delimited versions of the MARC-8 to Unicode mapping tables for use in software applications are also provided.

Basic Latin (ASCII)
Greek symbols
Subscripts
Superscripts
Extended Latin (ANSEL)
Basic Hebrew
Basic Cyrillic
Extended Cyrillic
Basic Arabic
Extended Arabic
Greek
East Asian (WARNING: Before attempting to print the East Asian set, note that it consists of over 200 pages of character mappings)

MARC-8 to Unicode XML mapping file (all MARC-8 characters) (WARNING: This file is very large and may take several minutes to load in an Internet browser)

MARC-8 to Unicode comma-delimited mapping file (EACC characters only)
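
The introduction notes that the mapping tables are also provided in machine-readable form for use in software applications. As a rough illustration only, the sketch below loads a comma-delimited mapping into a lookup table. The column layout (MARC-8 code in hexadecimal, then the UCS code point in hexadecimal) and the function name load_marc8_to_unicode are assumptions made for this example and are not taken from the published file, so check the real file's layout before adapting it. The sample rows use Basic Latin (ASCII) values, where the MARC-8 code and the Unicode code point coincide. The published comma-delimited file covers EACC characters only.

```python
import csv
import io


def load_marc8_to_unicode(lines):
    """Build a dict mapping a MARC-8 code (as an int) to a Unicode character.

    ASSUMED row layout (hypothetical, not confirmed by the specification):
    column 0 = MARC-8 code in hex, column 1 = UCS code point in hex.
    Rows that do not parse as two hex values (for example a header row)
    are skipped.
    """
    mapping = {}
    for row in csv.reader(lines):
        if len(row) < 2:
            continue
        try:
            marc8_code = int(row[0].strip(), 16)
            ucs_code_point = int(row[1].strip(), 16)
        except ValueError:
            continue  # not a data row under the assumed layout
        mapping[marc8_code] = chr(ucs_code_point)
    return mapping


# Sample rows in the assumed layout, using Basic Latin (ASCII) values.
SAMPLE = io.StringIO("41,0041\n7A,007A\n")
table = load_marc8_to_unicode(SAMPLE)
```

Looking up a code that is absent from the dictionary raises KeyError, which is one simple way to honor the rule that only code points included in the tables should be used.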


( 12/05/2007 )