Medical Subject Headings - Files Available to Download
Downloading any of the full data files requires completing an online Memorandum of Understanding.
- 2008 MeSH
- These formatted versions of the entire MeSH vocabulary are available to download. For best results when downloading, use a graphical interface.
- 2008 MeSH in XML format. MeSH descriptors and qualifiers, and Supplementary Concept Records (formerly Supplementary Chemical Records), in XML format. Files are updated weekly. Supplementary records are compatible with 2008 MeSH descriptors. The descriptor file is 260 MB uncompressed and 14 MB compressed.
- 2008 MeSH in ASCII format. MeSH descriptors and qualifiers, and Supplementary Concept Records (formerly Supplementary Chemical Records), in ASCII format. Files are updated weekly. The descriptor file is 25 MB and may take more than 30 minutes to download over a connection of 2400 bps or slower. Postings data in descriptor and qualifier records reflect updates in citation files through the October 2000 entry month.
Note that the subfield separator for descriptor entry terms has been changed from a colon to a bar (|) to make it easier to parse terms that contain colons.
- 2008 MeSH Trees. MeSH main headings with the tree numbers that place each heading in a hierarchical arrangement. Sorted by tree number. ASCII format. [1.8 MB]
- 2008 MeSH in MARC format. MeSH vocabulary data in the USMARC authority format.
- These files are also available for 2008 MeSH:
- 2007 MeSH
- 2007 MeSH in XML format. MeSH descriptors and qualifiers, and Supplementary Concept Records (formerly Supplementary Chemical Records), in XML format. The descriptor file is 228 MB uncompressed and 12 MB compressed. The file of Supplementary Concept Records is updated weekly.
NOTE: The 2007 SCR XML files posted between August 13 and August 23 have an extraneous circumflex accent character ("^") before the three XML entity characters: "&", "<", and ">". If this character causes a problem with your system, you can delete it or download a file posted after 11:45 AM, August 23.
- 2007 MeSH in ASCII format. MeSH descriptors and qualifiers, and Supplementary Concept Records (formerly Supplementary Chemical Records), in ASCII format. Files are updated weekly. The descriptor file is 23 MB and may take more than 30 minutes to download over a connection of 2400 bps or slower. Postings data in descriptor and qualifier records reflect updates in citation files through the October 2000 entry month.
Note that the subfield separator for descriptor entry terms has been changed from a colon to a bar (|) to make it easier to parse terms that contain colons.
- 2007 MeSH Trees. MeSH main headings with the tree numbers that place each heading in a hierarchical arrangement. Sorted by tree number. ASCII format. [1.4 MB]
- 2007 MeSH in MARC format. MeSH vocabulary data in the USMARC authority format.
- 2006 MeSH
- 2004 MeSH
- 2004 MeSH files for TREC (Text REtrieval Conference) Genomics track. For use by the TREC community with the 2004 MEDLINE/PubMed Baseline.
- MeSH ELHILL Format Document (PDF format): A description of the MeSH data elements in the ELHILL Unit Record Format (EURF), NLM's former format for distributing MeSH on tape. Most elements are common to both the ASCII and EURF MeSH formats, so the EURF description of MeSH data may be of interest to those downloading the ASCII MeSH files from the Web.
- For questions concerning distribution, format, etc., contact:
- Jacque-Lynne Schulman
- Medical Subject Headings
- Telephone: 301-496-1495; FAX: 301-402-2002
- email: schulman@nlm.nih.gov
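The notes above on the bar (|) subfield separator in the ASCII files and on tree numbers in the MeSH Trees files can be illustrated with a minimal Python sketch. It is not NLM code. The function names and sample values are hypothetical, and two details are assumptions rather than statements from this page: that each line of the Trees file has the form "heading;tree number", and that tree numbers are dot-separated so a node's parent is obtained by dropping the last segment.

```python
def split_entry_term(value, sep="|"):
    """Split one entry-term value into its subfields.

    Splitting on the bar leaves any colon inside the term itself untouched.
    """
    return [part.strip() for part in value.split(sep)]


def parse_tree_line(line):
    """Split one Trees line into (heading, tree_number).

    Assumption: the line looks like 'heading;tree number'. rpartition is used
    so that a semicolon inside the heading would not break the split.
    """
    heading, _, tree_number = line.rstrip("\r\n").rpartition(";")
    return heading, tree_number


def parent_tree_number(tree_number):
    """Return the parent of a dotted tree number, or None at the top level."""
    head, dot, _ = tree_number.rpartition(".")
    return head if dot else None


def ancestors(tree_number):
    """List every ancestor tree number, nearest parent first."""
    chain = []
    parent = parent_tree_number(tree_number)
    while parent is not None:
        chain.append(parent)
        parent = parent_tree_number(parent)
    return chain


if __name__ == "__main__":
    # Hypothetical entry-term value; the term itself contains a colon.
    print(split_entry_term("Sample Term: With Colon|T000|NON"))
    # Illustrative Trees line and tree number.
    print(parse_tree_line("Body Regions;A01"))
    print(ancestors("A01.236.500"))
```

With the old colon separator, the sample term would have been cut at its own colon. Splitting on the bar keeps it whole.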
Last reviewed: 09 September 2008
Last updated: 09 September 2008
First published: 01 September 1999
Metadata| Permanence level: Permanent: Dynamic Content