MBR Files

Users are responsible for compliance with all applicable MEDLINE®/PubMed® and other NLM Databases License Agreements. The MEDLINE citations included in this MEDLINE Baseline Repository were retrieved in their respective years and represent a static view of MEDLINE at that time.

Available Files: The following files are available from the MEDLINE Baseline Repository and include all of the files we generate while processing each baseline. We used GNU's gzip utility to compress the larger files and the Unix tar command to bundle the files into a single download. The compressed files can be expanded with either GNU's gunzip utility or WinZip; WinZip can also read the tar files and separate out the individual files. To download a file, move your cursor over its file icon, press the right mouse button, and select the "Save Link Target as ..." option.

The frequency count files represent a complete count of the MEDLINE baseline for each category. For example, the MH_freq_count file holds a count for each unique MeSH Heading found in MEDLINE for the given baseline year. We include an overall count for each term and, where applicable (MH and SH terms only), a count of how often the term has been starred. A starred term is an IM (Index Medicus) index term, which represents one of the most significant points of an article. We provide at least two versions of each frequency count category that differ only in sort order: MH_freq_count is ordered by the overall frequency count for each term, while MH_freq_alpha is ordered alphabetically by MeSH Heading. MH_major_freq_count is sorted by the count of how often each term has been starred. The README file associated with each baseline explains the format of each file in greater detail.

The raw data files are the files we generated for use in our MEDLINE Baseline Repository Query tool database. We felt that others might find these files useful, and providing them here helps eliminate duplication of effort.
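For readers who prefer a script to gunzip or WinZip, the expansion step described above can be sketched in Python with the standard library alone. This is a minimal sketch, not part of the Repository: the function name `expand_download` and any archive names passed to it are hypothetical, and the assumption is that each download is either a tar file (possibly gzip-compressed) or a single gzip-compressed file.

```python
import gzip
import shutil
import tarfile
from pathlib import Path


def expand_download(archive: Path, dest: Path) -> list[Path]:
    """Expand one downloaded file into dest and return the files written.

    Hypothetical helper: handles a tar archive (plain or gzip-compressed)
    or a single gzip-compressed file, as described in the text above.
    """
    dest.mkdir(parents=True, exist_ok=True)
    root = dest.resolve()
    written: list[Path] = []

    if tarfile.is_tarfile(str(archive)):
        # Mode "r:*" lets tarfile detect gzip compression on its own,
        # so the same branch covers .tar and .tar.gz downloads.
        with tarfile.open(str(archive), "r:*") as tar:
            for member in tar.getmembers():
                if not member.isfile():
                    continue  # skip directories, links, and devices
                target = (root / member.name).resolve()
                if root not in target.parents:
                    continue  # skip entries that would land outside dest
                target.parent.mkdir(parents=True, exist_ok=True)
                with tar.extractfile(member) as src, open(target, "wb") as out:
                    shutil.copyfileobj(src, out)
                written.append(target)
    elif archive.suffix == ".gz":
        # A single gzip-compressed file: drop the .gz suffix on output.
        target = root / archive.stem
        with gzip.open(str(archive), "rb") as src, open(target, "wb") as out:
            shutil.copyfileobj(src, out)
        written.append(target)
    else:
        raise ValueError(f"not a tar or gzip file: {archive}")

    return written
```

A call such as `expand_download(Path("download.tar.gz"), Path("baseline"))` would write the archive's regular files under `baseline/`; the file name here is a placeholder, not an actual Repository file name.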
We also provide two files that look at the MeSH Headings assigned to the completed citations during the given baseline or year. The "hist" file is a frequency count of MeSH Headings based on their assigned MeSH Treecodes. The "histST" file is a frequency count of MeSH Headings based on their UMLS Semantic Types and, more specifically, their UMLS Semantic Groupings (groups of Semantic Types). We have also included several graphs to help illustrate this data. The related MeSH Vocabulary data files are included here as well, so that all of the year-specific data you might need for your research is available.

The SemGroups.txt file is the latest addition to the Repository. It is unclear whether this file is updated each year as the Semantic Types change or is static. The file groups the UMLS Semantic Types into 15 (currently) high-level categories. We are using it to see whether we can detect patterns in how MeSH Headings are assigned in MEDLINE. Two papers provide much greater detail on the grouping of the Semantic Types: "Aggregating UMLS semantic types for reducing conceptual complexity." McCray AT, Burgun A, Bodenreider O; Medinfo. 2001;10(Pt 1):216-20, and "Exploring semantic groups through visual approaches." Bodenreider O, McCray AT; Journal of Biomedical Informatics. 2003;36(6):414-432. Both papers can be found at the Lister Hill National Center for Biomedical Communications web site (http://lhncbc.nlm.nih.gov) under the "Publications & Lectures" section.

One or more of the following tools may be needed to access the files located on this page after they have been downloaded. Which ones you need depends on your current computer resources.
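Returning to the Semantic Groupings described above, the roll-up from Semantic Types to high-level groups can be sketched as follows. This is an illustration only: it assumes SemGroups.txt is pipe-delimited with four fields per line (group abbreviation, group name, Semantic Type identifier, Semantic Type name), which this page does not state, so check the baseline README before relying on it. The function names and the counts in the demo are made up, and the sketch does not try to reproduce how the "histST" file itself is computed (for example, how a heading with several Semantic Types is counted).

```python
from collections import Counter

# Illustrative lines in the ASSUMED four-field layout:
# group abbreviation | group name | Semantic Type ID | Semantic Type name
SAMPLE_LINES = [
    "ACTI|Activities & Behaviors|T052|Activity",
    "ANAT|Anatomy|T017|Anatomical Structure",
    "ANAT|Anatomy|T023|Body Part, Organ, or Organ Component",
]


def load_type_to_group(lines) -> dict[str, str]:
    """Map each Semantic Type ID to the name of its high-level group."""
    mapping: dict[str, str] = {}
    for line in lines:
        line = line.strip()
        if not line:
            continue  # tolerate blank lines
        # Raises ValueError if a line does not have exactly four fields.
        _abbrev, group_name, type_id, _type_name = line.split("|")
        mapping[type_id] = group_name
    return mapping


def count_by_group(type_counts, type_to_group) -> Counter:
    """Sum per-type counts into per-group totals.

    type_counts is an iterable of (Semantic Type ID, count) pairs.
    IDs missing from the mapping are collected under "UNKNOWN".
    """
    totals: Counter = Counter()
    for type_id, count in type_counts:
        totals[type_to_group.get(type_id, "UNKNOWN")] += count
    return totals


if __name__ == "__main__":
    type_to_group = load_type_to_group(SAMPLE_LINES)
    # Made-up counts, for demonstration only.
    demo_counts = [("T017", 3), ("T023", 2), ("T052", 5)]
    for group, total in count_by_group(demo_counts, type_to_group).most_common():
        print(group, total)
```

To run this against a real file, pass an open file handle for the downloaded SemGroups.txt to `load_type_to_group` in place of `SAMPLE_LINES`.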

Last Modified: February 22, 2008

Lister Hill National Center for Biomedical Communications
U.S. National Library of Medicine
National Institutes of Health
Department of Health and Human Services