Skip Navigation
Lister Hill Center Logo  

Search Tips
About the Lister Hill Center
Innovative Research
Publications and Lectures
Training and Employment
LHNCBC: Document Abstract
Year: 2000Adobe Acrobat Reader
Download Free Adobe Acrobat Reader
LHNCBC-2000-056
Automatic Indexing by Discipline and High-Level Category: Methodology and Potential Applications
Humphrey S, Rindflesch TC, Aronson AR
Proc. of the 11th SIG/CR Classification Research Workshop. 2000;:103-16.
As background, this paper describes journal descriptor (JD) indexing, based on indexing at the journal level using only 127 descriptors, and applying statistical methods that associate this journal indexing with text words in a training set of MEDLINE citations. These associations then form the basis for automatic indexing of documents outside the training set. The paper then presents the new technique of semantic type (ST) indexing, based on JD indexing associated with each of 134 ST's, and applying the standard cosine coefficient measure to compare the similarity between the JD indexing of a document and the JD indexing of each ST. The ST indexing of the document is the list of ST's ranked in decreasing order of similarity between the JD indexing of the document and the JD indexing of the ST's. Discussion of the potential usefulness and application of the very general indexing provided by ST's comprises the remainder of the paper. In particular, it is suggested, with several examples, that ST's may convey a unique slant of a document's content not normally represented in standard indexing vocabularies. Use of ST indexing to rank retrieved output is mentioned as a possible application. Notwithstanding the importance of methodology and performance issues, the intent of this paper is to explore questions of the potential utility and applicability of ST indexing.
PDF