LHNCBC: Document Abstract
Year: 2007
LHNCBC-2007-036
Automatic Indexing of Specialized Documents: Using Generic vs. Domain-Specific Document Representations
Neveol A, Mork JG, Aronson R
Proc BioNLP 2007 Workshop, 183-92.
The shift from paper to electronic documents has caused the curation of information sources in large electronic databases to become more generalized. In the biomedical domain, continuing efforts aim at refining indexing tools to assist with the update and maintenance of databases such as MEDLINE. In this paper, we evaluate two statistical methods of producing MeSH indexing recommendations for the genetics literature, including recommendations involving subheadings, which is a novel application for the methods. We show that a generic representation of the documents yields both better precision and better recall. We also find that a domain-specific representation of the documents can contribute to enhancing recall.