Skip to Content
United States National Library of Medicine National Institutes of Health

MEDLINE® Data Entry by Fiscal Year

Beginning in 1997, NLM has managed a dramatic shift in how citation data first enters the MEDLINE process: from double keyboarding citations by hand; to scanning and using optical character recognition (OCR); to importing records supplied by publishers in eXtensible Markup Language (XML) format. The data below reflects citations created (not necessarily indexed) for MEDLINE in each fiscal year.

data entry methods by year

 

Fiscal Year
Total Citations Entered
Keyboarded1 Citations
OCRed Citations
Citations Received Electronically via XML
2007 685,134 0 105,841 579,293
2006 669,124 0 122,106 547,018
2005 639,000 0 152,000 487,000
2004 584,000 55,000 98,000 431,000
2003 535,000 101,000 109,000 325,000
2002 519,000 114,000 113,000 292,000
2001 472,000 124,000 129,000 219,000
2000 488,000 156,000 146,000 186,000
1999 453,000 198,000 151,000 104,000
1998 434,000 260,000 130,000 44,000
1997 496,000 434,000 62,000 0
1996 336,0002 336,000 0 0
1995 410,5593 410,559 0 0

The U.S. Government Fiscal Year runs from October 1 - September 30.

1Keyboarding ceased as a method of data entry in June 2004.

2There was a data entry and concomitant indexing backlog resulting from the General Services Administration Board of Contract Appeals' suspension of NLM's editing and data entry contract from late February 1996 to late April 1996. The large backlog was eliminated in early April 1997. This caused fewer records to be added during FY 1996 and a greater number added in FY 1997.

3Number derived from citations present in the database in 2007 that are tagged as being created during FY1995; not the original processing statistic.

Last reviewed: 23 April 2008
Last updated: 23 April 2008
First published: 16 March 2007
Metadata| Permanence level: Permanence Not Guaranteed