Home Projects Publications Presentations Repositories Photo Gallery Career Staff Favorites
  • MyDelivery
  • Turning The Pages Online
  • MyMorph
  • Medical Article Records GROUNDTRUTH (MARG)
  • MD on Tap
  • AnatQuest
Links to Feeds:
PublicationsRSS  RSS
CEB NewsRSS  RSS

Last updated: September 03, 2008

Jong Woo Kim, Ph.D.

Print this Print this  E-mail this E-mail this

Jong Woo Kim, Ph.D. National Library of Medicine
Communications Engineering Branch/MSC 3824
Bldg. 38A, Room 10S1011G
8600 Rockville Pike
Bethesda, MD 20894 USA

(301) 435-3227 (voice)
(301) 402-0341 (fax)

 

Dr. Kim has 6 years of software engineering experience and 16 years experience working with a variety of computer systems. He has a strong research background in pattern recognition, image processing, computer/machine vision, fuzzy set theory, neural networks, probability and robust statistics combined with an extended mathematical and electronics background. Dr. Kim has programmed in C, C++, Visual C++, and SQL in Windows Me/2000 and UNIX environments.

Dr. Kim joined the National Library of Medicine in 1998. He is currently working for the Communications Engineering Branch at the Lister Hill Center. His job is involved in the Medical Article Record System Project to develop Artificial Intelligence Modules for automatic article labeling systems using MS Visual C++, C#, and MS Windows NT environment.

Dr. Kim received his B.S. and M.S. degrees in Electrical and Computer Engineering in 1989 and 1991 at the Kyungpook National University in Taegu, Korea. Also, Dr. Kim received his Ph.D. in Computer Engineering and Computer Science in 1997 at the University of Missouri in Columbia, Missouri.


Current Projects

WebMARS/WebMARS-SpinOff: The CEB has developed an automated system, called Web Based Medical Article Record System (WebMARS), to produce bibliographic records for its MEDLINEâ database from full text versions of HTML (PDF) format online journal articles. The WebMARS employs document image analysis and understanding techniques, and DOM technology to complement existing MARS. I am responsible for developing a labeling module callded Web Labeling Module. The module detects rubric, title, vernacular, author, corporate author, affiliation, abstract, pagination, grant number, e-mail, zip code, databank accession number, and support zones (grant number, databank accession number, and support zones for WebMARS-SpinOff) automatically from HTML-format journal articles using Fuzzy/Crisp rule-based algorithms and statistical information. Visual C++, ADO, and SQL are used to implement this module. I am also responsible for Web Updating module. The module extracts journal specific information of rubric, title, vernacular, author, corporate author, affiliation, abstract, pagination, grant number, e-mail, zip code, databank accession number, and support zones automatically from HTML-format journal articles using string matching algorithms. Visual C++, ADO, and SQL are used to implement these modules.

MARSII: The CEB has developed an automated system to produce bibliographic records for its MEDLINE® database from hard-copy medical journals. This system, named Medical Article Record System (MARS), employs document image analysis and understanding techniques and optical character recognition (OCR). The system is composed of eleven modules. I am in charge of developing and maintaining a labeling module callded ZoneCzar. The module labels title, author, affiliation, and abstract zones automatically from scanned journal articles, using rule-based algorithms. Visual C++, Rogue Wave, and SQL are used to implement this module.

MTA Project:The Bibliographic Services Division (BSD), a part of Library Operations at the NLM, asked the Communications Engineering Branch, NLM, to develop an automated system to produce bibliographic records for its database from several AIDS related conference journals. This project, called the Meeting Abstract (MTA), employs document image analysis and understanding techniques and optical character recognition (OCR). The system was currently composed of ten modules. I developed a labeling module for MTA. The module labeled title, author, affiliation, abstract, keyword, rubric, pagination, abstract number, grant number, e-mail, zip code, databank, and corporate zones automatically from scanned articles, using rule-based algorithms. Visual C++, Rogue Wave, and SQL were used for the development.

 

National Institutes of Health (NIH)National Institutes of Health (NIH)
9000 Rockville Pike
Bethesda, Maryland 20892

U.S. Dept. of Health and Human ServicesU.S. Dept. of Health
and Human Services

USA.gov Website