Historical author affiliations assist verification of automatically generated MEDLINE citations
- PMID: 17238701
- PMCID: PMC1839323
Historical author affiliations assist verification of automatically generated MEDLINE citations
Abstract
High OCR error rates encountered in author affiliations increase the manual labor needed to verify MEDLINE citations automatically created from scanned journal articles. This is due to poor OCR recognition of the small text and italics frequently used in printed affiliations. Using author-affiliation relationships found in existing MEDLINE records, the SeekAffiliation (SA) program automatically finds potentially correct and complete affiliations, thereby reducing manual effort and increasing the efficiency of creating the citations.
Similar articles
-
International access to the Chinese medical literature through MEDLINE.Chin Med J (Engl). 1993 Apr;106(4):243-9. Chin Med J (Engl). 1993. PMID: 8325151
-
Leading 20 at 20: top cited articles and authors in the Journal of Orthopaedic Trauma, 1987-2007.J Orthop Trauma. 2010 Jan;24(1):53-8. doi: 10.1097/BOT.0b013e3181aa2182. J Orthop Trauma. 2010. PMID: 20035179
-
A probabilistic similarity metric for Medline records: a model for author name disambiguation.AMIA Annu Symp Proc. 2003;2003:1033. AMIA Annu Symp Proc. 2003. PMID: 14728536 Free PMC article.
-
Towards automatic augmentation of electronic medical records with MEDLINE citations.AMIA Annu Symp Proc. 2007 Oct 11:894-5. AMIA Annu Symp Proc. 2007. PMID: 18693995
-
[Breast pathology: evaluation of the Portuguese scientific activity based on bibliometric indicators].Acta Med Port. 2006 May-Jun;19(3):225-34. Epub 2006 Sep 7. Acta Med Port. 2006. PMID: 17234084 Review. Portuguese.
References
-
- Thoma GR. Automating data entry into MEDLINE. Proc. 1999 Symp. on Document Image Understanding Technology; Apr 1999; College Park, MD: Institute for Advanced Computer Studies; pp. 217–8.
-
- Hauser SE, Sabir TF, Thoma GR. OCR correction using historical relationships from verified text in biomedical citations. Proc. 2003 Symp. on Document Image Understanding Technology; Apr 2003; College Park, MD: Institute for Advanced Computer Studies; pp. 171–7.
-
- U.S. National Institutes of Health, National Library of Medicine. Entrez Programming Utilities. http://eutils.ncbi.nlm.nih.gov/entrez/query/static/eutils_help.html.
-
- Hauser SE, Schlaifer J, Sabir TF, Demner-Fushman D, Thoma GR. Correcting OCR text by association with historic datasets. Proc. SPIE Electronic Imaging, January 2003. SPIE Vol. 5010; pp. 84–93.
MeSH terms
LinkOut - more resources
Full Text Sources