Facilitating accurate health provider directories using natural language processing
- PMID: 30943977
- PMCID: PMC6448184
- DOI: 10.1186/s12911-019-0788-x
Facilitating accurate health provider directories using natural language processing
Abstract
Background: Accurate information in provider directories are vital in health care including health information exchange, health benefits exchange, quality reporting, and in the reimbursement and delivery of care. Maintaining provider directory data and keeping it up to date is challenging. The objective of this study is to determine the feasibility of using natural language processing (NLP) techniques to combine disparate resources and acquire accurate information on health providers.
Methods: Publically available state licensure lists in Connecticut were obtained along with National Plan and Provider Enumeration System (NPPES) public use files. Connecticut licensure lists textual information of each health professional who is licensed to practice within the state. A NLP-based system was developed based on healthcare provider taxonomy code, location, name and address information to identify textual data within the state and federal records. Qualitative and quantitative evaluation were performed, and the recall and precision were calculated.
Results: We identified nurse midwives, nurse practitioners, and dentists in the State of Connecticut. The recall and precision were 0.95 and 0.93 respectively. Using the system, we were able to accurately acquire 6849 of the 7177 records of health provider directory information.
Conclusions: The authors demonstrated that the NLP- based approach was effective at acquiring health provider information. Furthermore, the NLP-based system can always be applied to update information further reducing processing burdens as data changes.
Conflict of interest statement
Ethics approval and consent to participate
The data analyzed in this study was public information and was considered as non-human subjects research. An ethics approval was waived by the corresponding IRB.
Consent for publication
Written informed consent for publication was obtained from all research participants.
Competing interests
The authors declare that they have no competing interests.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figures




Similar articles
-
Improving provider directory accuracy: can machine-readable directories help?Am J Manag Care. 2019 May;25(5):241-245. Am J Manag Care. 2019. PMID: 31120718
-
Characterizing physician directory data quality: variation by specialty, state, and insurer.BMC Health Serv Res. 2024 Jul 18;24(1):808. doi: 10.1186/s12913-024-11269-5. BMC Health Serv Res. 2024. PMID: 39020337 Free PMC article.
-
Natural language processing to extract follow-up provider information from hospital discharge summaries.AMIA Annu Symp Proc. 2010 Nov 13;2010:872-6. AMIA Annu Symp Proc. 2010. PMID: 21347103 Free PMC article.
-
A Comparative Workforce Study of Midwives Practicing in the State of Texas.J Midwifery Womens Health. 2018 Nov;63(6):682-692. doi: 10.1111/jmwh.12739. Epub 2018 Jun 8. J Midwifery Womens Health. 2018. PMID: 29883047 Free PMC article. Review.
-
Natural language processing of symptoms documented in free-text narratives of electronic health records: a systematic review.J Am Med Inform Assoc. 2019 Apr 1;26(4):364-379. doi: 10.1093/jamia/ocy173. J Am Med Inform Assoc. 2019. PMID: 30726935 Free PMC article.
Cited by
-
SEETrials: Leveraging large language models for safety and efficacy extraction in oncology clinical trials.Inform Med Unlocked. 2024;50:101589. doi: 10.1016/j.imu.2024.101589. Epub 2024 Oct 11. Inform Med Unlocked. 2024. PMID: 39493413 Free PMC article.
-
Detecting Ground Glass Opacity Features in Patients With Lung Cancer: Automated Extraction and Longitudinal Analysis via Deep Learning-Based Natural Language Processing.JMIR AI. 2023 Jun 1;2:e44537. doi: 10.2196/44537. JMIR AI. 2023. PMID: 38875565 Free PMC article.
-
AI integration in nephrology: evaluating ChatGPT for accurate ICD-10 documentation and coding.Front Artif Intell. 2024 Sep 2;7:1457586. doi: 10.3389/frai.2024.1457586. eCollection 2024. Front Artif Intell. 2024. PMID: 39286549 Free PMC article.
-
Special issue of BMC medical informatics and decision making on health natural language processing.BMC Med Inform Decis Mak. 2019 Apr 4;19(Suppl 3):76. doi: 10.1186/s12911-019-0777-0. BMC Med Inform Decis Mak. 2019. PMID: 30943961 Free PMC article. No abstract available.
References
-
- Morris G, Afzal S, Bhasker M, Finney D: Provider Directory Solutions: Market Assessment and Opportunities Analysis [https://www.healthit.gov/sites/default/files/provider_directory_solution... 02/10/2019].
-
- Centers for Medicare and Medicaid Services. Online Provider Directory Review Report [https://www.cms.gov/Medicare/Health-Plans/ManagedCareMarketing/Downloads... 02/10/2019].
-
- Samuel CA, King J, Adetosoye F, Samy L, Furukawa MF. Engaging providers in underserved areas to adopt electronic health records. Am J Manag Care. 2013;19:229–234. - PubMed
-
- Office of the National Coordinator for Health Information Technology . Federal Health Information Technology Strategic Plan 2011–2015. Washington, DC: Department of Health and Human Services; 2011.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources