PLoS One. 2017 Mar 17;12(3):e0173410.

doi: 10.1371/journal.pone.0173410. eCollection 2017.

Automatic ICD-10 coding algorithm using an improved longest common subsequence based on semantic similarity

YunZhi Chen^{1,2}, HuiJuan Lu³, LanJuan Li^{1,4}

Affiliations

¹ Zhejiang University School of Medicine, Hangzhou, China.
² Hangzhou Vocational and Technical College, Hangzhou, China.
³ College of Information Engineering of China Jiliang University, Hangzhou, China.
⁴ Zhejiang University the First Affiliated Hospital, Hangzhou, China.

PMID: 28306739
PMCID: PMC5356997
DOI: 10.1371/journal.pone.0173410

YunZhi Chen et al. PLoS One. 2017.

Abstract

ICD-10 (International Classification of Diseases, 10th revision) is a classification of diseases, symptoms, procedures, and injuries. Diseases are often described in patients' medical records in free text, using terms, phrases, and paraphrases that differ significantly from those used in the ICD-10 classification. This paper presents an improved approach, based on the Longest Common Subsequence (LCS) and semantic similarity, for automatically mapping Chinese diagnoses from the disease names given by clinicians to the disease names in ICD-10. The LCS is the longest string that is a subsequence of every member of a given set of strings. The improved LCS method proposed in this paper can increase the accuracy of Chinese disease name mapping.
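
To make the LCS definition concrete, the following is a minimal sketch of the classical two-string LCS computed by dynamic programming, used to rank candidate disease names. It is not the improved, semantics-aware variant the paper proposes. The normalization in `lcs_similarity` (LCS length divided by the longer string's length), the helper `best_match`, and the English example names are assumptions made for illustration only.

```python
def lcs_length(a: str, b: str) -> int:
    """Length of the longest common subsequence of a and b (classical DP)."""
    prev = [0] * (len(b) + 1)  # LCS lengths for the previous row of the table
    for ch_a in a:
        curr = [0]
        for j, ch_b in enumerate(b, start=1):
            if ch_a == ch_b:
                # Matching characters extend the LCS of both prefixes.
                curr.append(prev[j - 1] + 1)
            else:
                # Otherwise carry over the better of the two neighbours.
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def lcs_similarity(a: str, b: str) -> float:
    """Assumed normalization: LCS length over the longer string's length."""
    if not a or not b:
        return 0.0
    return lcs_length(a, b) / max(len(a), len(b))


def best_match(diagnosis: str, candidates: list[str]) -> tuple[str, float]:
    """Hypothetical helper: return the candidate name with the highest score."""
    scored = ((name, lcs_similarity(diagnosis, name)) for name in candidates)
    return max(scored, key=lambda pair: pair[1])


# Illustrative names only; a real system would use the ICD-10 disease list.
names = [
    "acute hepatitis B",
    "chronic viral hepatitis C",
    "chronic viral hepatitis B",
]
match, score = best_match("chronic hepatitis B", names)
```

Because the comparison is character by character, the same functions work unchanged on Chinese strings. A purely character-based score cannot recognise synonyms or paraphrases, which is the gap that combining LCS with semantic similarity is meant to close.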

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

**Fig 2. Corpus of Chinese word segmentation of 181 kinds of hepatitis.**

**Fig 3. Similarity line chart when L(A) = 1.**

**Fig 4. Similarity line chart when L(A) = 2.**

**Fig 5. Similarity line chart when L(A) = 3.**

**Fig 6. Similarity line chart when L(A) = 4.**

**Fig 7. Similarity line chart when L(A) = 5.**

**Fig 8. Similarity line chart when L(A) = 6.**

**Fig 9. Accuracy analysis chart under similarity threshold (n = 1000).**

**Fig 10. Given threshold of coding accuracy and percentage.**

Cited by

EHR problem list clustering for improved topic-space navigation.
Kreuzthaler M, Pfeifer B, Vera Ramos JA, Kramer D, Grogger V, Bredenfeldt S, Pedevilla M, Krisper P, Schulz S. BMC Med Inform Decis Mak. 2019 Apr 4;19(Suppl 3):72. doi: 10.1186/s12911-019-0789-9. PMID: 30943968.
Leveraging Shannon Entropy to Validate the Transition between ICD-10 and ICD-11.
Chen D, Zhang R, Zhu X. Entropy (Basel). 2018 Oct 8;20(10):769. doi: 10.3390/e20100769. PMID: 33265857.
Comparison of different feature extraction methods for applicable automated ICD coding.
Shuai Z, Xiaolin D, Jing Y, Yanni H, Meng C, Yuxin W, Wei Z. BMC Med Inform Decis Mak. 2022 Jan 12;22(1):11. doi: 10.1186/s12911-022-01753-5. PMID: 35022039.
Artificial Intelligence Algorithm with ICD Coding Technology Guided by the Embedded Electronic Medical Record System in Medical Record Information Management.
Wang C, Yao C, Chen P, Shi J, Gu Z, Zhou Z. J Healthc Eng. 2021 Aug 30;2021:3293457. doi: 10.1155/2021/3293457. eCollection 2021. PMID: 34497706.
Automatic ICD Code Assignment based on ICD's Hierarchy Structure for Chinese Electronic Medical Records.
Cao L, Gu D, Ni Y, Xie G. AMIA Jt Summits Transl Sci Proc. 2019 May 6;2019:417-424. eCollection 2019. PMID: 31258995.
