Rare disease diagnosis: A review of web search, social media and large-scale data-mining approaches
- PMID: 26442199
- PMCID: PMC4590007
- DOI: 10.1080/21675511.2015.1083145
Rare disease diagnosis: A review of web search, social media and large-scale data-mining approaches
Abstract
Physicians and the general public are increasingly using web-based tools to find answers to medical questions. The field of rare diseases is especially challenging and important as shown by the long delay and many mistakes associated with diagnoses. In this paper we review recent initiatives on the use of web search, social media and data mining in data repositories for medical diagnosis. We compare the retrieval accuracy on 56 rare disease cases with known diagnosis for the web search tools google.com, pubmed.gov, omim.org and our own search tool findzebra.com. We give a detailed description of IBM's Watson system and make a rough comparison between findzebra.com and Watson on subsets of the Doctor's dilemma dataset. The recall@10 and recall@20 (fraction of cases where the correct result appears in top 10 and top 20) for the 56 cases are found to be be 29%, 16%, 27% and 59% and 32%, 18%, 34% and 64%, respectively. Thus, FindZebra has a significantly (p < 0.01) higher recall than the other 3 search engines. When tested under the same conditions, Watson and FindZebra showed similar recall@10 accuracy. However, the tests were performed on different subsets of Doctors dilemma questions. Advances in technology and access to high quality data have opened new possibilities for aiding the diagnostic process. Specialized search engines, data mining tools and social media are some of the areas that hold promise.
Keywords: clinical diagnosis decision support systems; data mining; information retrieval; machine learning; rare diseases; search engines.
Figures
Similar articles
-
Specialized tools are needed when searching the web for rare disease diagnoses.Rare Dis. 2013 May 16;1:e25001. doi: 10.4161/rdis.25001. eCollection 2013. Rare Dis. 2013. PMID: 25002998 Free PMC article.
-
FindZebra: a search engine for rare diseases.Int J Med Inform. 2013 Jun;82(6):528-38. doi: 10.1016/j.ijmedinf.2013.01.005. Epub 2013 Feb 23. Int J Med Inform. 2013. PMID: 23462700
-
FindZebra online search delving into rare disease case reports using natural language processing.PLOS Digit Health. 2023 Jun 29;2(6):e0000269. doi: 10.1371/journal.pdig.0000269. eCollection 2023 Jun. PLOS Digit Health. 2023. PMID: 37384616 Free PMC article.
-
Google Versus PubMed: Comparison of Google and PubMed's Search Tools for Answering Clinical Questions in the Emergency Department.Ann Emerg Med. 2020 Mar;75(3):408-415. doi: 10.1016/j.annemergmed.2019.07.003. Epub 2019 Oct 14. Ann Emerg Med. 2020. PMID: 31623934 Review.
-
[From symptom to diagnosis-symptom checkers re-evaluated : Are symptom checkers finally sufficient and accurate to use? An update from the ENT perspective].HNO. 2019 May;67(5):334-342. doi: 10.1007/s00106-019-0666-y. HNO. 2019. PMID: 30993374 Review. German.
Cited by
-
National Registry of Designated Intractable Diseases in Japan: Present Status and Future Prospects.Neurol Med Chir (Tokyo). 2017 Jan 15;57(1):1-7. doi: 10.2176/nmc.st.2016-0135. Epub 2016 Sep 21. Neurol Med Chir (Tokyo). 2017. PMID: 27666154 Free PMC article.
-
Interviews with experts in rare diseases for the development of clinical decision support system software - a qualitative study.BMC Med Inform Decis Mak. 2020 Sep 16;20(1):230. doi: 10.1186/s12911-020-01254-3. BMC Med Inform Decis Mak. 2020. PMID: 32938448 Free PMC article.
-
Improving rare disease classification using imperfect knowledge graph.BMC Med Inform Decis Mak. 2019 Dec 5;19(Suppl 5):238. doi: 10.1186/s12911-019-0938-1. BMC Med Inform Decis Mak. 2019. PMID: 31801534 Free PMC article.
-
Access to patient oriented information-a baseline Endo-ERN survey among patients with rare endocrine disorders.Endocrine. 2021 Mar;71(3):542-548. doi: 10.1007/s12020-021-02654-9. Epub 2021 Feb 18. Endocrine. 2021. PMID: 33599944 Free PMC article.
-
A proof-of-concept study of extracting patient histories for rare/intractable diseases from social media.Genomics Inform. 2020 Jun;18(2):e17. doi: 10.5808/GI.2020.18.2.e17. Epub 2020 Jun 18. Genomics Inform. 2020. PMID: 32634871 Free PMC article.
References
-
- The UK Strategy for Rare Diseases https://www.gov.uk/government/uploads/system/uploads/attachment_data/fil...
-
- Graber ML, Franklin N, Gordon R. Diagnostic error in internal medicine. Arch Intern Med 2005; 165(13):1493-9; PMID:16009864 - PubMed
-
- Berner ES, Graber ML. Overconfidence as a cause of diagnostic error in medicine. Am J Med 2008; 121(5):2-23; PMID:18187063 - PubMed
-
- Graber M, Gordon R, Franklin N. Reducing diagnostic errors in medicine: what's the goal? Acad Med 2002; 77(10):981-92; PMID:12377672 - PubMed
Publication types
LinkOut - more resources
Full Text Sources
Other Literature Sources