Improving perceived and actual text difficulty for health information consumers using semi-automated methods
- PMID: 23304324
- PMCID: PMC3540563
Improving perceived and actual text difficulty for health information consumers using semi-automated methods
Abstract
We are developing algorithms for semi-automated simplification of medical text. Based on lexical and grammatical corpus analysis, we identified a new metric, term familiarity, to help estimate text difficulty. We developed an algorithm that uses term familiarity to identify difficult text and select easier alternatives from lexical resources such as WordNet, UMLS and Wiktionary. Twelve sentences were simplified to measure perceived difficulty using a 5-point Likert scale. Two documents were simplified to measure actual difficulty by posing questions with and without the text present (information understanding and retention). We conducted a user study by inviting participants (N=84) via Amazon Mechanical Turk. There was a significant effect of simplification on perceived difficulty (p<.001). We also saw slightly improved understanding with better question-answering for simplified documents but the effect was not significant (p=.097). Our results show how term familiarity is a valuable component in simplifying text in an efficient and scalable manner.
Figures
References
-
- Artificial Intelligence, With Help From the Humans. The New York Times. 2007 Mar 25;
-
- Ajzen I. The Theory of Planned Behavior. Organizational Behavior and Human Decision Processes. 1988;50:179–211.
-
- Baker D, Williams M, Parker R, Gazmararian J. Development of a brief test to measure functional health literacy. Patient Educ Couns. 1999;38(1):33–42. - PubMed
-
- Baker L, Wagner TH, Signer S, Bundorf MK. Use of the Internet and E-mail for Health Care Information: Results from a National Survey. Journal of the American Medical Association. 2003 May 14;289(18):2400–6. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Medical