Term Familiarity to Indicate Perceived and Actual Difficulty of Text in Medical Digital Libraries
Document Type
Book Chapter
Department
Information Systems and Technology (CGU)
Publication Date
2011
Disciplines
Databases and Information Systems
Abstract
With increasing text digitization, digital libraries can personalize materials for individuals with different education levels and language skills. To this end, documents need meta-information describing their difficulty level. Previous attempts at such labeling used readability formulas but the formulas have not been validated with modern texts and their outcome is seldom associated with actual difficulty. We focus on medical texts and are developing new, evidence-based meta-tags that are associated with perceived and actual text difficulty. This work describes a first tag, ’term familiarity’, which is based on term frequency in the Google corpus. We evaluated its feasibility to serve as a tag by looking at a document corpus (N=1,073) and found that terms in blogs or journal articles displayed unexpected but significantly different scores. Term familiarity was then applied to texts and results from a previous user study (N=86) and could better explain differences for perceived and actual difficulty.
Rights Information
© 2011 Springer-Verlag
Terms of Use & License Information
DOI
10.1007/978-3-642-24826-9_38
Recommended Citation
G. Leroy and J.E. Endicott, "Term Familiarity to Indicate Perceived and Actual Difficulty of Text in Medical Digital Libraries," International Conference on Asia-Pacific Digital Libraries (ICADL 2011) - Digital Libraries -- for Culture Heritage, Knowledge Dissemination, and Future Creation. Beijing, China, October 24-27, 2011. doi: 10.1007/978-3-642-24826-9_38