RESEARCH PRODUCT

Estimating web site readability using content extraction

Thomas Gottron, Ludger Martin

subject

Information retrieval, business.industry, Computer science, media_common.quotation_subject, Content extraction, Quality (business), Usability, business, Readability, media_common, Web site

description

Nowadays, people search for information primarily on the WWW. From a user's perspective, readability is an important criterion for measuring the accessibility, and thereby the quality, of information. We show that modern content extraction algorithms help to estimate the readability of a web document quite accurately.

https://doi.org/10.1145/1526709.1526911