6533b839fe1ef96bd12a5b35

RESEARCH PRODUCT

P-FCM: a proximity-based fuzzy clustering for user-centered web applications

Sabrina SenatoreWitold PedryczVincenzo LoiaVincenzo Loia

subject

0209 industrial biotechnologymedicine.medical_specialtyComputer science02 engineering and technologyWeb engineeringcomputer.software_genreSimilarityTheoretical Computer ScienceWorld Wide Web020901 industrial engineering & automationArtificial IntelligenceWeb query classificationWeb design0202 electrical engineering electronic engineering information engineeringmedicineWeb navigationWeb search queryInformation retrievalHuman–computer interactionApplied MathematicsFuzzy logicSearch enginesWeb search engine020201 artificial intelligence & image processingWeb servicecomputerWeb modelingSoftwareFuzzy C-mean algorithm

description

Abstract In last years, the Internet and the web have been evolved in an astonishing way. Standard web search services play an important role as useful tools for the Internet community even though they suffer from a certain difficulty. The web continues its growth, making the reliability of Internet-based information and retrieval systems more complex. Nevertheless there has been a substantial analysis of the gap between the expected information and the returned information, the work of web search engine is still very hard. There are different problems concerning web searching activity, one among these falls in the query phase. Each engine provide an interface which the user is forced to learn. Often, the searching process returns a huge list of answers that are irrelevant, unavailable, or outdated. The tediosity of querying, due to the fact the queries are too weak to cope with the user’s expressiveness, has stimulated the designers to enrich the human-system interaction with new searching metaphors. One of these is the searching of “similar” pages, as offered by Google, Yahoo and others. The idea is very good, since the similarity gives an easy and intuitive mechanism to express a complex relation. We believe that this approach could become more effective if the user can rely on major flexibility in expressing the similarity dependencies with respect the current and available possibilities. In this paper we introduce a novel method for considering and processing the user-driven similarity during web navigation. We define an extension of fuzzy C-means algorithm, namely proximity fuzzy C-means (P-FCM) incorporating a measure of similarity or dissimilarity as user’s feedback on the clusters. We present the theoretical framework of this extension and then we observe, through a suite of web-based experiments, how significant is the impact of user’s feedback during P-FCM functioning. These observations suggest that the P-FCM approach can offer a relatively simple way of improving the web page classification according with the user interaction with the search engine.

10.1016/j.ijar.2003.07.004http://dx.doi.org/10.1016/j.ijar.2003.07.004