Search results for "tf–idf"

Showing 2 of 2 documents

Document Word Clouds: Visualising Web Documents as Tag Clouds to Aid Users in Relevance Decisions

2009

Contains the full text

Information Retrieval systems spend great effort on determining the significant terms in a document. When, instead, a user is looking at a document, he cannot benefit from such information. He has to read the text to understand which words are important. In this paper we take a look at the idea of enhancing the perception of web documents with visualisation techniques borrowed from the tag clouds of Web 2.0. Highlighting the important words in a document by using a larger font size gives the reader a quick impression of the relevant concepts in a text. As this process does not depend on a user query, it can also be used for explorative search. A user study showed, th…

Information retrieval; Process (engineering); Computer science; Document clustering; User requirements document; World Wide Web; Perception; Relevance (information retrieval); Tag cloud; tf–idf; Technical services in libraries, archives and museums; Word (computer architecture)
researchProduct
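
The idea in the abstract above, drawing important words in a larger font, can be sketched in a few lines. This is only an illustration under stated assumptions: the abstract does not say how term importance is computed, so the sketch assumes a smoothed tf–idf weight, and the function names `tfidf_weights` and `font_sizes`, the point-size range, and the toy corpus are all hypothetical.

```python
import math
from collections import Counter


def tfidf_weights(doc_tokens, corpus_tokens):
    """Return {term: tf * idf} for one tokenised document.

    Assumption: importance is a smoothed tf-idf weight; the paper's
    actual weighting scheme is not given in the abstract.
    """
    n_docs = len(corpus_tokens)
    counts = Counter(doc_tokens)
    weights = {}
    for term, count in counts.items():
        # Document frequency: number of corpus documents containing the term.
        df = sum(1 for d in corpus_tokens if term in d)
        idf = math.log((1 + n_docs) / (1 + df)) + 1  # smoothed idf
        weights[term] = (count / len(doc_tokens)) * idf
    return weights


def font_sizes(weights, min_pt=10, max_pt=32):
    """Map weights linearly onto a font-size range (hypothetical defaults)."""
    if not weights:
        return {}
    lo, hi = min(weights.values()), max(weights.values())
    span = (hi - lo) or 1.0  # avoid division by zero when all weights are equal
    return {
        term: round(min_pt + (w - lo) / span * (max_pt - min_pt))
        for term, w in weights.items()
    }


# Toy corpus, made up for illustration only.
corpus = [["tag", "cloud", "cloud", "web"], ["web", "search"], ["web", "query"]]
sizes = font_sizes(tfidf_weights(corpus[0], corpus))
```

Here "web" occurs in every document and so receives the smallest size, while "cloud" is both frequent in the first document and rare elsewhere, so it receives the largest.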

A Study on Classification Methods Applied to Sentiment Analysis

2013

Sentiment analysis is a new area of research in data mining that concerns the detection of opinions and/or sentiments in texts. This work focuses on the application and comparison of three classification techniques over a text corpus composed of reviews of commercial products, in order to detect opinions about them. The chosen domain is "perfumes", and the user opinions composing the corpus are written in Italian. The proposed approach is completely data-driven: a Term Frequency / Inverse Document Frequency (TFIDF) term selection procedure has been applied in order to make computation more efficient, to improve the classification results and to manage some issues related to t…

Settore ING-INF/05 - Sistemi Di Elaborazione Delle Informazioni; Text corpus; Naive Bayes classifier; Computer science; Sentiment analysis; Sentiment Classification; Class Association Rules; Domain (software engineering); Random indexing; Artificial intelligence; Selection (linguistics); One-class classification; tf–idf; Natural language processing
researchProduct
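
A TFIDF-based term selection step of the kind mentioned in the abstract above might look like the following sketch. The abstract does not give the selection criterion, so this assumes one simple option: rank terms by their summed tf–idf score over the corpus and keep the top `k`. The names `select_terms` and `project` and the three toy Italian reviews are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def select_terms(docs, k):
    """Return the k terms with the highest summed tf-idf score.

    Assumption: summed tf-idf is the ranking criterion; the paper's
    actual procedure is not described in the abstract.
    """
    n_docs = len(docs)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for doc in docs for term in set(doc))
    scores = Counter()
    for doc in docs:
        if not doc:
            continue
        for term, count in Counter(doc).items():
            idf = math.log(n_docs / df[term])
            scores[term] += (count / len(doc)) * idf
    # Sort by descending score, breaking ties alphabetically.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:k]]


def project(doc, vocabulary):
    """Drop every token that is not in the selected vocabulary."""
    keep = set(vocabulary)
    return [t for t in doc if t in keep]


# Toy reviews, made up for illustration only.
docs = [
    ["profumo", "ottimo", "profumo"],
    ["profumo", "pessimo"],
    ["profumo", "ottimo", "fresco"],
]
vocabulary = select_terms(docs, 2)
reduced = [project(doc, vocabulary) for doc in docs]
```

Because "profumo" appears in every review, its idf is zero and it is ranked last, which is the intended effect: terms that do not discriminate between documents are dropped before classification, shrinking the feature space.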