Search results for "TF-IDF"
A Study on Classification Methods Applied to Sentiment Analysis
2013
Sentiment analysis is a new area of research in data mining that concerns the detection of opinions and/or sentiments in texts. This work applies and compares three classification techniques on a text corpus of commercial product reviews in order to detect the opinions expressed about those products. The chosen domain is perfumes, and the user opinions composing the corpus are written in Italian. The proposed approach is completely data-driven: a Term Frequency / Inverse Document Frequency (TF-IDF) term selection procedure has been applied in order to make computation more efficient, to improve the classification results and to manage some issues related to t…
Identifying the k Best Targets for an Advertisement Campaign via Online Social Networks
2020
We propose a novel approach for recommending possible customers (users) to advertisers (e.g., brands) based on two main aspects: (i) the comparison between Online Social Network profiles, and (ii) neighborhood analysis on the Online Social Network. Profile matching between users and brands is based on a bag-of-words representation of textual content from social media, and measures such as Term Frequency-Inverse Document Frequency are used to characterize the importance of words in the comparison. The approach has been implemented on Big Data technologies, which allows efficient analysis of very large Online Social Networks. Resul…
Automatic Text Summarization Using Word Embeddings (Automātiska teksta konspektēšana izmantojot jēdzientelpu)
2016
The amount of information in the world is currently growing enormously, and it is becoming increasingly difficult to keep up with it. The goal of automatic text summarization is to convert a large amount of textual information into a shorter form that preserves the most important information of the original text. One method of automatic summarization is to select the most important sentences from the text. The aim is to select sentences so that the information they contain does not overlap and also covers a sufficient part of the text being summarized. To do this, the similarity of the information contained in the sentences must be compared. Word embeddings are a modern tool that can be used to determine the meaning of words and their similarity to other words…
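For orientation, the TF-IDF weighting that these results were matched on can be sketched in a few lines of Python. This is a minimal illustration, not the implementation used in any of the listed works: the function names `tf_idf` and `cosine`, the toy corpus, and the particular variant (tf as relative frequency, idf as ln(N / df), no smoothing) are all assumptions made for the example.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document.

    Assumed variant: tf = count / document length, idf = ln(N / df).
    Smoothed or log-scaled variants are equally common.
    """
    n_docs = len(docs)
    # Document frequency: in how many documents each term occurs.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        weights.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


def cosine(a, b):
    """Cosine similarity between two sparse {term: weight} vectors."""
    dot = sum(w * b.get(term, 0.0) for term, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical toy corpus of three tokenized reviews.
docs = [
    ["perfume", "great", "scent"],
    ["perfume", "bad", "scent"],
    ["perfume", "great", "price"],
]
weights = tf_idf(docs)
sim_01 = cosine(weights[0], weights[1])  # share the term "scent"
sim_12 = cosine(weights[1], weights[2])  # share only "perfume"
```

In this sketch a term that occurs in every document ("perfume") gets weight zero, so it contributes nothing to the comparison, while terms confined to fewer documents weigh more. That is the sense in which TF-IDF can serve both for term selection and for weighting words when comparing bag-of-words profiles.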