Search results for "Language processing"
showing 10 items of 421 documents
A Study on Classification Methods Applied to Sentiment Analysis
2013
Sentiment analysis is a new area of research in data mining that concerns the detection of opinions and/or sentiments in texts. This work focuses on the application and the comparison of three classification techniques over a text corpus composed of reviews of commercial products in order to detect opinions about them. The chosen domain is about "perfumes", and user opinions composing the corpus are written in Italian language. The proposed approach is completely data-driven: a Term Frequency / Inverse Document Frequency (TFIDF) terms selection procedure has been applied in order to make computation more efficient, to improve the classification results and to manage some issues related to t…
A User-Friendly Interface for Fingerprint Recognition Systems Based on Natural Language Processing
2009
Biometric recognition systems represent a valid solution to the safety problem of internet accessibility, even if they do not always provide an environment easily comprehensible by users and operators with a mid-level of competence. This gap can be partially filled if, instead of using the conventional access routines to the authentication system, the user could simply write to the system through the interface and using high level sentences and requests be able to use its own natural language to reach the intended goal. On the other hand, biometrics features are widely used for recognition and identification all over the world, generating large databases. In this paper a user-friendly inter…
Embedded Knowledge-based Speech Detectors for Real-Time Recognition Tasks
2006
Speech recognition has become common in many application domains, from dictation systems for professional practices to vocal user interfaces for people with disabilities or hands-free system control. However, so far the performance of automatic speech recognition (ASR) systems are comparable to human speech recognition (HSR) only under very strict working conditions, and in general much lower. Incorporating acoustic-phonetic knowledge into ASR design has been proven a viable approach to raise ASR accuracy. Manner of articulation attributes such as vowel, stop, fricative, approximant, nasal, and silence are examples of such knowledge. Neural networks have already been used successfully as de…
Named Entity Recognition and Linking in Tweets Based on Linguistic Similarity
2017
This work proposes a novel approach in Named Entity rEcognition and Linking (NEEL) in tweets, applying the same strategy already presented for Question Answering (QA) by the same authors. The previous work describes a rule-based and ontology-based system that attempts to retrieve the correct answer to a query from the DBPedia ontology through a similarity measure between the query and the ontology labels. In this paper, a tweet is interpreted as a query for the QA system: both the text and the thread of a tweet are a sequence of statements that have been linked to the ontology. Provided that tweets make extensive use of informal language, the similarity measure and the underlying processes …
An Approach to Enhance Chatbot Semantic Power and Maintainability: Experiences within the FRASI Project
2012
The paper illustrates the implementation and semantic enhancement of a domain-oriented Question-Answering system based on a pattern-matching chat bot technology, developed within an industrial project, named FRASI. The main difficulty in building a KB for a chat bot is to handwrite all possible question-answer pairs that constitute the KB. The proposed approach simplifies the chat bot realization thanks to two solutions. The first one uses an ontology, which is exploited in a twofold manner: to construct dynamic answers as a result of an inference process about the domain, and to automatically populate, off-line, the chat bot KB with sentences that can be derived from the ontology, describi…
Intelligent Agents supporting user interactions within self regulated learning processes
2010
The paper focuses on the main advantages in the defnition and utilization of an open and modular e-learning software platform to support highly cognitive tasks performed by the main actors of the learning process. We present in detail the integration inside the platform of two intelligent agents devoted to talking with the student and to retrieving new information sources on the Web. The process is triggered as a reply to the system’s perception that the student feels discontented with the presented contents. The architecture is detailed, and some conclusions about the growth of the platform’s overall performance are expressed.
Fake News Spreaders Detection: Sometimes Attention Is Not All You Need
2022
Guided by a corpus linguistics approach, in this article we present a comparative evaluation of State-of-the-Art (SotA) models, with a special focus on Transformers, to address the task of Fake News Spreaders (i.e., users that share Fake News) detection. First, we explore the reference multilingual dataset for the considered task, exploiting corpus linguistics techniques, such as chi-square test, keywords and Word Sketch. Second, we perform experiments on several models for Natural Language Processing. Third, we perform a comparative evaluation using the most recent Transformer-based models (RoBERTa, DistilBERT, BERT, XLNet, ELECTRA, Longformer) and other deep and non-deep SotA models (CNN,…
Contact between Italian and dialect in Sicily: the case of phrasal verb constructions
2017
The Phrasal Verb Constructions (PVCs) are an interesting example of the intertwining between Italian and dialects of Italy. These constructions are formed by a verbal base (especially of motion), and a locative or direction marking particle and exist in standard Italian as well as in regional varieties of Italian and dialects (i.e. Italian andare via 'go away', mettere giù 'put down'). Because of their progressive diffusion in different varieties of regional Italian, PVCs can be construed as an emerging feature increasingly accepted in regional standards. In this perspective, PVCs are an example of the contact between varieties that contribute to the restandardization of contemporary Italia…
Miten viittomakielen korpusta luodaan ja mihin sitä tarvitaan? Viittomakielten korpukset ja niiden tehtävät
2020
Artikkeli käsittelee suomalaisen ja suomenruotsalaisen viittomakielen korpusten luontia CFINSL-projektissa (Corpus project of Finland’s sign languages, Suomen viittomakielten korpusprojekti). Viittomakielillä ei ole kirjoitettua muotoa, joten korpusten laatiminen vaatii erilaista lähestymistä kuin korpusten luonti sellaisille puhutuille kielille, joilla on kirjoitettu muoto. Artikkelissa kuvataan ne menetelmät, joilla Jyväskylän yliopiston viittomakielen keskuksessa on koottu aineistoa suomalaisen ja suomenruotsalaisen viittomakielen korpukseen. Lisäksi kuvataan korpusaineiston teknistä käsittelyä, annotointia, metatietojen keruuta ja käsittelyä sekä aineiston säilytystä ja tutkijoiden käyt…
Contrasting Automatic and Manual Group Formation: A Case Study in a Software Engineering Postgraduate Course
2021
This paper proposes the comparison of a group formation approach based on an evolutionary algorithm with a manual approach performed by an instructor with ten years of experience on this task. The groups were created based on the professional, psychological, and experience profile of each student. The results obtained demonstrated the algorithm’s potential, reaching an average similarity of \(83.46\%\) with the groups formed manually by the instructor.