Search results for "Knowledge discovery"
showing 10 items of 25 documents
Collective Reasoning over Shared Concepts for the Linguistic Atlas of Sicily
2013
In this chapter, collective intelligence principles are applied in the context of the Linguistic Atlas of Sicily (ALS - Atlante Linguistico Siciliano), an interdisciplinary research project focusing on the study of the Italian language as it is spoken in Sicily, and its correlation with the Sicilian dialect and other regional varieties spoken in Sicily. The project has been developed over the past two decades and includes a complex information system supporting linguistic research; recently it has grown to allow research scientists to cooperate in an integrated environment to produce significant scientific advances in the field of ethnologic and sociolinguistic research. An interoperable infrastruc…
XML-based Knowledge Discovery for Linguistic Atlas of Sicily (ALS) Project
2009
The identification of new useful patterns in data is a core process for intelligent systems. Information overflow is directly related to this problem. In this work we propose a knowledge discovery methodology to retrieve useful and novel information from raw data stored in a DBMS. We used ALSDB, a database built to provide access to structured information obtained from the questionnaires produced in the Linguistic Atlas of Sicily (ALS) project. The ALS project is a decade-long joint effort led by researchers at the Dipartimento di Scienze Filologiche e Linguistiche of the University of Palermo that aims to track and study the geo-linguistic and lexicographic processes ab…
The ALSWEB Framework: A Web-based Framework for the Linguistic Atlas of Sicily Project
2011
In this work the ALSWEB framework is presented. ALSWEB is a virtual laboratory for linguistic research developed as a web application. The purpose of the framework is to model the entire process, covering the steps of data acquisition, data transformation, information acquisition from different data, and verification of research hypotheses in the ALS (Linguistic Atlas of Sicily) project. The nature of the ALS research involves different types of data. The sociolinguistic researcher, who is the main actor of the proposed framework, has to acquire information in many formats: multimedia data, audio data, and textual question-answer data from particular questionnaires. In this wor…
Towards A Twitter Observatory: A Multi-Paradigm Framework For Collecting, Storing And Analysing Tweets
2016
In this article we show how a multi-paradigm framework can fulfil the requirements of tweet analysis and reduce the waiting time for researchers who use computational resources and storage systems to support large-scale data analysis. The originality of our approach is to combine concerns about data harvesting, data storage, data analysis and data visualisation into a framework that supports inductive reasoning in multidisciplinary scientific research. Our main contribution is a polyglot storage system with a generic data model to support logical data independence and a set of tools that can provide a suitable solution for mixing different types of algorithms in or…
Application of a Knowledge Discovery Process to Study Instances of Capacitated Vehicle Routing Problems
2020
Vehicle Routing Problems (VRP) are computationally challenging, constrained optimization problems that play a central role in logistics management. Usually, different solvers are developed and applied for different kinds of problems. However, if descriptive and general features could be extracted to describe such problems and their solution attempts, then one could apply data mining and machine learning methods in order to discover general knowledge on such problems. The aim then would be to improve understanding of the most important characteristics of VRPs from both efficient solution and utilization points of view. The purpose of this article is to address these challenges by proposi…
Discovering knowledge in various applications with a novel hyperspectral imager
2013
Knowledge discovery from physical activity
2017
This master's thesis reviews the Knowledge Discovery in Databases (KDD) process and its potential applications to data related to physical activity. The KDD process consists of many different stages, including preprocessing, data transformation, and data mining. In this thesis, clustering is used as the data mining method and is covered in detail. We also compare a wide set of clustering indices (CVAIs) and their different implementations with k-means clustering, and present the best of these in a more general form. In the empirical part of the thesis, activity data from seventh-grade schoolchildren is studied following the KDD process and utilizing…
Unstable feature relevance in classification tasks
2011
Knowledge discovery using diffusion maps
2013
Automatic knowledge discovery from sparse and large-scale educational data : case Finland
2017
The Finnish educational system has received a lot of attention during the 21st century. In particular, the outstanding results in the first three cycles of the Programme for International Student Assessment (PISA) have made Finland’s education system internationally famous, and its unique characteristics have been under active research by various, predominantly educational, scholars since then. However, despite the availability of real but often sparse big data sets that would allow more evidence-based decision making, existing research to date has mostly concentrated on using classical qualitative and (univariate) quantitative methods. This thesis discusses, in general terms, knowledge discove…