Search results for "louhinta"
showing 10 items of 93 documents
Automatic knowledge discovery from sparse and large-scale educational data : case Finland
2017
The Finnish educational system has received a lot of attention during the 21st century. In particular, the outstanding results in the first three cycles of the Programme for International Student Assessment (PISA) have made Finland’s education system internationally famous, and its unique characteristics have since been actively studied by various scholars, predominantly from the field of education. However, despite the availability of real but often sparse big data sets that would allow more evidence-based decision making, existing research to date has mostly concentrated on using classical qualitative and (univariate) quantitative methods. This thesis discusses, in general terms, knowledge discove…
Semantics of Voids within Data: Ignorance-Aware Machine Learning
2021
Operating with ignorance is an important concern of geographical information science when the objective is to discover knowledge from imperfect spatial data. Data mining (driven by knowledge discovery tools) is about processing available (observed, known, and understood) samples of data, aiming to build a model (e.g., a classifier) to handle data samples that are not yet observed, known, or understood. These tools traditionally take semantically labeled samples of the available data (known facts) as an input for learning. We want to challenge the indispensability of this approach, and we suggest considering things the other way around. What if the task were as follows: how to buil…
Multitask deep learning for native language identification
2020
Identifying the native language of a person by their text written in English (L1 identification) plays an important role in such tasks as authorship profiling and identification. With the current proliferation of misinformation in social media, these methods are especially topical. Most studies in this field have focused on the development of supervised classification algorithms that are trained on a single L1 dataset. Although multiple labeled datasets are available for L1 identification, they contain texts authored by speakers of different languages and do not completely overlap. Current approaches achieve high accuracy on available datasets, but this is attained by training an individua…
Finnish attitudes toward mining : citizen survey - 2016 results
2017
Detector-based visual analysis of time-series data
2015
Information Extraction from Binary Skill Assessment Data with Machine Learning
2021
Strength training exercises are essential in rehabilitation, in improving health, and in sports. For optimal and safe training, educators and trainers in the industry should comprehend exercise form or technique. Currently, there is a lack of tools measuring in-depth skills of strength training experts. In this study, we investigate how data mining methods can be used to identify novel and useful skill patterns from a binary multiple-choice questionnaire test designed to measure the knowledge level of strength training experts. A skill test assessing exercise technique expertise and comprehension was answered by 507 fitness professionals with varying backgrounds. A triangulated appr…
Sääntöpohjaiset tiedonlouhintamenetelmät ohjelmistojen ymmärtämisen tukena (Rule-based data mining methods in support of software comprehension)
2013
With the rapid development of technology, the amount of data in digital form is growing everywhere. As data stores grow, the share of useful data relative to the total amount stored is very small, and finding important information is challenging. Data mining techniques offer a solution to this problem. The aim of data mining is to find new results and perspectives in a data set for the specific problem at hand. The thesis focuses on mining software data, which can yield useful information about the phases of a software project, the errors that occur in it, and their prevention. Rapidly expanding and evolving technology makes digitally stored …
Tekstinlouhinta semanttisen webin metatietojen tuottamisessa (Text mining in the production of semantic web metadata)
2010
Koivunen, Juuso Oskari. Tekstinlouhinta semanttisen webin metatietojen tuottamisessa / Juuso Koivunen. Jyväskylä: Jyväskylän Yliopisto, 2010. 26 p. Bachelor's thesis. This thesis uses a literature review to examine how text mining systems work and what challenges they face. The source material consists mainly of scientific articles, conference publications, and documentation of technical standards published in the 2000s. The topic has been studied extensively, as recent sources are plentiful. With the development and spread of the semantic web, the automatic production of metadata is a topical research area. In the semantic web, the necessary metadata has been created …