Search results for "Louhi"
showing 10 items of 96 documents
Tekstinlouhinta semanttisen webin metatietojen tuottamisessa
2010
Koivunen, Juuso Oskari Tekstinlouhinta semanttisen webin metatietojen tuottamisessa / Juuso Koivunen Jyväskylä: Jyväskylän Yliopisto, 2010. 26s. Kandidaatintutkielma Tässä tutkielmassa selvitetään kirjallisuuskatsauksen avulla tekstinlouhintajär-jestelmien toimintaa ja mitä haasteita ne kohtaavat. Aineistona on pääasiassa 2000 -luvulla julkaistuja tieteellisiä artikkeleita, konferenssijulkaisuja ja teknis-ten standardien dokumentaatioita. Aihetta on tutkittu huomattavasti, sillä tuo-reita lähteitä löytyy paljon. Semanttisen webin kehityksen ja yleistymisen myötä metatietojen automaatti-nen tuottaminen on ajankohtainen tutkimusalue. Semanttisessa webissä tar-peelliset metatiedot on luotu aie…
Ontologioiden oppiminen tekstistä
2008
Evolutionary cloud for cooperative UAV coordination
2014
Intrusion detection applications using knowledge discovery and data mining
2014
Improving Scalable K-Means++
2021
Two new initialization methods for K-means clustering are proposed. Both proposals are based on applying a divide-and-conquer approach for the K-means‖ type of an initialization strategy. The second proposal also uses multiple lower-dimensional subspaces produced by the random projection method for the initialization. The proposed methods are scalable and can be run in parallel, which make them suitable for initializing large-scale problems. In the experiments, comparison of the proposed methods to the K-means++ and K-means‖ methods is conducted using an extensive set of reference and synthetic large-scale datasets. Concerning the latter, a novel high-dimensional clustering data generation …
Improvements and applications of the elements of prototype-based clustering
2018
Clustering or cluster analysis is an essential part of data mining, machine learning, and pattern recognition. The most popularly applied clustering methods are partitioning-based or prototype-based methods. Prototype-based clustering methods usually have easy implementability and good scalability. These methods, such as K-means clustering, have been used for different applications in various fields. On the other hand, prototype-based clustering methods are typically sensitive to initialization, and the selection of the number of clusters for knowledge discovery purposes is not straightforward. In the era of big data, in high-velocity, ever-growing datasets, which can also be erroneous, outl…
Automatic Taxonomy Induction based on Word-embedding of Neural Nets
2018
Taxonomy is a knowledge management tool that presents useful information in a well-ordered structure prevents overloading of information on its access and making the information access qualitative. This article is concerned with automatically extracting asymmetrical hierarchical relations from a large corpus and subsequent taxonomy construction by domain independent and semi-supervised system. The methodology relies on the term’s distributional semantics. The algorithm utilizes the word-embedding generated from the vector space model. The model is trained over a large corpus to generate word-embedding of each word in a corpus. Then, the system finds and extracts the hypernyms by using the g…
Detecting cellular network anomalies using the knowledge discovery process
2015
Analytical companies unanimously forecast the exponential growth of mobile traffic consumption over the next five years. The densification of a network structure with small cells is regarded as a key solution to meet growing capacity demands. The manual management of a multi-layer network is a very expensive, error prone, and sluggish process. Hence, the automation of the whole life cycle of network operation is highly anticipated. To this aim 3GPP introduces a self-management concept referred to as SON. It is envisioned that SON updates information concerning the latest network conditions through the MDT mecha- nism. MDT enables a network operator to collect radio and service quality measurem…
Advanced performance monitoring for self-healing cellular mobile networks
2015
This dissertation is devoted to development and validation of advanced per- formance monitoring system for existing and future cellular mobile networks. Knowledge mining techniques are employed for analysis of user specific logs, collected with Minimization of Drive Tests (MDT) functionality. Ever increas- ing quality requirements, expansion of the mobile networks and their extend- ing heterogeneity, call for effective automatic means of performance monitoring. Nowadays, network operation is mostly controlled manually through aggregated key performance indicators and statistical profiles. These methods are are not able to fully address the dynamism and complexity of modern mobile networks. Se…
Intelligent solutions for real-life data-driven applications
2017
The subject of this thesis belongs to the topic of machine learning or, specifically, to the development of advanced methods for regression analysis, clustering, and anomaly detection. Industry is constantly seeking improved production practices and minimized production time and costs. In connection to this, several industrial case studies are presented in which mathematical models for predicting paper quality were proposed. The most important variables for the prediction models are selected based on information-theoretic measures and regression trees approach. The rest of the original papers are devoted to unsupervised machine learning. The main focus is developing advanced spectral cluster…