Search results for "Louhi"

showing 10 items of 96 documents

Tekstinlouhinta semanttisen webin metatietojen tuottamisessa

2010

Koivunen, Juuso Oskari Tekstinlouhinta semanttisen webin metatietojen tuottamisessa / Juuso Koivunen Jyväskylä: Jyväskylän Yliopisto, 2010. 26s. Kandidaatintutkielma Tässä tutkielmassa selvitetään kirjallisuuskatsauksen avulla tekstinlouhintajär-jestelmien toimintaa ja mitä haasteita ne kohtaavat. Aineistona on pääasiassa 2000 -luvulla julkaistuja tieteellisiä artikkeleita, konferenssijulkaisuja ja teknis-ten standardien dokumentaatioita. Aihetta on tutkittu huomattavasti, sillä tuo-reita lähteitä löytyy paljon. Semanttisen webin kehityksen ja yleistymisen myötä metatietojen automaatti-nen tuottaminen on ajankohtainen tutkimusalue. Semanttisessa webissä tar-peelliset metatiedot on luotu aie…

ontologiattekstinlouhintametatietosemanttinen web

researchProduct

Ontologioiden oppiminen tekstistä

2008

oppiminenontologiattiedonlouhinta

researchProduct

Evolutionary cloud for cooperative UAV coordination

2014

pilvipalvelutkoordinointiälytekniikkaevoluutiolaskentamiehittämättömät ilma-aluksetsemanttinen webtiedonlouhintaturvallisuustekniikka

researchProduct

Intrusion detection applications using knowledge discovery and data mining

2014

pääsynvalvontaintrusion detectionknowledge discoverydata miningvalvontajärjestelmätanomaly detectionbig dataalgoritmitklusterianalyysitietoturvatiedonlouhintakyberturvallisuusverkkohyökkäyksetdimensionality reductionclustering

researchProduct

Improving Scalable K-Means++

2021

Two new initialization methods for K-means clustering are proposed. Both proposals are based on applying a divide-and-conquer approach for the K-means‖ type of an initialization strategy. The second proposal also uses multiple lower-dimensional subspaces produced by the random projection method for the initialization. The proposed methods are scalable and can be run in parallel, which make them suitable for initializing large-scale problems. In the experiments, comparison of the proposed methods to the K-means++ and K-means‖ methods is conducted using an extensive set of reference and synthetic large-scale datasets. Concerning the latter, a novel high-dimensional clustering data generation …

random projectionlcsh:T55.4-60.8K-means++algoritmitclustering initializationalgoritmiikkalcsh:Industrial engineering. Management engineeringklusterianalyysilcsh:Electronic computers. Computer sciencetiedonlouhintaK-means‖lcsh:QA75.5-76.95

researchProduct

Improvements and applications of the elements of prototype-based clustering

2018

Clustering or cluster analysis is an essential part of data mining, machine learning, and pattern recognition. The most popularly applied clustering methods are partitioning-based or prototype-based methods. Prototype-based clustering methods usually have easy implementability and good scalability. These methods, such as K-means clustering, have been used for different applications in various ﬁelds. On the other hand, prototype-based clustering methods are typically sensitive to initialization, and the selection of the number of clusters for knowledge discovery purposes is not straightforward. In the era of big data, in high-velocity, ever-growing datasets, which can also be erroneous, outl…

random projectionparallel computingknowledge discoveryclustering initializationminimal learning machinedata miningprototype-based clusteringmachine learningkoneoppiminenbig datarinnakkaiskäsittelyklusterianalyysitiedonlouhintarobust clusteringK-means

researchProduct

Automatic Taxonomy Induction based on Word-embedding of Neural Nets

2018

Taxonomy is a knowledge management tool that presents useful information in a well-ordered structure prevents overloading of information on its access and making the information access qualitative. This article is concerned with automatically extracting asymmetrical hierarchical relations from a large corpus and subsequent taxonomy construction by domain independent and semi-supervised system. The methodology relies on the term’s distributional semantics. The algorithm utilizes the word-embedding generated from the vector space model. The model is trained over a large corpus to generate word-embedding of each word in a corpus. Then, the system finds and extracts the hypernyms by using the g…

sanasemantiikkatekstinlouhintataxonomy inductionneuroverkottiedonlouhintaword-embeddinghyponym-hypernym relations

researchProduct

Detecting cellular network anomalies using the knowledge discovery process

2015

Analytical companies unanimously forecast the exponential growth of mobile trafﬁc consumption over the next ﬁve years. The densiﬁcation of a network structure with small cells is regarded as a key solution to meet growing capacity demands. The manual management of a multi-layer network is a very expensive, error prone, and sluggish process. Hence, the automation of the whole life cycle of network operation is highly anticipated. To this aim 3GPP introduces a self-management concept referred to as SON. It is envisioned that SON updates information concerning the latest network conditions through the MDT mecha- nism. MDT enables a network operator to collect radio and service quality measurem…

self-healKDDtoimintahäiriötviatrakenteettomat verkotdata miningtietoliikenneverkotmatkaviestinverkotradio networksanomaly detectionself-organizing networksLTEMDTcell outagehäiriötradioverkot3G-tekniikkasimulointitiedonlouhintalangattomat verkot

researchProduct

Advanced performance monitoring for self-healing cellular mobile networks

2015

This dissertation is devoted to development and validation of advanced per- formance monitoring system for existing and future cellular mobile networks. Knowledge mining techniques are employed for analysis of user speciﬁc logs, collected with Minimization of Drive Tests (MDT) functionality. Ever increas- ing quality requirements, expansion of the mobile networks and their extend- ing heterogeneity, call for effective automatic means of performance monitoring. Nowadays, network operation is mostly controlled manually through aggregated key performance indicators and statistical proﬁles. These methods are are not able to fully address the dynamism and complexity of modern mobile networks. Se…

sleeping cellsekvensointitoimintahäiriötsequence-based analysisrakenteettomat verkotmonitorointidata miningtietoliikenneverkotmatkaviestinverkotanomaly detectionself-organizing networkshäiriötperformance monitoringtiedonlouhintacellular mobile networksquality and performance managementknowledge mining

researchProduct

Intelligent solutions for real-life data-driven applications

2017

The subject of this thesis belongs to the topic of machine learning or, speciﬁcally, to the development of advanced methods for regression analysis, clustering, and anomaly detection. Industry is constantly seeking improved production practices and minimized production time and costs. In connection to this, several industrial case studies are presented in which mathematical models for predicting paper quality were proposed. The most important variables for the prediction models are selected based on information-theoretic measures and regression trees approach. The rest of the original papers are devoted to unsupervised machine learning. The main focus is developing advanced spectral cluster…

spectral clusteringregression treesanomaly detectionregression analysislaadunvalvontaregressioanalyysikoneoppiminenpaper machinebig datagraph segmentationcommunity detectionnetwork securityklusterianalyysitiedonlouhintatietoturvamutual informationpaperikoneetclusteringvariable selection

researchProduct