Search results for "mining"
showing 10 items of 1730 documents
A functional approach to monitor and recognize patterns of daily traffic profiles
2014
Functional Data Analysis (FDA) is a collection of statistical techniques for the analysis of information on curves or functions. This paper presents a new methodology for analyzing the daily traffic flow profiles based on the employment of FDA. A daily traffic profile corresponds to a single datum rather than a large set of traffic counts. This insight provides ideal information for strategic decision-making regarding road expansion, control, and other long-term decisions. Using Functional Principal Component Analysis the data are projected into a low dimensional space: the space of the first functional principal components. Each curve is represented by their vector of scores on this basis.…
A method for detecting malfunctions in PV solar panels based on electricity production monitoring
2017
In this paper a new method is developed for automatically detecting outliers or faults in the solar energy production of identical sets (sister arrays) of photovoltaic (PV) solar panels. The method involves a two-stage unsupervised approach. In the first stage, "in control" energy production data are created by using outlier detection methods and functional principal component analysis in order to remove global and local outliers from the data set. In the second stage, control charts for the "in control" data are constructed using both a parametric method and three non-parametric methods. The control charts can be used to detect outliers or faults in the production data in real-time or at t…
Functional principal component analysis for multivariate multidimensional environmental data
2015
Data with spatio-temporal structure can arise in many contexts, therefore a considerable interest in modelling these data has been generated, but the complexity of spatio-temporal models, together with the size of the dataset, results in a challenging task. The modelization is even more complex in presence of multivariate data. Since some modelling problems are more natural to think through in functional terms, even if only a finite number of observations is available, treating the data as functional can be useful (Berrendero et al. in Comput Stat Data Anal 55:2619–2634, 2011). Although in Ramsay and Silverman (Functional data analysis, 2nd edn. Springer, New York, 2005) the case of multiva…
Efficacy of Zero-Profile Device versus Plate and Cage Implant for Treatment of Symptomatic Adjacent Segment Disease After Anterior Cervical Diskectom…
2018
Fuzzy methods for analysing fuzzy production environment
1998
Abstract Very recently, in production management research literature, the necessity to extend production systems analysis techniques, such as queue theory, Mean Value Analysis (MVA) and discrete simulation, to Fuzzy Production Environments, i.e. to those production situations in which data are vague, has emerged. Fuzzy set theory is a powerful tool to model vagueness and, therefore, fuzzy mathematics can be used to extend classical production system analysis techniques. This paper proposes a methodology based on fuzzy relation algebra to extend classical MVA and discrete event simulation.
Fuzzy Classifier Based on Fuzzy Decision Tree
2007
A popular method for making a fuzzy decision tree for classification is Fuzzy ID3 algorithm. We introduce a new approach that uses cumulative information estimations of initial data. Based on these estimations we propose a new greedy version of fuzzy ID3 algorithm to be used to generate understandable fuzzy classification rules. The goal is to find a sequence of rules that causes near minimal classification costs.
Combining one class fuzzy KNN’s
2007
This paper introduces a parallel combination of N > 2 one class fuzzy KNN (FKNN) classifiers. The classifier combination consists of a new optimization procedure based on a genetic algorithm applied to FKNN’s, that differ in the kind of similarity used. We tested the integration techniques in the case of N = 5 similarities that have been recently introduced to face with categorical data sets. The assessment of the method has been carried out on two public data set, the Masquerading User Data (www.schonlau.net) and the badges database on the UCI Machine Learning Repository (http://www.ics.uci.edu/~mlearn/). Preliminary results show the better performance obtained by the fuzzy integration …
A Combined Fuzzy and Probabilistic Data Descriptor for Distributed CBIR
2009
With the wide diffusion of digital image acquisition devices, the cost of managing hundreds of digital images is quickly increasing. Currently, the main way to search digital image libraries is by keywords given by the user. However, users usually add ambiguos keywords for large set of images. A content-based system intended to automatically find a query image, or similar images, within the whole collection is needed. In our work we address the scenario where medical image collections, which nowadays are rapidly expanding in quantity and heterogeneity, are shared in a distributed system to support diagnostic and preventive medicine. Our goal is to produce an efficient content-based descript…
Unsupervised tissue classification of brain MR images for voxel-based morphometry analysis
2016
In this article, a fully unsupervised method for brain tissue segmentation of T1-weighted MRI 3D volumes is proposed. The method uses the Fuzzy C-Means (FCM) clustering algorithm and a Fully Connected Cascade Neural Network (FCCNN) classifier. Traditional manual segmentation methods require neuro-radiological expertise and significant time while semiautomatic methods depend on parameter's setup and trial-and-error methodologies that may lead to high intraoperator/interoperator variability. The proposed method selects the most useful MRI data according to FCM fuzziness values and trains the FCCNN to learn to classify brain’ tissues into White Matter, Gray Matter, and Cerebro-Spinal Fluid in …
Distance-constrained data clustering by combined k-means algorithms and opinion dynamics filters
2014
Data clustering algorithms represent mechanisms for partitioning huge arrays of multidimensional data into groups with small in–group and large out–group distances. Most of the existing algorithms fail when a lower bound for the distance among cluster centroids is specified, while this type of constraint can be of help in obtaining a better clustering. Traditional approaches require that the desired number of clusters are specified a priori, which requires either a subjective decision or global meta–information knowledge that is not easily obtainable. In this paper, an extension of the standard data clustering problem is addressed, including additional constraints on the cluster centroid di…