Search results for "Data mining"
showing 10 items of 907 documents
A methodology to generate a synergetic land-cover map by fusion of different land-cover products
2012
Abstract The main goal of this study is to develop a general framework for building a hybrid land-cover map by the synergistic combination of a number of land-cover classifications with different legends and spatial resolutions. The proposed approach assesses class-specific accuracies of datasets and establishes affinity between thematic legends using a common land-cover language such as the UN Land-Cover Classification System (LCCS). The approach is illustrated over a large region in Europe using four land-cover datasets (CORINE, GLC2000, MODIS and GlobCover), but it can be applied to any set of existing products. The multi-classification map is expected to improve the performance of indiv…
A New Dissimilarity Measure for Clustering Seismic Signals
2011
Hypocenter and focal mechanism of an earthquake can be determined by the analysis of signals, named waveforms, related to the wave field produced and recorded by a seismic network. Assuming that waveform similarity implies the similarity of focal parameters, the analysis of those signals characterized by very similar shapes can be used to give important details about the physical phenomena which have generated an earthquake. Recent works have shown the effectiveness of cross-correlation and/or cross-spectral dissimilarities to identify clusters of seismic events. In this work we propose a new dissimilarity measure between seismic signals whose reliability has been tested on real seismic dat…
Discovering temporal relationships in databases of newspapers
1998
This paper is mainly dedicated to analyse the problem of discovering frequent temporal patterns in event sequences extracted from a large repository of newspapers. The proposed formalism and algorithms rely on Toodor, which is a document retrieval system that allows users to specify conditions over the structure, contents and temporal features of the stored documents. We develop in this work several algorithms for recognising frequent temporal patterns in terms of arc-consistency, which consist of discarding temporal occurrences that do not satisfy a temporal structure.
Flood frequency analysis for an urban watershed: comparison between several statistical methodologies simulating synthetic rainfall events
2016
To obtain the flooding frequency distribution for an urban watershed, different methods based on simulations of synthetic rainfall events were compared with an empirical analysis of the flooding data and with the results of long-term simulations. A copula-based multivariate statistical analysis of the main hydrological variables was proposed to generate synthetic hyetographs. Two different approaches were adopted to assess a temporal pattern to the synthetic rainfall: one analyses all available historical rainfall patterns, and another adopts the cluster analysis in three different variants to reduce the computational effort of the analysis. To test the methodology reliability, the analysis…
Comparing Spatial and Spatio-temporal FPCA to Impute Large Continuous Gaps in Space
2018
Multivariate spatio-temporal data analysis methods usually assume fairly complete data, while a number of gaps often occur along time or in space. In air quality data long gaps may be due to instrument malfunctions; moreover, not all the pollutants of interest are measured in all the monitoring stations of a network. In literature, many statistical methods have been proposed for imputing short sequences of missing values, but most of them are not valid when the fraction of missing values is high. Furthermore, the limitation of the methods commonly used consists in exploiting temporal only, or spatial only, correlation of the data. The objective of this paper is to provide an approach based …
A functional approach to monitor and recognize patterns of daily traffic profiles
2014
Functional Data Analysis (FDA) is a collection of statistical techniques for the analysis of information on curves or functions. This paper presents a new methodology for analyzing the daily traffic flow profiles based on the employment of FDA. A daily traffic profile corresponds to a single datum rather than a large set of traffic counts. This insight provides ideal information for strategic decision-making regarding road expansion, control, and other long-term decisions. Using Functional Principal Component Analysis the data are projected into a low dimensional space: the space of the first functional principal components. Each curve is represented by their vector of scores on this basis.…
A method for detecting malfunctions in PV solar panels based on electricity production monitoring
2017
In this paper a new method is developed for automatically detecting outliers or faults in the solar energy production of identical sets (sister arrays) of photovoltaic (PV) solar panels. The method involves a two-stage unsupervised approach. In the first stage, "in control" energy production data are created by using outlier detection methods and functional principal component analysis in order to remove global and local outliers from the data set. In the second stage, control charts for the "in control" data are constructed using both a parametric method and three non-parametric methods. The control charts can be used to detect outliers or faults in the production data in real-time or at t…
Functional principal component analysis for multivariate multidimensional environmental data
2015
Data with spatio-temporal structure can arise in many contexts, therefore a considerable interest in modelling these data has been generated, but the complexity of spatio-temporal models, together with the size of the dataset, results in a challenging task. The modelization is even more complex in presence of multivariate data. Since some modelling problems are more natural to think through in functional terms, even if only a finite number of observations is available, treating the data as functional can be useful (Berrendero et al. in Comput Stat Data Anal 55:2619–2634, 2011). Although in Ramsay and Silverman (Functional data analysis, 2nd edn. Springer, New York, 2005) the case of multiva…
Fuzzy methods for analysing fuzzy production environment
1998
Abstract Very recently, in production management research literature, the necessity to extend production systems analysis techniques, such as queue theory, Mean Value Analysis (MVA) and discrete simulation, to Fuzzy Production Environments, i.e. to those production situations in which data are vague, has emerged. Fuzzy set theory is a powerful tool to model vagueness and, therefore, fuzzy mathematics can be used to extend classical production system analysis techniques. This paper proposes a methodology based on fuzzy relation algebra to extend classical MVA and discrete event simulation.
Fuzzy Classifier Based on Fuzzy Decision Tree
2007
A popular method for making a fuzzy decision tree for classification is Fuzzy ID3 algorithm. We introduce a new approach that uses cumulative information estimations of initial data. Based on these estimations we propose a new greedy version of fuzzy ID3 algorithm to be used to generate understandable fuzzy classification rules. The goal is to find a sequence of rules that causes near minimal classification costs.