Search results for "Data mining"

showing 10 items of 907 documents

A methodology to generate a synergetic land-cover map by fusion of different land-cover products

2012

Abstract The main goal of this study is to develop a general framework for building a hybrid land-cover map by the synergistic combination of a number of land-cover classifications with different legends and spatial resolutions. The proposed approach assesses class-specific accuracies of datasets and establishes affinity between thematic legends using a common land-cover language such as the UN Land-Cover Classification System (LCCS). The approach is illustrated over a large region in Europe using four land-cover datasets (CORINE, GLC2000, MODIS and GlobCover), but it can be applied to any set of existing products. The multi-classification map is expected to improve the performance of indiv…

Flexibility (engineering)Global and Planetary ChangePixelLand coverManagement Monitoring Policy and LawSynergistic combinationcomputer.software_genreSet (abstract data type)Thematic mapGeographyData miningComputers in Earth SciencesCartographycomputerEarth-Surface ProcessesInternational Journal of Applied Earth Observation and Geoinformation
researchProduct

A New Dissimilarity Measure for Clustering Seismic Signals

2011

Hypocenter and focal mechanism of an earthquake can be determined by the analysis of signals, named waveforms, related to the wave field produced and recorded by a seismic network. Assuming that waveform similarity implies the similarity of focal parameters, the analysis of those signals characterized by very similar shapes can be used to give important details about the physical phenomena which have generated an earthquake. Recent works have shown the effectiveness of cross-correlation and/or cross-spectral dissimilarities to identify clusters of seismic events. In this work we propose a new dissimilarity measure between seismic signals whose reliability has been tested on real seismic dat…

Focal mechanismSimilarity (geometry)Cross-correlationHypocenterSettore INF/01 - InformaticaComputer sciencebusiness.industryHomogeneity (statistics)Pattern recognitioncomputer.software_genreMeasure (mathematics)Physics::GeophysicsSettore GEO/11 - Geofisica ApplicataWaveformArtificial intelligenceData miningbusinessCluster analysiscomputerDissimilarity measure Clustering Seismic Signals
researchProduct

Discovering temporal relationships in databases of newspapers

1998

This paper is mainly dedicated to analyse the problem of discovering frequent temporal patterns in event sequences extracted from a large repository of newspapers. The proposed formalism and algorithms rely on Toodor, which is a document retrieval system that allows users to specify conditions over the structure, contents and temporal features of the stored documents. We develop in this work several algorithms for recognising frequent temporal patterns in terms of arc-consistency, which consist of discarding temporal occurrences that do not satisfy a temporal structure.

Formalism (philosophy of mathematics)Information retrievalComputer scienceInformation processingData miningDocument retrievalConstraint satisfactionQuery languagecomputer.software_genrecomputerNewspaper
researchProduct

Flood frequency analysis for an urban watershed: comparison between several statistical methodologies simulating synthetic rainfall events

2016

To obtain the flooding frequency distribution for an urban watershed, different methods based on simulations of synthetic rainfall events were compared with an empirical analysis of the flooding data and with the results of long-term simulations. A copula-based multivariate statistical analysis of the main hydrological variables was proposed to generate synthetic hyetographs. Two different approaches were adopted to assess a temporal pattern to the synthetic rainfall: one analyses all available historical rainfall patterns, and another adopts the cluster analysis in three different variants to reduce the computational effort of the analysis. To test the methodology reliability, the analysis…

Frequency analysisEnvironmental EngineeringMultivariate analysisWatershedFlood frequency analysis0208 environmental biotechnologyGeography Planning and Development02 engineering and technologycomputer.software_genre020801 environmental engineeringFlooding (computer networking)Copula (probability theory)law.inventionlawStatisticsEnvironmental scienceData miningDrainageSafety Risk Reliability and QualityCluster analysiscomputerWater Science and TechnologyJournal of Flood Risk Management
researchProduct

Comparing Spatial and Spatio-temporal FPCA to Impute Large Continuous Gaps in Space

2018

Multivariate spatio-temporal data analysis methods usually assume fairly complete data, while a number of gaps often occur along time or in space. In air quality data long gaps may be due to instrument malfunctions; moreover, not all the pollutants of interest are measured in all the monitoring stations of a network. In literature, many statistical methods have been proposed for imputing short sequences of missing values, but most of them are not valid when the fraction of missing values is high. Furthermore, the limitation of the methods commonly used consists in exploiting temporal only, or spatial only, correlation of the data. The objective of this paper is to provide an approach based …

Functional principal component analysisComplete dataMultivariate statisticsLong gapComputer sciencecomputer.software_genreMissing dataCorrelationFDA FPCA GAM P-splinesData analysisData miningImputation (statistics)Settore SECS-S/01 - Statisticacomputer
researchProduct

A functional approach to monitor and recognize patterns of daily traffic profiles

2014

Functional Data Analysis (FDA) is a collection of statistical techniques for the analysis of information on curves or functions. This paper presents a new methodology for analyzing the daily traffic flow profiles based on the employment of FDA. A daily traffic profile corresponds to a single datum rather than a large set of traffic counts. This insight provides ideal information for strategic decision-making regarding road expansion, control, and other long-term decisions. Using Functional Principal Component Analysis the data are projected into a low dimensional space: the space of the first functional principal components. Each curve is represented by their vector of scores on this basis.…

Functional principal component analysisEngineeringbusiness.industryFunctional data analysisPoison controlFunctional approachTransportationManagement Science and Operations ResearchTraffic flowcomputer.software_genreTransport engineeringPrincipal component analysisOutlierData miningbusinessCluster analysiscomputerCivil and Structural EngineeringTransportation Research Part B: Methodological
researchProduct

A method for detecting malfunctions in PV solar panels based on electricity production monitoring

2017

In this paper a new method is developed for automatically detecting outliers or faults in the solar energy production of identical sets (sister arrays) of photovoltaic (PV) solar panels. The method involves a two-stage unsupervised approach. In the first stage, "in control" energy production data are created by using outlier detection methods and functional principal component analysis in order to remove global and local outliers from the data set. In the second stage, control charts for the "in control" data are constructed using both a parametric method and three non-parametric methods. The control charts can be used to detect outliers or faults in the production data in real-time or at t…

Functional principal component analysisRenewable Energy Sustainability and the EnvironmentComputer sciencebusiness.industryElectricity production monitoring020209 energyPhotovoltaic systemStatistical quality control02 engineering and technologyAnomaly detectionStatistical process controlcomputer.software_genreSolar energyOutlier0202 electrical engineering electronic engineering information engineeringGeneral Materials ScienceControl chartAnomaly detectionData miningbusinesscomputerPhotovoltaic solar panel sister arraysParametric statistics
researchProduct

Functional principal component analysis for multivariate multidimensional environmental data

2015

Data with spatio-temporal structure can arise in many contexts, therefore a considerable interest in modelling these data has been generated, but the complexity of spatio-temporal models, together with the size of the dataset, results in a challenging task. The modelization is even more complex in presence of multivariate data. Since some modelling problems are more natural to think through in functional terms, even if only a finite number of observations is available, treating the data as functional can be useful (Berrendero et al. in Comput Stat Data Anal 55:2619–2634, 2011). Although in Ramsay and Silverman (Functional data analysis, 2nd edn. Springer, New York, 2005) the case of multiva…

Functional principal component analysisStatistics and ProbabilityMultivariate statistics2300GeneralizationDimensionality reductionGeneralized additive modelFunctional data analysisFunctional principal component analysiContext (language use)computer.software_genreMultivariate spatio-temporal dataCovariateP-splineData miningStatistics Probability and UncertaintycomputerSmoothingGeneral Environmental ScienceMathematics
researchProduct

Fuzzy methods for analysing fuzzy production environment

1998

Abstract Very recently, in production management research literature, the necessity to extend production systems analysis techniques, such as queue theory, Mean Value Analysis (MVA) and discrete simulation, to Fuzzy Production Environments, i.e. to those production situations in which data are vague, has emerged. Fuzzy set theory is a powerful tool to model vagueness and, therefore, fuzzy mathematics can be used to extend classical production system analysis techniques. This paper proposes a methodology based on fuzzy relation algebra to extend classical MVA and discrete event simulation.

Fuzzy classificationFuzzy measure theorybusiness.industryGeneral MathematicsFuzzy setcomputer.software_genreDefuzzificationFuzzy logicIndustrial and Manufacturing EngineeringComputer Science ApplicationsControl and Systems EngineeringFuzzy mathematicsFuzzy set operationsFuzzy numberArtificial intelligenceData miningbusinesscomputerSoftwareMathematicsRobotics and Computer-Integrated Manufacturing
researchProduct

Fuzzy Classifier Based on Fuzzy Decision Tree

2007

A popular method for making a fuzzy decision tree for classification is Fuzzy ID3 algorithm. We introduce a new approach that uses cumulative information estimations of initial data. Based on these estimations we propose a new greedy version of fuzzy ID3 algorithm to be used to generate understandable fuzzy classification rules. The goal is to find a sequence of rules that causes near minimal classification costs.

Fuzzy classificationNeuro-fuzzybusiness.industryType-2 fuzzy sets and systemscomputer.software_genreMachine learningDefuzzificationComputingMethodologies_PATTERNRECOGNITIONInformation Fuzzy NetworksFuzzy numberFuzzy set operationsFuzzy associative matrixArtificial intelligenceData miningbusinesscomputerMathematicsEUROCON 2007 - The International Conference on "Computer as a Tool"
researchProduct