Search results for "Data set"
showing 10 items of 154 documents
SIOPRED performance in a Forecasting Blind Competition
2012
In this paper we present the results obtained by applying our automatic forecasting support system, named SIOPRED, over a data set of time series in a Forecasting Blind Competition. In order to apply our procedure for providing point forecasts it has been necessary to develop an interactive strategy for the choice of the suitable length of the seasonal cycle and the seasonality form for a generalized exponential smoothing method, which have been obtained using SIOPRED. For the choice of those essential characteristics of forecasting methods, also a certain multi-objective formulation which minimizes several measures of fitting is used. Once these specifications are established, the model pa…
Modeling Forest Tree Data Using Sequential Spatial Point Processes
2021
AbstractThe spatial structure of a forest stand is typically modeled by spatial point process models. Motivated by aerial forest inventories and forest dynamics in general, we propose a sequential spatial approach for modeling forest data. Such an approach is better justified than a static point process model in describing the long-term dependence among the spatial location of trees in a forest and the locations of detected trees in aerial forest inventories. Tree size can be used as a surrogate for the unknown tree age when determining the order in which trees have emerged or are observed on an aerial image. Sequential spatial point processes differ from spatial point processes in that the…
On implementation of the Gibbs sampler for estimating the accuracy of multiple diagnostic tests
2010
Implementation of the Gibbs sampler for estimating the accuracy of multiple binary diagnostic tests in one population has been investigated. This method, proposed by Joseph, Gyorkos and Coupal, makes use of a Bayesian approach and is used in the absence of a gold standard to estimate the prevalence, the sensitivity and specificity of medical diagnostic tests. The expressions that allow this method to be implemented for an arbitrary number of tests are given. By using the convergence diagnostics procedure of Raftery and Lewis, the relation between the number of iterations of Gibbs sampling and the precision of the estimated quantiles of the posterior distributions is derived. An example conc…
Bond-based 3D-chiral linear indices: Theory and QSAR applications to central chirality codification
2008
The recently introduced non-stochastic and stochastic bond-based linear indices are been generalized to codify chemical structure information for chiral drugs, making use of a trigonometric 3D-chirality correction factor. These improved modified descriptors are applied to several well-known data sets to validate each one of them. Particularly, Cramer's steroid data set has become a benchmark for the assessment of novel quantitative structure activity relationship methods. This data set has been used by several researchers using 3D-QSAR approaches such as Comparative Molecular Field Analysis, Molecular Quantum Similarity Measures, Comparative Molecular Moment Analysis, E-state, Mapping Prope…
A Dominance Variant Under the Multi-Unidimensional Pairwise-Preference Framework: Model Formulation and Markov Chain Monte Carlo Estimation.
2018
Forced-choice questionnaires have been proposed as a way to control some response biases associated with traditional questionnaire formats (e.g., Likert-type scales). Whereas classical scoring methods have issues of ipsativity, item response theory (IRT) methods have been claimed to accurately account for the latent trait structure of these instruments. In this article, the authors propose the multi-unidimensional pairwise preference two-parameter logistic (MUPP-2PL) model, a variant within Stark, Chernyshenko, and Drasgow’s MUPP framework for items that are assumed to fit a dominance model. They also introduce a Markov Chain Monte Carlo (MCMC) procedure for estimating the model’s paramete…
Deducing the USLE mathematical structure by dimensional analysis and self-similarity theory
2010
The Universal Soil Loss Equation (USLE) was originally deduced by a statistical analysis of a large data set of soil loss measurements. The multiplicative structure of the model has been criticised due to the considerable interdependence between the variables. Using the soil erosion representative variables and the reference condition adopted in the USLE, the aim of this paper was to apply dimensional analysis and self-similarity theory to deduce the functional relationship among the selected variables. The analysis yielded a multiplicative equation, similar to the USLE. Therefore, this study suggested that the USLE has a logical structure with respect to the variables used to simulate the …
Optimization of Complex SVM Kernels Using a Hybrid Algorithm Based on Wasp Behaviour
2010
The aim of this paper is to present a new method for optimization of SVM multiple kernels The kernel substitution can be used to define many other types of learning machines distinct from SVMs We introduced a new hybrid method which uses in the first level an evolutionary algorithm based on wasp behaviour and on the co-mutation operator LR−Mijn and in the second level a SVM algorithm which computes the quality of chromosomes The most important details of our algorithms are presented The testing and validation proves that multiple kernels obtained using our genetic approach are improving the classification accuracy up to 94.12% for the “leukemia” data set.
Urban monitoring using multi-temporal SAR and multi-spectral data
2006
In some key operational domains, the joint use of synthetic aperture radar (SAR) and multi-spectral sensors has shown to be a powerful tool for Earth observation. In this paper, we analyze the potentialities of combining interferometric SAR and multi-spectral data for urban area characterization and monitoring. This study is carried out following a standard multi-source processing chain. First, a pre-processing stage is performed taking into account the underlying physics, geometry, and statistical models for the data from each sensor. Second, two different methodologies, one for supervised and another for unsupervised approaches, are followed to obtain features that optimize the urban rela…
Assessment and inter-comparison of recently developed/reprocessed microwave satellite soil moisture products using ISMN ground-based measurements
2019
Soil moisture (SM) is a key state variable in understanding the climate system through its control on the land surface energy, water budget partitioning, and the carbon cycle. Monitoring SM at regional scale has become possible thanks to microwave remote sensing. In the past two decades, several satellites were launched carrying on board either radiometer (passive) or radar (active) or both sensors in different frequency bands with various spatial and temporal resolutions. Soil moisture algorithms are in rapid development and their improvements/revisions are ongoing. The latest SM retrieval products and versions of products that have been recently released are not yet, to our knowledge, com…
Distributed Data Clustering via Opinion Dynamics
2015
We provide a distributed method to partition a large set of data in clusters, characterized by small in-group and large out-group distances. We assume a wireless sensors network in which each sensor is given a large set of data and the objective is to provide a way to group the sensors in homogeneous clusters by information type. In previous literature, the desired number of clusters must be specified a priori by the user. In our approach, the clusters are constrained to have centroids with a distance at least ε between them and the number of desired clusters is not specified. Although traditional algorithms fail to solve the problem with this constraint, it can help obtain a better cluste…