Search results for "Statistica"
showing 10 items of 5969 documents
Large scale semi-supervised image segmentation with active queries
2011
A semiautomatic procedure to generate classification maps of remote sensing images is proposed. Starting from a hierarchical unsupervised classification, the algorithm exploits the few available labeled pixels to assign each cluster to the most probable class. For a given amount of labeled pixels, the algorithm returns a classified segmentation map, along with confidence levels of class membership for each pixel. Active learning methods are used to select the most informative samples to increase confidence in the class membership. Experiments on a AVIRIS hyperspectral image confirm the effectiveness of the method, especially when used with active learning query functions and spatial regular…
A Support Vector Domain Description Approach to Supervised Classification of Remote Sensing Images
2007
This paper addresses the problem of supervised classification of remote sensing images in the presence of incomplete (nonexhaustive) training sets. The problem is analyzed according to two different perspectives: 1) description and recognition of a specific land-cover class by using single-class classifiers and 2) solution of multiclass problems with single-class classification techniques. In this framework, we analyze different one-class classifiers and introduce in the remote sensing community the support vector domain description method (SVDD). The SVDD is a kernel-based method that exhibits intrinsic regularization ability and robustness versus low numbers of high-dimensional samples. T…
Automatic regrouping of strata in the goodness-of-fit chi-square test
2019
Pearson’s chi-square test is widely employed in social and health sciences to analyze categorical data and contingency tables. For the test to be valid, the sample size must be large enough to provide a minimum number of expected elements per category. This paper develops functions for regrouping strata automatically no matter where they are located, thus enabling the goodness-of-fit test to be performed within an iterative procedure. The functions are written in Excel VBA (Visual Basic for Applications) and in Mathematica. The usefulness and performance of these functions is illustrated by means of a simulation study and the application to different datasets. Finally, the iterative use of …
Finding condensed descriptions for multi-dimensional data.
1976
Abstract We describe two programs that may be used to find condensed descriptions for data available in a contingency table or in a covariance matrix in the case that these data follow a multinomial or a multivariate normal distribution, respectively. The programs perform a stepwise model search among multiplicative models by computing appropriate likelihood-ratio test statistics.
Continuity correction of pearson’s chi-square test in 2x2 contingency tables: A mini-review on recent development
2022
The Pearson’s chi-square test represents a nonparametric test more used in Biomedicine and Social Sciences, but it introduces an error for 2 x 2 contingency tables, when a discrete probability distribution is approximated with a continuous distribution. The first author to introduce the continuity correction of Pearson’s chi-square test has been Yates F. (1934). Unfortunately, Yates’s correction may tend to overcorrect of p-value, this can implicate an overly conservative result. Therefore many authors have introduced variants Pearson’s chi-square statistic, as alternative continuity correction to Yates’s correction. The goal of this paper is to describe the most recent continuity correctio…
Uncertainty quantification analysis of the biological Gompertz model subject to random fluctuations in all its parameters
2020
[EN] In spite of its simple formulation via a nonlinear differential equation, the Gompertz model has been widely applied to describe the dynamics of biological and biophysical parts of complex systems (growth of living organisms, number of bacteria, volume of infected cells, etc.). Its parameters or coefficients and the initial condition represent biological quantities (usually, rates and number of individual/particles, respectively) whose nature is random rather than deterministic. In this paper, we present a complete uncertainty quantification analysis of the randomized Gomperz model via the computation of an explicit expression to the first probability density function of its solution s…
Efficient simulation of the random-cluster model
2013
The simulation of spin models close to critical points of continuous phase transitions is heavily impeded by the occurrence of critical slowing down. A number of cluster algorithms, usually based on the Fortuin-Kasteleyn representation of the Potts model, and suitable generalizations for continuous-spin models have been used to increase simulation efficiency. The first algorithm making use of this representation, suggested by Sweeny in 1983, has not found widespread adoption due to problems in its implementation. However, it has been recently shown that it is indeed more efficient in reducing critical slowing down than the more well-known algorithm due to Swendsen and Wang. Here, we present…
Representation of Strongly Stationary Stochastic Processes
1993
A generalization of the orthogonality conditions for a stochastic process to represent strongly stationary processes up to a fixed order is presented. The particular case of non-normal delta correlated processes, and the probabilistic characterization of linear systems subjected to strongly stationary stochastic processes are also discussed.
Role of conditional probability in multiscale stationary markovian processes.
2010
The aim of the paper is to understand how the inclusion of more and more time-scales into a stochastic stationary Markovian process affects its conditional probability. To this end, we consider two Gaussian processes: (i) a short-range correlated process with an infinite set of time-scales bounded from below, and (ii) a power-law correlated process with an infinite and unbounded set of time-scales. For these processes we investigate the equal position conditional probability P(x,t|x,0) and the mean First Passage Time T(L). The function P(x,t|x,0) can be considered as a proxy of the persistence, i.e. the fact that when a process reaches a position x then it spends some time around that posit…