Search results for "Pattern"
showing 10 items of 4203 documents
Iterative Cluster Analysis of Protein Interaction Data
2004
Abstract Motivation: Generation of fast tools of hierarchical clustering to be applied when distances among elements of a set are constrained, causing frequent distance ties, as happens in protein interaction data. Results: We present in this work the program UVCLUSTER, that iteratively explores distance datasets using hierarchical clustering. Once the user selects a group of proteins, UVCLUSTER converts the set of primary distances among them (i.e. the minimum number of steps, or interactions, required to connect two proteins) into secondary distances that measure the strength of the connection between each pair of proteins when the interactions for all the proteins in the group are consid…
On the usage of joint diagonalization in multivariate statistics
2022
Scatter matrices generalize the covariance matrix and are useful in many multivariate data analysis methods, including well-known principal component analysis (PCA), which is based on the diagonalization of the covariance matrix. The simultaneous diagonalization of two or more scatter matrices goes beyond PCA and is used more and more often. In this paper, we offer an overview of many methods that are based on a joint diagonalization. These methods range from the unsupervised context with invariant coordinate selection and blind source separation, which includes independent component analysis, to the supervised context with discriminant analysis and sliced inverse regression. They also enco…
A web application for the unspecific detection of differentially expressed DNA regions in strand-specific expression data
2015
Abstract Genomic technologies allow laboratories to produce large-scale data sets, either through the use of next-generation sequencing or microarray platforms. To explore these data sets and obtain maximum value from the data, researchers view their results alongside all the known features of a given reference genome. To study transcriptional changes that occur under a given condition, researchers search for regions of the genome that are differentially expressed between different experimental conditions. In order to identify these regions several algorithms have been developed over the years, along with some bioinformatic platforms that enable their use. However, currently available appli…
Likelihood Inference for Gibbs Processes in the Analysis of Spatial Point Patterns
2001
Plusieurs auteurs ont propose des approximations stochastiques et non-stochastiques au MLE pour les processus de Gibbs utilises pour decrire les interactions entre deux points dans une distribution spatiale de points. Cettes approximations sont necessaires a cause de la difficulte en l'evaluation de la constante qui normalise la f.d.p., Cet article present une comparaison, parmi d'un model de Strauss, des methodes qui utilisent des approximations directes aux MLE et des methodes qui utilisent techniques de Monte Carlo de chaine de Markov. Les techniques de simulation utilisees sont le Gibbs sampler et l'algorithm de Metropolis-Hastings.
The 1970 US Draft Lottery Revisited: A Spatial Analysis
2004
Summary We revise the result of the 1970 selective service draft lottery in the USA following an open question that was suggested by Fienberg in a paper published in Science in 1971. The result of the drawings can be viewed as a particular spatial pattern which can be analysed by using general spatial tools adapted to our context. Approaches for assessing the complete spatial randomness for this spatial process on a finite support are proposed. More specifically, these approaches involve the number of events in a square window and a k(r)-based function used to analyse stationary spatial point processes.
A spatial analysis of Italian unemployment differences
2008
Using spatial econometric models, this paper focuses attention on the spatial structure of provincial unemployment disparities of Italian provinces for the year 2003. On the basis of findings from the economic literature and of the available socio-economic data, various model specifications including supply- and demand-side variables are tested. Further we use ESDA analysis as equivalent to integration analysis on time series; therefore it is applied on each variable, dependent and independent, involved in the statistical model. The suggestions of ESDA lead us to the most adequate statistical model, which estimates indicate that there is a significant degree of neighbouring effect (i.e. pos…
Inhomogeneity and complexity measures for spatial patterns
2002
In this work, we examine two different measures for inhomogeneity and complexity that are derived from non-extensive considerations à la Tsallis. Their performance is then tested on theoretically generated patterns. All measures are found to exhibit a most sensitive behaviour for Sierpinski carpets. The procedures here introduced provide us with new, powerful Tsallis’ tools for analysing the inhomogeneity and complexity of spatial patterns.
Reference Posterior Distributions for Bayesian Inference
1979
Bayesian measures of surprise for outlier detection
2003
From a Bayesian point of view, testing whether an observation is an outlier is usually reduced to a testing problem concerning a parameter of a contaminating distribution. This requires elicitation of both (i) the contaminating distribution that generates the outlier and (ii) prior distributions on its parameters. However, very little information is typically available about how the possible outlier could have been generated. Thus easy, preliminary checks in which these assessments can often be avoided may prove useful. Several such measures of surprise are derived for outlier detection in normal models. Results are applied to several examples. Default Bayes factors, where the contaminating…
Resuming Shapes with Applications
2004
Many image processing tasks need some kind of average of different shapes. Frequently, different shapes obtained from several images have to be summarized. If these shapes can be considered as different realizations of a given random compact set, then the natural summaries are the different mean sets proposed in the literature. In this paper, new mean sets are defined by using the basic transformations of Mathematical Morphology (dilation, erosion, opening and closing). These new definitions can be considered, under some additional assumptions, as particular cases of the distance average of Baddeley and Molchanov. The use of the former and new mean sets as summary descriptors of shapes is i…