Search results for "Abstract data type"
showing 10 items of 1140 documents
Study Design in Causal Models
2014
The causal assumptions, the study design and the data are the elements required for scientific inference in empirical research. The research is adequately communicated only if all of these elements and their relations are described precisely. Causal models with design describe the study design and the missing-data mechanism together with the causal structure and allow the direct application of causal calculus in the estimation of the causal effects. The flow of the study is visualized by ordering the nodes of the causal diagram in two dimensions by their causal order and the time of the observation. Conclusions on whether a causal or observational relationship can be estimated from the coll…
Adaptive reference-free compression of sequence quality scores
2014
Motivation: Rapid technological progress in DNA sequencing has stimulated interest in compressing the vast datasets that are now routinely produced. Relatively little attention has been paid to compressing the quality scores that are assigned to each sequence, even though these scores may be harder to compress than the sequences themselves. By aggregating a set of reads into a compressed index, we find that the majority of bases can be predicted from the sequence of bases that are adjacent to them and hence are likely to be less informative for variant calling or other applications. The quality scores for such bases are aggressively compressed, leaving a relatively small number at full reso…
Comparative Evaluation of Community Detection Algorithms: A Topological Approach
2012
International audience; Community detection is one of the most active fields in complex networks analysis, due to its potential value in practical applications. Many works inspired by different paradigms are devoted to the development of algorithmic solutions allowing to reveal the network structure in such cohesive subgroups. Comparative studies reported in the literature usually rely on a performance measure considering the community structure as a partition (Rand Index, Normalized Mutual information, etc.). However, this type of comparison neglects the topological properties of the communities. In this article, we present a comprehensive comparative study of a representative set of commu…
Sharp dimension free quantitative estimates for the Gaussian isoperimetric inequality
2017
We provide a full quantitative version of the Gaussian isoperimetric inequality: the difference between the Gaussian perimeter of a given set and a half-space with the same mass controls the gap between the norms of the corresponding barycenters. In particular, it controls the Gaussian measure of the symmetric difference between the set and the half-space oriented so to have the barycenter in the same direction of the set. Our estimate is independent of the dimension, sharp on the decay rate with respect to the gap and with optimal dependence on the mass.
A geostatistical approach for dynamic life tables: The effect of mortality on remaining lifetime and annuities
2010
Dynamic life tables arise as an alternative to the standard (static) life table, with the aim of incorporating the evolution of mortality over time. The parametric model introduced by Lee and Carter in 1992 for projected mortality rates in the US is one of the most outstanding and has been used a great deal since then. Different versions of the model have been developed but all of them, together with other parametric models, consider the observed mortality rates as independent observations. This is a difficult hypothesis to justify when looking at the graph of the residuals obtained with any of these methods. Methods of adjustment and prediction based on geostatistical techniques which expl…
Noise-induced resistive switching in a memristor based on ZrO2(Y)/Ta2O5 stack
2019
Resistive switching (RS) is studied in a memristor based on a ZrO2(Y)/Ta2O5 stack under a white Gaussian noise voltage signal. We have found that the memristor switches between the low resistance state and the high resistance state in a random telegraphic signal (RTS) mode. The effective potential profile of the memristor shows from two to three local minima and depends on the input noise parameters and the memristor operation. These observations indicate the multiplicative character of the noise on the dynamical behavior of the memristor, that is the noise perceived by the memristor depends on the state of the system and its electrical properties are influenced by the noise signal. The det…
STATIS and DISTATIS: optimum multitable principal component analysis and three way metric multidimensional scaling
2012
STATIS is an extension of principal component analysis PCA tailored to handle multiple data tables that measure sets of variables collected on the same observations, or, alternatively, as in a variant called dual-STATIS, multiple data tables where the same variables are measured on different sets of observations. STATIS proceeds in two steps: First it analyzes the between data table similarity structure and derives from this analysis an optimal set of weights that are used to compute a linear combination of the data tables called the compromise that best represents the information common to the different data tables; Second, the PCA of this compromise gives an optimal map of the observation…
Testing equality of reliability and stability with simple linear constraints in multi-wave, multi-variable models
1998
Data from a longitudinal study on school achievement were used to develop new methods for analysing reliability of measurements and stability of behaviour over a long time interval. The proposed method of analysis makes it possible to test hypotheses about equality constraints on reliability and stability. It is known that the use of negative variances for imaginary latent variables with equality constraints between structural parameters produces standardized variances for endogenous latent variables and quality constraints for coefficients of stability. Reparameterization of random errors in measurement models allows equality constraints to be set for coefficients of cross-sectional and of…
A more efficient second order blind identification method for separation of uncorrelated stationary time series
2016
The classical second order source separation methods use approximate joint diagonalization of autocovariance matrices with several lags to estimate the unmixing matrix. Based on recent asymptotic results, we propose a novel unmixing matrix estimator which selects the best lag set from a finite set of candidate sets specified by the user. The theory is illustrated by a simulation study.
Assessment of the probabilities for evolutionary structural changes in protein folds.
2007
Abstract Motivation: The evolution of protein sequences can be described by a stepwise process, where each step involves changes of a few amino acids. In a similar manner, the evolution of protein folds can be at least partially described by an analogous process, where each step involves comparatively simple changes affecting few secondary structure elements. A number of such evolution steps, justified by biologically confirmed examples, have previously been proposed by other researchers. However, unlike the situation with sequences, as far as we know there have been no attempts to estimate the comparative probabilities for different kinds of such structural changes. Results: We have tried …