Search results for "Data type"
showing 10 items of 1183 documents
Analyzing Temperature Effects on Mortality Within theREnvironment: The Constrained Segmented Distributed Lag Parameterization
2010
Here we present and discuss the R package modTempEff including a set of functions aimed at modelling temperature effects on mortality with time series data. The functions fit a particular log linear model which allows to capture the two main features of mortality- temperature relationships: nonlinearity and distributed lag effect. Penalized splines and segmented regression constitute the core of the modelling framework. We briefly review the model and illustrate the functions throughout a simulated dataset.
Ranking Scientific Journals Via Latent Class Models for Polytomous Item Response Data
2015
Summary We propose a model-based strategy for ranking scientific journals starting from a set of observed bibliometric indicators that represent imperfect measures of the unobserved ‘value’ of a journal. After discretizing the available indicators, we estimate an extended latent class model for polytomous item response data and use the estimated model to cluster journals. We illustrate our approach by using the data from the Italian research evaluation exercise that was carried out for the period 2004–2010, focusing on the set of journals that are considered relevant for the subarea statistics and financial mathematics. Using four bibliometric indicators (IF, IF5, AIS and the h-index), some…
Study Design in Causal Models
2014
The causal assumptions, the study design and the data are the elements required for scientific inference in empirical research. The research is adequately communicated only if all of these elements and their relations are described precisely. Causal models with design describe the study design and the missing-data mechanism together with the causal structure and allow the direct application of causal calculus in the estimation of the causal effects. The flow of the study is visualized by ordering the nodes of the causal diagram in two dimensions by their causal order and the time of the observation. Conclusions on whether a causal or observational relationship can be estimated from the coll…
The conditional censored graphical lasso estimator
2020
© 2020, Springer Science+Business Media, LLC, part of Springer Nature. In many applied fields, such as genomics, different types of data are collected on the same system, and it is not uncommon that some of these datasets are subject to censoring as a result of the measurement technologies used, such as data generated by polymerase chain reactions and flow cytometer. When the overall objective is that of network inference, at possibly different levels of a system, information coming from different sources and/or different steps of the analysis can be integrated into one model with the use of conditional graphical models. In this paper, we develop a doubly penalized inferential procedure for…
Adaptive reference-free compression of sequence quality scores
2014
Motivation: Rapid technological progress in DNA sequencing has stimulated interest in compressing the vast datasets that are now routinely produced. Relatively little attention has been paid to compressing the quality scores that are assigned to each sequence, even though these scores may be harder to compress than the sequences themselves. By aggregating a set of reads into a compressed index, we find that the majority of bases can be predicted from the sequence of bases that are adjacent to them and hence are likely to be less informative for variant calling or other applications. The quality scores for such bases are aggressively compressed, leaving a relatively small number at full reso…
Comparative Evaluation of Community Detection Algorithms: A Topological Approach
2012
International audience; Community detection is one of the most active fields in complex networks analysis, due to its potential value in practical applications. Many works inspired by different paradigms are devoted to the development of algorithmic solutions allowing to reveal the network structure in such cohesive subgroups. Comparative studies reported in the literature usually rely on a performance measure considering the community structure as a partition (Rand Index, Normalized Mutual information, etc.). However, this type of comparison neglects the topological properties of the communities. In this article, we present a comprehensive comparative study of a representative set of commu…
Sharp dimension free quantitative estimates for the Gaussian isoperimetric inequality
2017
We provide a full quantitative version of the Gaussian isoperimetric inequality: the difference between the Gaussian perimeter of a given set and a half-space with the same mass controls the gap between the norms of the corresponding barycenters. In particular, it controls the Gaussian measure of the symmetric difference between the set and the half-space oriented so to have the barycenter in the same direction of the set. Our estimate is independent of the dimension, sharp on the decay rate with respect to the gap and with optimal dependence on the mass.
A geostatistical approach for dynamic life tables: The effect of mortality on remaining lifetime and annuities
2010
Dynamic life tables arise as an alternative to the standard (static) life table, with the aim of incorporating the evolution of mortality over time. The parametric model introduced by Lee and Carter in 1992 for projected mortality rates in the US is one of the most outstanding and has been used a great deal since then. Different versions of the model have been developed but all of them, together with other parametric models, consider the observed mortality rates as independent observations. This is a difficult hypothesis to justify when looking at the graph of the residuals obtained with any of these methods. Methods of adjustment and prediction based on geostatistical techniques which expl…
Noise-induced resistive switching in a memristor based on ZrO2(Y)/Ta2O5 stack
2019
Resistive switching (RS) is studied in a memristor based on a ZrO2(Y)/Ta2O5 stack under a white Gaussian noise voltage signal. We have found that the memristor switches between the low resistance state and the high resistance state in a random telegraphic signal (RTS) mode. The effective potential profile of the memristor shows from two to three local minima and depends on the input noise parameters and the memristor operation. These observations indicate the multiplicative character of the noise on the dynamical behavior of the memristor, that is the noise perceived by the memristor depends on the state of the system and its electrical properties are influenced by the noise signal. The det…
STATIS and DISTATIS: optimum multitable principal component analysis and three way metric multidimensional scaling
2012
STATIS is an extension of principal component analysis PCA tailored to handle multiple data tables that measure sets of variables collected on the same observations, or, alternatively, as in a variant called dual-STATIS, multiple data tables where the same variables are measured on different sets of observations. STATIS proceeds in two steps: First it analyzes the between data table similarity structure and derives from this analysis an optimal set of weights that are used to compute a linear combination of the data tables called the compromise that best represents the information common to the different data tables; Second, the PCA of this compromise gives an optimal map of the observation…